
Safety Comparison Method: Deep Aligned Visual Safety Prompt

Date: 9 hours ago

Paper URL: 2506.09353

Deep Aligned Visual Safety Prompt (DAVSP) was proposed by a research team from Tsinghua University in November 2025. The findings were published in the paper "DAVSP: Safety Alignment for Large Vision-Language Models via Deep Aligned Visual Safety Prompt", which has been accepted by AAAI 2026.

DAVSP is a novel safety alignment method for large vision-language models (LVLMs) that strengthens their resistance to malicious queries while preserving their utility on harmless ones. Rather than perturbing the image pixels themselves, the method constructs a trainable padding region around the input image to serve as the visual safety prompt (VSP); this preserves the original visual features and removes the performance bottleneck caused by pixel perturbations. The research also proposes a training strategy called Deep Alignment (DA): building on the observation that LVLMs inherently encode harmfulness in their activation space, the researchers construct a harmful vector that captures the semantic direction in the model's internal representations distinguishing malicious from benign queries.
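The two ideas above can be sketched in a few lines. This is an illustrative sketch only, not the authors' implementation: the function names, the padding width, and the use of a simple difference-of-means for the harmful vector are all assumptions made for clarity.

```python
import numpy as np

def add_visual_safety_prompt(image, pad=16):
    """Surround an image of shape (H, W, C) with a padding border that
    would hold the trainable visual safety prompt (initialized to zero
    here). The interior pixels are copied untouched, so the original
    visual features are preserved -- only the border is optimized."""
    h, w, c = image.shape
    framed = np.zeros((h + 2 * pad, w + 2 * pad, c), dtype=image.dtype)
    framed[pad:h + pad, pad:w + pad, :] = image
    return framed

def harmful_vector(malicious_acts, benign_acts):
    """A direction in activation space separating malicious from benign
    queries, here taken as the normalized difference of class means
    over per-query activation vectors (an assumed construction)."""
    v = malicious_acts.mean(axis=0) - benign_acts.mean(axis=0)
    return v / np.linalg.norm(v)

# Toy usage: a 4x4 RGB image and random stand-in activations.
img = np.ones((4, 4, 3), dtype=np.float32)
framed = add_visual_safety_prompt(img, pad=2)
print(framed.shape)  # (8, 8, 3)

rng = np.random.default_rng(0)
v = harmful_vector(rng.normal(1.0, 0.1, (8, 16)),
                   rng.normal(0.0, 0.1, (8, 16)))
print(float(np.linalg.norm(v)))  # 1.0 (unit direction)
```

In training, the border parameters in `framed` would be the optimized variables, and a projection of internal activations onto `v` could serve as an alignment signal; both details are only hinted at in the summary above.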

DAVSP Overview
