
Native Sparse Attention

Native Sparse Attention (NSA) is a natively trainable sparse attention mechanism proposed by DeepSeek, Peking University, and the University of Washington on February 27, 2025. It aims to solve the computational bottleneck of long-sequence modeling by combining algorithmic innovation with hardware-aligned optimization to achieve efficient long-context modeling. The related paper, "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention," won the ACL 2025 Best Paper Award.
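The sketch below is only a simplified illustration of the general idea behind sparse attention: each query attends to a small, dynamically selected subset of key/value blocks rather than to every token. It is not DeepSeek's actual NSA design, which additionally combines compressed-token and sliding-window branches with learned gating and hardware-optimized kernels; the block size and top-k values here are hypothetical.

```python
# Illustrative sketch only: simplified blockwise top-k sparse attention,
# not the NSA kernel itself. block_size and top_k are hypothetical.
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def block_sparse_attention(q, k, v, block_size=64, top_k=4):
    """Each query attends only to the top-k key blocks ranked by a coarse
    block-level score, instead of all keys as in full attention."""
    seq_len, d = k.shape
    n_blocks = seq_len // block_size
    # Coarse block summaries: mean-pooled keys per block.
    k_blocks = k[: n_blocks * block_size].reshape(n_blocks, block_size, d).mean(axis=1)
    out = np.empty_like(q)
    for i, qi in enumerate(q):
        # Rank blocks by the query's similarity to each block summary.
        block_scores = k_blocks @ qi / np.sqrt(d)
        chosen = np.argsort(block_scores)[-top_k:]
        # Gather only the selected blocks' keys/values and attend over them.
        idx = np.concatenate(
            [np.arange(b * block_size, (b + 1) * block_size) for b in chosen]
        )
        attn = softmax(k[idx] @ qi / np.sqrt(d))
        out[i] = attn @ v[idx]
    return out

# Toy usage: 1024 tokens with 64-dim heads; each query touches 4 of 16 blocks.
rng = np.random.default_rng(0)
q, k, v = (rng.standard_normal((1024, 64)).astype(np.float32) for _ in range(3))
print(block_sparse_attention(q, k, v).shape)  # (1024, 64)
```

Because each query reads only `top_k * block_size` keys instead of the full sequence, the cost of the attention step no longer grows with the full context length, which is the source of the speedups reported for long sequences.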

Pre-trained on a 27B-parameter Transformer backbone, NSA matches or exceeds full attention models on general benchmarks, long-context tasks, and reasoning tasks. When processing 64k-length sequences, NSA achieves substantial speedups over full attention in decoding, forward propagation, and backpropagation.