HyperAI

Main

GPU

Console
Docs
Pricing

Pulse

News

Resources

Papers
Notebooks
Datasets
Wiki

Benchmarks

SOTA
LLM Models
GPU Leaderboard

Community

Events

Utility

About Terms of Service Privacy Policy
English

Command Palette

Search for a command to run...

HyperAI
Papers

Papers

Daily updated cutting-edge AI research papers to help you keep up with the latest AI trends

Build the Future of Artificial Intelligence

About

About Us Dataset Help

Products

News Notebooks Datasets Wiki

Links

© HyperAI

GitHub Discord X (formerly Twitter)

HyperAI

Main

GPU

Console
Docs
Pricing

Pulse

News

Resources

Papers
Notebooks
Datasets
Wiki

Benchmarks

SOTA
LLM Models
GPU Leaderboard

Community

Events

Utility

About Terms of Service Privacy Policy
English

Command Palette

Search for a command to run...

HyperAI
Papers

Papers

Daily updated cutting-edge AI research papers to help you keep up with the latest AI trends

Build the Future of Artificial Intelligence

About

About Us Dataset Help

Products

News Notebooks Datasets Wiki

Links

© HyperAI

GitHub Discord X (formerly Twitter)

BagelVLA: Enhancing Long-Horizon Manipulation via Interleaved Vision-Language-Action Generation

BagelVLA: Enhancing Long-Horizon Manipulation via Interleaved Vision-Language-Action Generation

Yucheng Hu, Jianke Zhang, Yuanfei Luo, et al.

THINGS-data, a multimodal collection of large-scale datasets for investigating object representations in human brain and behavior

THINGS-data, a multimodal collection of large-scale datasets for investigating object representations in human brain and behavior

Multimodal Representation

Martin N Hebart Oliver Contier, Lina Teichmann, Adam H Rockter, et al.

Accurate Predictions of Novel Biomolecular Interactions with IsoDDE

Accurate Predictions of Novel Biomolecular Interactions with IsoDDE

Isomorphic Labs Team

SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning

SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning

Reinforcement Learning

Peng Xia, Jianwen Chen, Hanyang Wang, et al.

LLaDA2.1: Speeding Up Text Diffusion via Token Editing

LLaDA2.1: Speeding Up Text Diffusion via Token Editing

Diffusion Model

Tiwei Bie, Maosong Cao, Xiang Cao, et al.

Alleviating Sparse Rewards by Modeling Step-Wise and Long-Term Sampling Effects in Flow-Based GRPO

Alleviating Sparse Rewards by Modeling Step-Wise and Long-Term Sampling Effects in Flow-Based GRPO

Diffusion Model

Image Generation

Yunze Tong, Mushui Liu, Canyu Zhao, et al.

Recurrent-Depth VLA: Implicit Test-Time Compute Scaling of Vision-Language-Action Models via Latent Iterative Reasoning

Recurrent-Depth VLA: Implicit Test-Time Compute Scaling of Vision-Language-Action Models via Latent Iterative Reasoning

Yalcin Tur, Jalal Naghiyev, Haoquan Fang, et al.

QuantaAlpha: An Evolutionary Framework for LLM-Driven Alpha Mining

QuantaAlpha: An Evolutionary Framework for LLM-Driven Alpha Mining

Jun Han, Shuo Zhang, Wei Li, et al.

Modality Gap-Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models

Modality Gap-Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models

Multimodal Representation

Xiaomin Yu, Yi Xin, Wenjie Zhang, et al.

MOVA: Towards Scalable and Synchronized Video-Audio Generation

MOVA: Towards Scalable and Synchronized Video-Audio Generation

Video Generation

SII-OpenMOSS Team, Donghua Yu, Mingshu Chen, et al.

MemoryLLM: Plug-n-Play Interpretable Feed-Forward Memory for Transformers

MemoryLLM: Plug-n-Play Interpretable Feed-Forward Memory for Transformers

Ajay Jaiswal, Lauren Hannah, Han-Byul Kim, et al.

DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos

DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos

Video Understanding

Shenyuan Gao, William Liang, Kaiyuan Zheng, et al.

F-GRPO: Don't Let Your Policy Learn the Obvious and Forget the Rare

F-GRPO: Don't Let Your Policy Learn the Obvious and Forget the Rare

Reinforcement Learning

Daniil Plyusov, Alexey Gorbatovski, Boris Shaposhnikov, et al.

MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration

MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration

Lianhai Ren, Yucheng Ding, Xiao Liu, et al.

AudioSAE: Towards Understanding of Audio-Processing Models with Sparse AutoEncoders

AudioSAE: Towards Understanding of Audio-Processing Models with Sparse AutoEncoders

Audio and Speech Processing

Georgii Aparin, Tasnima Sadekova, Alexey Rukhovich, et al.

On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models

On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models

Reinforcement Learning

Shumin Wang, Yuexiang Xie, Wenhao Zhang, et al.

OdysseyArena: Benchmarking Large Language Models For Long-Horizon, Active and Inductive Interactions

OdysseyArena: Benchmarking Large Language Models For Long-Horizon, Active and Inductive Interactions

Fangzhi Xu, Hang Yan, Qiushi Sun, et al.

Baichuan-M3: Modeling Clinical Inquiry for Reliable Medical Decision-Making

Baichuan-M3: Modeling Clinical Inquiry for Reliable Medical Decision-Making

Baichuan-M3 Team, Chengfeng Dou, Fan Yang, et al.

Generative Modeling via Drifting

Generative Modeling via Drifting

Diffusion Model

Image Generation

Mingyang Deng, He Li, Tianhong Li, Kaiming He

AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models

AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models

Text Generation

Junfeng Fang, Houcheng Jiang, Kun Wang, et al.

Learning to Reason in 13 Parameters

Learning to Reason in 13 Parameters

Intelligent Question Answering

John X. Morris, Niloofar Mireshghallah, Mark Ibrahim, et al.

DFlash: Block Diffusion for Flash Speculative Decoding

DFlash: Block Diffusion for Flash Speculative Decoding

Diffusion Model

Jian Chen, Yesheng Liang, Zhijian Liu

Context Forcing: Consistent Autoregressive Video Generation with Long Context

Context Forcing: Consistent Autoregressive Video Generation with Long Context

Video Generation

Diffusion Model

Shuo Chen, Cong Wei, Sun Sun, et al.

MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents

MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents

Haozhen Zhang, Quanyu Long, Jianzhu Bao, et al.

Length-Unbiased Sequence Policy Optimization: Revealing and Controlling Response Length Variation in RLVR

Length-Unbiased Sequence Policy Optimization: Revealing and Controlling Response Length Variation in RLVR

Reinforcement Learning

Fanfan Liu, Youyang Yin, Peng Shi, et al.

Spider-Sense: Intrinsic Risk Sensing for Efficient Agent Defense with Hierarchical Adaptive Screening

Spider-Sense: Intrinsic Risk Sensing for Efficient Agent Defense with Hierarchical Adaptive Screening

Zhenxiong Yu, Zhi Yang, Zhiheng Jin, et al.

CAR-bench: Evaluating the Consistency and Limit-Awareness of LLM Agents under Real-World Uncertainty

CAR-bench: Evaluating the Consistency and Limit-Awareness of LLM Agents under Real-World Uncertainty

Johannes Kirmayr, Lukas Stappen, Elisabeth André

WeDLM: Reconciling Diffusion Language Models with Standard Causal Attention for Fast Inference

WeDLM: Reconciling Diffusion Language Models with Standard Causal Attention for Fast Inference

Diffusion Model

Aiwei Liu, Minghua He, Shaoxun Zeng, et al.

Fun-ASR Technical Report

Fun-ASR Technical Report

Audio Recognition

Keyu An, Yanni Chen, Zhigao Chen, et al.

Accelerating Scientific Research with Gemini: Case Studies and Common Techniques

Accelerating Scientific Research with Gemini: Case Studies and Common Techniques

David P. Woodruff, Vincent Cohen-Addad, Lalit Jain, et al.

Scaling Small Agents Through Strategy Auctions

Scaling Small Agents Through Strategy Auctions

Lisa Alazraki, William F. Shen, Yoram Bachrach, et al.

Vibe AIGC: A New Paradigm for Content Generation via Agentic Orchestration

Vibe AIGC: A New Paradigm for Content Generation via Agentic Orchestration

Jiaheng Liu, Yuanxing Zhang, Shihao Li, et al.

BagelVLA: Enhancing Long-Horizon Manipulation via Interleaved Vision-Language-Action Generation

BagelVLA: Enhancing Long-Horizon Manipulation via Interleaved Vision-Language-Action Generation

Yucheng Hu, Jianke Zhang, Yuanfei Luo, et al.

THINGS-data, a multimodal collection of large-scale datasets for investigating object representations in human brain and behavior

THINGS-data, a multimodal collection of large-scale datasets for investigating object representations in human brain and behavior

Multimodal Representation

Martin N Hebart Oliver Contier, Lina Teichmann, Adam H Rockter, et al.

Accurate Predictions of Novel Biomolecular Interactions with IsoDDE

Accurate Predictions of Novel Biomolecular Interactions with IsoDDE

Isomorphic Labs Team

SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning

SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning

Reinforcement Learning

Peng Xia, Jianwen Chen, Hanyang Wang, et al.

LLaDA2.1: Speeding Up Text Diffusion via Token Editing

LLaDA2.1: Speeding Up Text Diffusion via Token Editing

Diffusion Model

Tiwei Bie, Maosong Cao, Xiang Cao, et al.

Alleviating Sparse Rewards by Modeling Step-Wise and Long-Term Sampling Effects in Flow-Based GRPO

Alleviating Sparse Rewards by Modeling Step-Wise and Long-Term Sampling Effects in Flow-Based GRPO

Diffusion Model

Image Generation

Yunze Tong, Mushui Liu, Canyu Zhao, et al.

Recurrent-Depth VLA: Implicit Test-Time Compute Scaling of Vision-Language-Action Models via Latent Iterative Reasoning

Recurrent-Depth VLA: Implicit Test-Time Compute Scaling of Vision-Language-Action Models via Latent Iterative Reasoning

Yalcin Tur, Jalal Naghiyev, Haoquan Fang, et al.

QuantaAlpha: An Evolutionary Framework for LLM-Driven Alpha Mining

QuantaAlpha: An Evolutionary Framework for LLM-Driven Alpha Mining

Jun Han, Shuo Zhang, Wei Li, et al.

Modality Gap-Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models

Modality Gap-Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models

Multimodal Representation

Xiaomin Yu, Yi Xin, Wenjie Zhang, et al.

MOVA: Towards Scalable and Synchronized Video-Audio Generation

MOVA: Towards Scalable and Synchronized Video-Audio Generation

Video Generation

SII-OpenMOSS Team, Donghua Yu, Mingshu Chen, et al.

MemoryLLM: Plug-n-Play Interpretable Feed-Forward Memory for Transformers

MemoryLLM: Plug-n-Play Interpretable Feed-Forward Memory for Transformers

Ajay Jaiswal, Lauren Hannah, Han-Byul Kim, et al.

DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos

DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos

Video Understanding

Shenyuan Gao, William Liang, Kaiyuan Zheng, et al.

F-GRPO: Don't Let Your Policy Learn the Obvious and Forget the Rare

F-GRPO: Don't Let Your Policy Learn the Obvious and Forget the Rare

Reinforcement Learning

Daniil Plyusov, Alexey Gorbatovski, Boris Shaposhnikov, et al.

MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration

MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration

Lianhai Ren, Yucheng Ding, Xiao Liu, et al.

AudioSAE: Towards Understanding of Audio-Processing Models with Sparse AutoEncoders

AudioSAE: Towards Understanding of Audio-Processing Models with Sparse AutoEncoders

Audio and Speech Processing

Georgii Aparin, Tasnima Sadekova, Alexey Rukhovich, et al.

On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models

On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models

Reinforcement Learning

Shumin Wang, Yuexiang Xie, Wenhao Zhang, et al.

OdysseyArena: Benchmarking Large Language Models For Long-Horizon, Active and Inductive Interactions

OdysseyArena: Benchmarking Large Language Models For Long-Horizon, Active and Inductive Interactions

Fangzhi Xu, Hang Yan, Qiushi Sun, et al.

Baichuan-M3: Modeling Clinical Inquiry for Reliable Medical Decision-Making

Baichuan-M3: Modeling Clinical Inquiry for Reliable Medical Decision-Making

Baichuan-M3 Team, Chengfeng Dou, Fan Yang, et al.

Generative Modeling via Drifting

Generative Modeling via Drifting

Diffusion Model

Image Generation

Mingyang Deng, He Li, Tianhong Li, Kaiming He

AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models

AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models

Text Generation

Junfeng Fang, Houcheng Jiang, Kun Wang, et al.

Learning to Reason in 13 Parameters

Learning to Reason in 13 Parameters

Intelligent Question Answering

John X. Morris, Niloofar Mireshghallah, Mark Ibrahim, et al.

DFlash: Block Diffusion for Flash Speculative Decoding

DFlash: Block Diffusion for Flash Speculative Decoding

Diffusion Model

Jian Chen, Yesheng Liang, Zhijian Liu

Context Forcing: Consistent Autoregressive Video Generation with Long Context

Context Forcing: Consistent Autoregressive Video Generation with Long Context

Video Generation

Diffusion Model

Shuo Chen, Cong Wei, Sun Sun, et al.

MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents

MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents

Haozhen Zhang, Quanyu Long, Jianzhu Bao, et al.

Length-Unbiased Sequence Policy Optimization: Revealing and Controlling Response Length Variation in RLVR

Length-Unbiased Sequence Policy Optimization: Revealing and Controlling Response Length Variation in RLVR

Reinforcement Learning

Fanfan Liu, Youyang Yin, Peng Shi, et al.

Spider-Sense: Intrinsic Risk Sensing for Efficient Agent Defense with Hierarchical Adaptive Screening

Spider-Sense: Intrinsic Risk Sensing for Efficient Agent Defense with Hierarchical Adaptive Screening

Zhenxiong Yu, Zhi Yang, Zhiheng Jin, et al.

CAR-bench: Evaluating the Consistency and Limit-Awareness of LLM Agents under Real-World Uncertainty

CAR-bench: Evaluating the Consistency and Limit-Awareness of LLM Agents under Real-World Uncertainty

Johannes Kirmayr, Lukas Stappen, Elisabeth André

WeDLM: Reconciling Diffusion Language Models with Standard Causal Attention for Fast Inference

WeDLM: Reconciling Diffusion Language Models with Standard Causal Attention for Fast Inference

Diffusion Model

Aiwei Liu, Minghua He, Shaoxun Zeng, et al.

Fun-ASR Technical Report

Fun-ASR Technical Report

Audio Recognition

Keyu An, Yanni Chen, Zhigao Chen, et al.

Accelerating Scientific Research with Gemini: Case Studies and Common Techniques

Accelerating Scientific Research with Gemini: Case Studies and Common Techniques

David P. Woodruff, Vincent Cohen-Addad, Lalit Jain, et al.

Scaling Small Agents Through Strategy Auctions

Scaling Small Agents Through Strategy Auctions

Lisa Alazraki, William F. Shen, Yoram Bachrach, et al.

Vibe AIGC: A New Paradigm for Content Generation via Agentic Orchestration

Vibe AIGC: A New Paradigm for Content Generation via Agentic Orchestration

Jiaheng Liu, Yuanxing Zhang, Shihao Li, et al.