HyperAI

Main

GPU

Console
Studio
Docs
Pricing

Pulse

News

Resources

Papers
Notebooks
Datasets
Wiki

Benchmarks

SOTA
LLM Models
GPU Leaderboard

Community

Events

Utility

About Terms of Service Privacy Policy
English

Command Palette

Search for a command to run...

HyperAI
Papers

Papers

Daily updated cutting-edge AI research papers to help you keep up with the latest AI trends

Build the Future of Artificial Intelligence

About

About Us Support Dataset Help

Products

News Papers Notebooks Datasets Wiki

Links

© HyperAI

GitHub Discord X (formerly Twitter)

HyperAI

Main

GPU

Console
Studio
Docs
Pricing

Pulse

News

Resources

Papers
Notebooks
Datasets
Wiki

Benchmarks

SOTA
LLM Models
GPU Leaderboard

Community

Events

Utility

About Terms of Service Privacy Policy
English

Command Palette

Search for a command to run...

HyperAI
Papers

Papers

Daily updated cutting-edge AI research papers to help you keep up with the latest AI trends

Build the Future of Artificial Intelligence

About

About Us Support Dataset Help

Products

News Papers Notebooks Datasets Wiki

Links

© HyperAI

GitHub Discord X (formerly Twitter)

Triton-distributed: Programming Overlapping Kernels on Distributed AI Systems with the Triton Compiler

Triton-distributed: Programming Overlapping Kernels on Distributed AI Systems with the Triton Compiler

Zheng Size, Wenlei Bao, Qi Hou, et al.

Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders

Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders

Diffusion Model

Shengbang Tong, Boyang Zheng, Ziteng Wang, et al.

BayesianVLA: Bayesian Decomposition of Vision Language Action Models via Latent Action Queries

Multimodal Representation

Shijie Lian, Bin Yu, Xiaopeng Lin, et al.

The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models

Diffusion Model

Zanlin Ni, Shenzhi Wang, Yang Yue, et al.

LLM-in-Sandbox Elicits General Agentic Intelligence

Daixuan Cheng, Shaohan Huang, Yuxian Gu, et al.

HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding

Video Understanding

Video Processing

Haowei Zhang, Shudong Yang, Jinlan Fu, et al.

EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience

Taofeng Xue, Chong Peng, Mianqiu Huang, et al.

HY-MT1.5 Technical Report

Mao Zheng, Zheng Li, Tao Chen, et al.

Scaling Laws for Code: Every Programming Language Matters

Code Generation

Jian Yang, Shawn Guo, Lin Jing, et al.

Qwen3-TTS Technical Report

Audio and Speech Processing

Hangrui Hu, Xinfa Zhu, Ting He, et al.

Small Models, Big Results: Achieving Superior Intent Extraction through Decomposition

Human-Computer Interaction

Danielle Cohen, Yoni Halpern, Noam Kahlon, et al.

FinVault: Benchmarking Financial Agent Safety in Execution-Grounded Environments

Zhi Yang, Runguo Li, Qiqi Qiang, et al.

MMDeepResearch-Bench: A Benchmark for Multimodal Deep Research Agents

Peizhou Huang, Zixuan Zhong, Zhongwei Wan, et al.

DARC: Decoupled Asymmetric Reasoning Curriculum for LLM Evolution

Shengda Fan, Xuyan Ye, Yankai Lin

Rethinking Video Generation Model for the Embodied World

Video Generation

Embodied Intelligence

Yufan Deng, Zilin Pan, Hongyu Zhang, et al.

Paper2Rebuttal: A Multi-Agent Framework for Transparent Author Response Assistance

Retrieval-Augmented Generation

Qianli Ma, Chang Guo, Zhiheng Tian, et al.

Agentic Reasoning for Large Language Models

Tianxin Wei, Ting-Wei Li, Zhining Liu, et al.

PERSONAPLEX: VOICE AND ROLE CONTROL FOR FULL DUPLEX CONVERSATIONALSPEECH MODELS

Audio and Speech Processing

Rajarshi Roy, Jonathan Raiman, Sang-gil Lee, et al.

FlashLabs Chroma 1.0: A Real-Time End-to-End Spoken Dialogue Model with Personalized Voice Cloning

Tanyu Chen, Tairan Chen, Kai Shen, et al.

MemoryRewardBench: Benchmarking Reward Models for Long-Term Memory Management in Large Language Models

Preference Modeling

Zecheng Tang, Baibei Ji, Ruoxi Sun, et al.

OmniTransfer: All-in-one Framework for Spatio-temporal Video Transfer

Video Generation

Pengze Zhang, Yanze Wu, Mengtian Li, et al.

Toward Efficient Agents: Memory, Tool learning, and Planning

Xiaofang Yang, Lijun Li, Heng Zhou, et al.

FutureOmni: Evaluating Future Forecasting from Omni-Modal Context for Multimodal LLMs

Qian Chen, Jinlan Fu, Changsong Li, et al.

Being-H0.5: Scaling Human-Centric Robot Learning for Cross-Embodiment Generalization

Embodied Intelligence

Hao Luo, Ye Wang, Wanpeng Zhang, et al.

Advances and Frontiers of LLM-based Issue Resolution in Software Engineering: A Comprehensive Survey

Caihua Li, Lianghong Guo, Yanlin Wang, et al.

Nemotron-Math: Efficient Long-Context Distillation of Mathematical Reasoning from Multi-Mode Supervision

Supervised Fine-Tuning

Wei Du, Shubham Toshniwal, Branislav Kisacanin, et al.

Building Production-Ready Probes For Gemini

Text Generation

János Kramár, Joshua Engels, Zheng Wang, et al.

LFM2 Technical Report

Retrieval-Augmented Generation

Alexander Amini, Anna Banaszak, Harold Benoit, et al.

CoDance: An Unbind-Rebind Paradigm for Robust Multi-Subject Animation

Shuai Tan, Biao Gong, Ke Ma, et al.

The Assistant Axis: Situating and Stabilizing the Default Persona of Language Models

Supervised Fine-Tuning

Christina Lu, Jack Gallagher, Jonathan Michala, et al.

ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development

Jie Yang, Honglin Guo, Li Ji, et al.

Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge

Yao Tang, Li Dong, Yaru Hao, et al.

Triton-distributed: Programming Overlapping Kernels on Distributed AI Systems with the Triton Compiler

Triton-distributed: Programming Overlapping Kernels on Distributed AI Systems with the Triton Compiler

Zheng Size, Wenlei Bao, Qi Hou, et al.

Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders

Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders

Diffusion Model

Shengbang Tong, Boyang Zheng, Ziteng Wang, et al.

BayesianVLA: Bayesian Decomposition of Vision Language Action Models via Latent Action Queries

Multimodal Representation

Shijie Lian, Bin Yu, Xiaopeng Lin, et al.

The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models

Diffusion Model

Zanlin Ni, Shenzhi Wang, Yang Yue, et al.

LLM-in-Sandbox Elicits General Agentic Intelligence

Daixuan Cheng, Shaohan Huang, Yuxian Gu, et al.

HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding

Video Understanding

Video Processing

Haowei Zhang, Shudong Yang, Jinlan Fu, et al.

EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience

Taofeng Xue, Chong Peng, Mianqiu Huang, et al.

HY-MT1.5 Technical Report

Mao Zheng, Zheng Li, Tao Chen, et al.

Scaling Laws for Code: Every Programming Language Matters

Code Generation

Jian Yang, Shawn Guo, Lin Jing, et al.

Qwen3-TTS Technical Report

Audio and Speech Processing

Hangrui Hu, Xinfa Zhu, Ting He, et al.

Small Models, Big Results: Achieving Superior Intent Extraction through Decomposition

Human-Computer Interaction

Danielle Cohen, Yoni Halpern, Noam Kahlon, et al.

FinVault: Benchmarking Financial Agent Safety in Execution-Grounded Environments

Zhi Yang, Runguo Li, Qiqi Qiang, et al.

MMDeepResearch-Bench: A Benchmark for Multimodal Deep Research Agents

Peizhou Huang, Zixuan Zhong, Zhongwei Wan, et al.

DARC: Decoupled Asymmetric Reasoning Curriculum for LLM Evolution

Shengda Fan, Xuyan Ye, Yankai Lin

Rethinking Video Generation Model for the Embodied World

Video Generation

Embodied Intelligence

Yufan Deng, Zilin Pan, Hongyu Zhang, et al.

Paper2Rebuttal: A Multi-Agent Framework for Transparent Author Response Assistance

Retrieval-Augmented Generation

Qianli Ma, Chang Guo, Zhiheng Tian, et al.

Agentic Reasoning for Large Language Models

Tianxin Wei, Ting-Wei Li, Zhining Liu, et al.

PERSONAPLEX: VOICE AND ROLE CONTROL FOR FULL DUPLEX CONVERSATIONALSPEECH MODELS

Audio and Speech Processing

Rajarshi Roy, Jonathan Raiman, Sang-gil Lee, et al.

FlashLabs Chroma 1.0: A Real-Time End-to-End Spoken Dialogue Model with Personalized Voice Cloning

Tanyu Chen, Tairan Chen, Kai Shen, et al.

MemoryRewardBench: Benchmarking Reward Models for Long-Term Memory Management in Large Language Models

Preference Modeling

Zecheng Tang, Baibei Ji, Ruoxi Sun, et al.

OmniTransfer: All-in-one Framework for Spatio-temporal Video Transfer

Video Generation

Pengze Zhang, Yanze Wu, Mengtian Li, et al.

Toward Efficient Agents: Memory, Tool learning, and Planning

Xiaofang Yang, Lijun Li, Heng Zhou, et al.

FutureOmni: Evaluating Future Forecasting from Omni-Modal Context for Multimodal LLMs

Qian Chen, Jinlan Fu, Changsong Li, et al.

Being-H0.5: Scaling Human-Centric Robot Learning for Cross-Embodiment Generalization

Embodied Intelligence

Hao Luo, Ye Wang, Wanpeng Zhang, et al.

Advances and Frontiers of LLM-based Issue Resolution in Software Engineering: A Comprehensive Survey

Caihua Li, Lianghong Guo, Yanlin Wang, et al.

Nemotron-Math: Efficient Long-Context Distillation of Mathematical Reasoning from Multi-Mode Supervision

Supervised Fine-Tuning

Wei Du, Shubham Toshniwal, Branislav Kisacanin, et al.

Building Production-Ready Probes For Gemini

Text Generation

János Kramár, Joshua Engels, Zheng Wang, et al.

LFM2 Technical Report

Retrieval-Augmented Generation

Alexander Amini, Anna Banaszak, Harold Benoit, et al.

CoDance: An Unbind-Rebind Paradigm for Robust Multi-Subject Animation

Shuai Tan, Biao Gong, Ke Ma, et al.

The Assistant Axis: Situating and Stabilizing the Default Persona of Language Models

Supervised Fine-Tuning

Christina Lu, Jack Gallagher, Jonathan Michala, et al.

ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development

Jie Yang, Honglin Guo, Li Ji, et al.

Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge

Yao Tang, Li Dong, Yaru Hao, et al.

BayesianVLA: Bayesian Decomposition of Vision Language Action Models via Latent Action Queries

The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models

LLM-in-Sandbox Elicits General Agentic Intelligence

HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding

EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience

HY-MT1.5 Technical Report

Scaling Laws for Code: Every Programming Language Matters

Qwen3-TTS Technical Report

Small Models, Big Results: Achieving Superior Intent Extraction through Decomposition

FinVault: Benchmarking Financial Agent Safety in Execution-Grounded Environments

MMDeepResearch-Bench: A Benchmark for Multimodal Deep Research Agents

DARC: Decoupled Asymmetric Reasoning Curriculum for LLM Evolution

Rethinking Video Generation Model for the Embodied World

Paper2Rebuttal: A Multi-Agent Framework for Transparent Author Response Assistance

Agentic Reasoning for Large Language Models

PERSONAPLEX: VOICE AND ROLE CONTROL FOR FULL DUPLEX CONVERSATIONALSPEECH MODELS

FlashLabs Chroma 1.0: A Real-Time End-to-End Spoken Dialogue Model with Personalized Voice Cloning

MemoryRewardBench: Benchmarking Reward Models for Long-Term Memory Management in Large Language Models

OmniTransfer: All-in-one Framework for Spatio-temporal Video Transfer

Toward Efficient Agents: Memory, Tool learning, and Planning

FutureOmni: Evaluating Future Forecasting from Omni-Modal Context for Multimodal LLMs

Being-H0.5: Scaling Human-Centric Robot Learning for Cross-Embodiment Generalization

Advances and Frontiers of LLM-based Issue Resolution in Software Engineering: A Comprehensive Survey

Nemotron-Math: Efficient Long-Context Distillation of Mathematical Reasoning from Multi-Mode Supervision

Building Production-Ready Probes For Gemini

LFM2 Technical Report

CoDance: An Unbind-Rebind Paradigm for Robust Multi-Subject Animation

The Assistant Axis: Situating and Stabilizing the Default Persona of Language Models

ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development

Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge

BayesianVLA: Bayesian Decomposition of Vision Language Action Models via Latent Action Queries

The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models

LLM-in-Sandbox Elicits General Agentic Intelligence

HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding

EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience

HY-MT1.5 Technical Report

Scaling Laws for Code: Every Programming Language Matters

Qwen3-TTS Technical Report

Small Models, Big Results: Achieving Superior Intent Extraction through Decomposition

FinVault: Benchmarking Financial Agent Safety in Execution-Grounded Environments

MMDeepResearch-Bench: A Benchmark for Multimodal Deep Research Agents

DARC: Decoupled Asymmetric Reasoning Curriculum for LLM Evolution

Rethinking Video Generation Model for the Embodied World

Paper2Rebuttal: A Multi-Agent Framework for Transparent Author Response Assistance

Agentic Reasoning for Large Language Models

PERSONAPLEX: VOICE AND ROLE CONTROL FOR FULL DUPLEX CONVERSATIONALSPEECH MODELS

FlashLabs Chroma 1.0: A Real-Time End-to-End Spoken Dialogue Model with Personalized Voice Cloning

MemoryRewardBench: Benchmarking Reward Models for Long-Term Memory Management in Large Language Models

OmniTransfer: All-in-one Framework for Spatio-temporal Video Transfer

Toward Efficient Agents: Memory, Tool learning, and Planning

FutureOmni: Evaluating Future Forecasting from Omni-Modal Context for Multimodal LLMs

Being-H0.5: Scaling Human-Centric Robot Learning for Cross-Embodiment Generalization

Advances and Frontiers of LLM-based Issue Resolution in Software Engineering: A Comprehensive Survey

Nemotron-Math: Efficient Long-Context Distillation of Mathematical Reasoning from Multi-Mode Supervision

Building Production-Ready Probes For Gemini

LFM2 Technical Report

CoDance: An Unbind-Rebind Paradigm for Robust Multi-Subject Animation

The Assistant Axis: Situating and Stabilizing the Default Persona of Language Models

ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development

Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge