HyperAI

Main

GPU

Console
Studio
Docs
Pricing

Pulse

News

Resources

Papers
Notebooks
Datasets
Wiki

Benchmarks

SOTA
LLM Models
GPU Leaderboard

Community

Events

Utility

About Terms of Service Privacy Policy
English

Command Palette

Search for a command to run...

HyperAI
Papers

Papers

Daily updated cutting-edge AI research papers to help you keep up with the latest AI trends

Build the Future of Artificial Intelligence

About

About Us Support Dataset Help

Products

News Papers Notebooks Datasets Wiki

Links

© HyperAI

GitHub Discord X (formerly Twitter)

HyperAI

Main

GPU

Console
Studio
Docs
Pricing

Pulse

News

Resources

Papers
Notebooks
Datasets
Wiki

Benchmarks

SOTA
LLM Models
GPU Leaderboard

Community

Events

Utility

About Terms of Service Privacy Policy
English

Command Palette

Search for a command to run...

HyperAI
Papers

Papers

Daily updated cutting-edge AI research papers to help you keep up with the latest AI trends

Build the Future of Artificial Intelligence

About

About Us Support Dataset Help

Products

News Papers Notebooks Datasets Wiki

Links

© HyperAI

GitHub Discord X (formerly Twitter)

Unlimited OCR Works: Welcome the Era of One-shot Long-horizon Parsing

Unlimited OCR Works: Welcome the Era of One-shot Long-horizon Parsing

Baoding Zhou, Jingyun Wang, Xiaolin Wei, et al.

PlanBench-XL: Evaluating Long-Horizon Planning of LLM Tool-Use Agents in Large-Scale Tool Ecosystems

PlanBench-XL: Evaluating Long-Horizon Planning of LLM Tool-Use Agents in Large-Scale Tool Ecosystems

Jiayu Liu, Qihan Lin, Cheng Qian, et al.

OpenRath: Session-Centered Runtime State for Agent Systems

Fukang Wen, Zhijie Wang, Ruilin Xu

EvoEmbedding: Evolvable Representations for Long-Context Retrieval and Agentic Memory

Retrieval-Augmented Generation

Chang Nie, Chaoyou Fu, Junlan Feng, et al.

Learning from Your Own Mistakes: Constructing Learnable Micro-Reflective Trajectories for Self-Distillation

Reinforcement Learning

Zhilin Huang, Hang Gao, Ziqiang Dong, et al.

World Action Models: A Survey

Video Generation

Qiuhong Shen, Shihua Zhang, Yue Liao, et al.

KaLM-Reranker-V1: Fast but Not Late Interaction for Compressed Document Reranking

Xinping Zhao, Jiaxin Xu, Ziqi Dai, et al.

Rethinking Shrinkage Bias in LLM FP4 Pretraining: Geometric Origin, Systemic Impact, and UFP4 Recipe

Qian Zhao, Kunlong Chen, Changxin Tian, et al.

HydraHead: From Head-Level Functional Heterogeneity to Specialized Attention Hybridization

Zhentao Tan, Wei Chen, Jingyi Shen, et al.

3DCodeBench: Benchmarking Agentic Procedural 3D Modeling Via Code

Code Generation

Yipeng Gao, Lei Shu, Genzhi Ye, et al.

RadImageNet-VQA: A Large-Scale CT and MRI Dataset for Radiologic Visual Question Answering

Medical Imaging

Visual Question Answering

Leo Butsanets, Charles Corbiere, Julien Khlaut, et al.

Training Software Engineering Agents and Verifiers with SWE-Gym

Supervised Fine-Tuning

Jiayi Pan, Xingyao Wang, Graham Neubig, et al.

MAKIEVAL: A Multilingual Automatic WiKIdata-based Framework for Cultural Awareness Evaluation for LLMs

Text Generation

Raoyuan Zhao, Beiduo Chen, Barbara Plank, et al.

GeneralVLA-2: Geometry-Aware Reconstruction and Governed Memory for Robot Planning

3D Machine Vision

Retrieval-Augmented Generation

Haoyu Wang, Guoqing Ma, Zeyu Zhang, et al.

Multi-Turn Reflective Masking Elicits Reasoning in Mask Diffusion Models

Diffusion Model

Text Generation

Yanming Zhang, Yihan Bian, Jingyuan Qi, et al.

BrainG3N: A Dual-Purpose Tokenizer for Controllable 3D Brain MRI Generation

Diffusion Model

Max Van Puyvelde, Ibrahim Gulluk, Wim Van Criekinge, et al.

GateMem: Benchmarking Memory Governance in Multi-Principal Shared-Memory Agents

Zhe Ren, Yibo Yang, Yimeng Chen, et al.

MemSlides: A Hierarchical Memory Driven Agent Framework for Personalized Slide Generation with Multi-turn Local Revision

Ye Jin, Yangyang Xu, Jun Zhu, et al.

PerceptionDLM: Parallel Region Perception with Multimodal Diffusion Language Models

Diffusion Model

Image Captioning

Yueyi Sun, Yuhao Wang, Jason Li, et al.

Code World Models for General Game Playing

Code Generation

Wolfgang Lehrach, Daniel Hennes, Miguel Lázaro-Gredilla, et al.

Beyond Static Leaderboards: Predictive Validity for the Evaluation of LLM Agents

Dhaval C. Patel, Kaoutar El Maghraoui, Shuxin Lin, et al.

S-Agent: Spatial Tool-Use Elicits Reasoning for Spatial Intelligence

Video Understanding

Yalun Dai, Hao Li, Shulin Tian, et al.

Multi-LCB: Extending LiveCodeBench to Multiple Programming Languages

Code Generation

Maria Ivanova, Pavel Zadorozhny, Rodion Levichev, et al.

Playful Agentic Robot Learning

Code Generation

Junyi Zhang, Jiaxin Ge, Hanjun Yoo, et al.

DragMesh-2: Physically Plausible Dexterous Hand-Object Interaction with Articulated Objects

Tianshan Zhang, Yijia Duan, Yanjun Li, et al.

Moebius: 0.2B Lightweight Image Inpainting Framework with 10B-Level Performance

Image Inpainting

Diffusion Model

Kangsheng Duan, Ziyang Xu, Wenyu Liu, et al.

EfficientRollout: System-Aware Self-Speculative Decoding for RL Rollouts

Reinforcement Learning

Minseo Kim, Minjae Lee, Seunghyuk Oh, et al.

Trust the Right Teacher: Quality-Aware Self-Distillation for GUI Grounding

Jingyuan Huang, Zuming Huang, Yucheng Shi, et al.

Reinforcing Dual-Path Reasoning in Spatial Vision Language Models

3D Machine Vision

Yatai Ji, An-Chieh Cheng, Yang Fu, et al.

SAE Interventions are Unreliable: Post-Intervention Recovery of Suppressed Behavior

Mingyue Cui, Linghui Shen, Xingyi Yang

Kairos: A Native World Model Stack for Physical AI

Kairos Team, Fei Wang, Shan You, et al.

Guava: An Effective and Universal Harness for Embodied Manipulation

Embodied Intelligence

Haowen Liu, Xirui Li, Shaoxiong Yao, et al.

Unlimited OCR Works: Welcome the Era of One-shot Long-horizon Parsing

Unlimited OCR Works: Welcome the Era of One-shot Long-horizon Parsing

Baoding Zhou, Jingyun Wang, Xiaolin Wei, et al.

PlanBench-XL: Evaluating Long-Horizon Planning of LLM Tool-Use Agents in Large-Scale Tool Ecosystems

PlanBench-XL: Evaluating Long-Horizon Planning of LLM Tool-Use Agents in Large-Scale Tool Ecosystems

Jiayu Liu, Qihan Lin, Cheng Qian, et al.

OpenRath: Session-Centered Runtime State for Agent Systems

Fukang Wen, Zhijie Wang, Ruilin Xu

EvoEmbedding: Evolvable Representations for Long-Context Retrieval and Agentic Memory

Retrieval-Augmented Generation

Chang Nie, Chaoyou Fu, Junlan Feng, et al.

Learning from Your Own Mistakes: Constructing Learnable Micro-Reflective Trajectories for Self-Distillation

Reinforcement Learning

Zhilin Huang, Hang Gao, Ziqiang Dong, et al.

World Action Models: A Survey

Video Generation

Qiuhong Shen, Shihua Zhang, Yue Liao, et al.

KaLM-Reranker-V1: Fast but Not Late Interaction for Compressed Document Reranking

Xinping Zhao, Jiaxin Xu, Ziqi Dai, et al.

Rethinking Shrinkage Bias in LLM FP4 Pretraining: Geometric Origin, Systemic Impact, and UFP4 Recipe

Qian Zhao, Kunlong Chen, Changxin Tian, et al.

HydraHead: From Head-Level Functional Heterogeneity to Specialized Attention Hybridization

Zhentao Tan, Wei Chen, Jingyi Shen, et al.

3DCodeBench: Benchmarking Agentic Procedural 3D Modeling Via Code

Code Generation

Yipeng Gao, Lei Shu, Genzhi Ye, et al.

RadImageNet-VQA: A Large-Scale CT and MRI Dataset for Radiologic Visual Question Answering

Medical Imaging

Visual Question Answering

Leo Butsanets, Charles Corbiere, Julien Khlaut, et al.

Training Software Engineering Agents and Verifiers with SWE-Gym

Supervised Fine-Tuning

Jiayi Pan, Xingyao Wang, Graham Neubig, et al.

MAKIEVAL: A Multilingual Automatic WiKIdata-based Framework for Cultural Awareness Evaluation for LLMs

Text Generation

Raoyuan Zhao, Beiduo Chen, Barbara Plank, et al.

GeneralVLA-2: Geometry-Aware Reconstruction and Governed Memory for Robot Planning

3D Machine Vision

Retrieval-Augmented Generation

Haoyu Wang, Guoqing Ma, Zeyu Zhang, et al.

Multi-Turn Reflective Masking Elicits Reasoning in Mask Diffusion Models

Diffusion Model

Text Generation

Yanming Zhang, Yihan Bian, Jingyuan Qi, et al.

BrainG3N: A Dual-Purpose Tokenizer for Controllable 3D Brain MRI Generation

Diffusion Model

Max Van Puyvelde, Ibrahim Gulluk, Wim Van Criekinge, et al.

GateMem: Benchmarking Memory Governance in Multi-Principal Shared-Memory Agents

Zhe Ren, Yibo Yang, Yimeng Chen, et al.

MemSlides: A Hierarchical Memory Driven Agent Framework for Personalized Slide Generation with Multi-turn Local Revision

Ye Jin, Yangyang Xu, Jun Zhu, et al.

PerceptionDLM: Parallel Region Perception with Multimodal Diffusion Language Models

Diffusion Model

Image Captioning

Yueyi Sun, Yuhao Wang, Jason Li, et al.

Code World Models for General Game Playing

Code Generation

Wolfgang Lehrach, Daniel Hennes, Miguel Lázaro-Gredilla, et al.

Beyond Static Leaderboards: Predictive Validity for the Evaluation of LLM Agents

Dhaval C. Patel, Kaoutar El Maghraoui, Shuxin Lin, et al.

S-Agent: Spatial Tool-Use Elicits Reasoning for Spatial Intelligence

Video Understanding

Yalun Dai, Hao Li, Shulin Tian, et al.

Multi-LCB: Extending LiveCodeBench to Multiple Programming Languages

Code Generation

Maria Ivanova, Pavel Zadorozhny, Rodion Levichev, et al.

Playful Agentic Robot Learning

Code Generation

Junyi Zhang, Jiaxin Ge, Hanjun Yoo, et al.

DragMesh-2: Physically Plausible Dexterous Hand-Object Interaction with Articulated Objects

Tianshan Zhang, Yijia Duan, Yanjun Li, et al.

Moebius: 0.2B Lightweight Image Inpainting Framework with 10B-Level Performance

Image Inpainting

Diffusion Model

Kangsheng Duan, Ziyang Xu, Wenyu Liu, et al.

EfficientRollout: System-Aware Self-Speculative Decoding for RL Rollouts

Reinforcement Learning

Minseo Kim, Minjae Lee, Seunghyuk Oh, et al.

Trust the Right Teacher: Quality-Aware Self-Distillation for GUI Grounding

Jingyuan Huang, Zuming Huang, Yucheng Shi, et al.

Reinforcing Dual-Path Reasoning in Spatial Vision Language Models

3D Machine Vision

Yatai Ji, An-Chieh Cheng, Yang Fu, et al.

SAE Interventions are Unreliable: Post-Intervention Recovery of Suppressed Behavior

Mingyue Cui, Linghui Shen, Xingyi Yang

Kairos: A Native World Model Stack for Physical AI

Kairos Team, Fei Wang, Shan You, et al.

Guava: An Effective and Universal Harness for Embodied Manipulation

Embodied Intelligence

Haowen Liu, Xirui Li, Shaoxiong Yao, et al.

OpenRath: Session-Centered Runtime State for Agent Systems

EvoEmbedding: Evolvable Representations for Long-Context Retrieval and Agentic Memory

Learning from Your Own Mistakes: Constructing Learnable Micro-Reflective Trajectories for Self-Distillation

World Action Models: A Survey

KaLM-Reranker-V1: Fast but Not Late Interaction for Compressed Document Reranking

Rethinking Shrinkage Bias in LLM FP4 Pretraining: Geometric Origin, Systemic Impact, and UFP4 Recipe

HydraHead: From Head-Level Functional Heterogeneity to Specialized Attention Hybridization

3DCodeBench: Benchmarking Agentic Procedural 3D Modeling Via Code

RadImageNet-VQA: A Large-Scale CT and MRI Dataset for Radiologic Visual Question Answering

Training Software Engineering Agents and Verifiers with SWE-Gym

MAKIEVAL: A Multilingual Automatic WiKIdata-based Framework for Cultural Awareness Evaluation for LLMs

GeneralVLA-2: Geometry-Aware Reconstruction and Governed Memory for Robot Planning

Multi-Turn Reflective Masking Elicits Reasoning in Mask Diffusion Models

BrainG3N: A Dual-Purpose Tokenizer for Controllable 3D Brain MRI Generation

GateMem: Benchmarking Memory Governance in Multi-Principal Shared-Memory Agents

MemSlides: A Hierarchical Memory Driven Agent Framework for Personalized Slide Generation with Multi-turn Local Revision

PerceptionDLM: Parallel Region Perception with Multimodal Diffusion Language Models

Code World Models for General Game Playing

Beyond Static Leaderboards: Predictive Validity for the Evaluation of LLM Agents

S-Agent: Spatial Tool-Use Elicits Reasoning for Spatial Intelligence

Multi-LCB: Extending LiveCodeBench to Multiple Programming Languages

Playful Agentic Robot Learning

DragMesh-2: Physically Plausible Dexterous Hand-Object Interaction with Articulated Objects

Moebius: 0.2B Lightweight Image Inpainting Framework with 10B-Level Performance

EfficientRollout: System-Aware Self-Speculative Decoding for RL Rollouts

Trust the Right Teacher: Quality-Aware Self-Distillation for GUI Grounding

Reinforcing Dual-Path Reasoning in Spatial Vision Language Models

SAE Interventions are Unreliable: Post-Intervention Recovery of Suppressed Behavior

Kairos: A Native World Model Stack for Physical AI

Guava: An Effective and Universal Harness for Embodied Manipulation

OpenRath: Session-Centered Runtime State for Agent Systems

EvoEmbedding: Evolvable Representations for Long-Context Retrieval and Agentic Memory

Learning from Your Own Mistakes: Constructing Learnable Micro-Reflective Trajectories for Self-Distillation

World Action Models: A Survey

KaLM-Reranker-V1: Fast but Not Late Interaction for Compressed Document Reranking

Rethinking Shrinkage Bias in LLM FP4 Pretraining: Geometric Origin, Systemic Impact, and UFP4 Recipe

HydraHead: From Head-Level Functional Heterogeneity to Specialized Attention Hybridization

3DCodeBench: Benchmarking Agentic Procedural 3D Modeling Via Code

RadImageNet-VQA: A Large-Scale CT and MRI Dataset for Radiologic Visual Question Answering

Training Software Engineering Agents and Verifiers with SWE-Gym

MAKIEVAL: A Multilingual Automatic WiKIdata-based Framework for Cultural Awareness Evaluation for LLMs

GeneralVLA-2: Geometry-Aware Reconstruction and Governed Memory for Robot Planning

Multi-Turn Reflective Masking Elicits Reasoning in Mask Diffusion Models

BrainG3N: A Dual-Purpose Tokenizer for Controllable 3D Brain MRI Generation

GateMem: Benchmarking Memory Governance in Multi-Principal Shared-Memory Agents

MemSlides: A Hierarchical Memory Driven Agent Framework for Personalized Slide Generation with Multi-turn Local Revision

PerceptionDLM: Parallel Region Perception with Multimodal Diffusion Language Models

Code World Models for General Game Playing

Beyond Static Leaderboards: Predictive Validity for the Evaluation of LLM Agents

S-Agent: Spatial Tool-Use Elicits Reasoning for Spatial Intelligence

Multi-LCB: Extending LiveCodeBench to Multiple Programming Languages

Playful Agentic Robot Learning

DragMesh-2: Physically Plausible Dexterous Hand-Object Interaction with Articulated Objects

Moebius: 0.2B Lightweight Image Inpainting Framework with 10B-Level Performance

EfficientRollout: System-Aware Self-Speculative Decoding for RL Rollouts

Trust the Right Teacher: Quality-Aware Self-Distillation for GUI Grounding

Reinforcing Dual-Path Reasoning in Spatial Vision Language Models

SAE Interventions are Unreliable: Post-Intervention Recovery of Suppressed Behavior

Kairos: A Native World Model Stack for Physical AI

Guava: An Effective and Universal Harness for Embodied Manipulation