HyperAI

Main

GPU

Console
Studio
Docs
Pricing

Pulse

News

Resources

Papers
Notebooks
Datasets
Wiki

Benchmarks

SOTA
LLM Models
GPU Leaderboard

Community

Events

Utility

About Terms of Service Privacy Policy
English

Command Palette

Search for a command to run...

HyperAI
Papers

Papers

Daily updated cutting-edge AI research papers to help you keep up with the latest AI trends

Build the Future of Artificial Intelligence

About

About Us Support Dataset Help

Products

News Papers Notebooks Datasets Wiki

Links

© HyperAI

GitHub Discord X (formerly Twitter)

HyperAI

Main

GPU

Console
Studio
Docs
Pricing

Pulse

News

Resources

Papers
Notebooks
Datasets
Wiki

Benchmarks

SOTA
LLM Models
GPU Leaderboard

Community

Events

Utility

About Terms of Service Privacy Policy
English

Command Palette

Search for a command to run...

HyperAI
Papers

Papers

Daily updated cutting-edge AI research papers to help you keep up with the latest AI trends

Build the Future of Artificial Intelligence

About

About Us Support Dataset Help

Products

News Papers Notebooks Datasets Wiki

Links

© HyperAI

GitHub Discord X (formerly Twitter)

AOrchestra: Automating Sub-Agent Creation for Agentic Orchestration

AOrchestra: Automating Sub-Agent Creation for Agentic Orchestration

Intelligent Question Answering

Jianhao Ruan, Zhihao Xu, Yiran Peng, et al.

No Global Plan in Chain-of-Thought: Uncover the Latent Planning Horizon of LLMs

No Global Plan in Chain-of-Thought: Uncover the Latent Planning Horizon of LLMs

Liyan Xu, Mo Yu, Fandong Meng, et al.

CodeOCR: On the Effectiveness of Vision Language Models in Code Understanding

Yuling Shi, Chaoxiang Xie, Zhensu Sun, et al.

DeepPlanning: Benchmarking Long-Horizon Agentic Planning with Verifiable Constraints

Yinger Zhang, Shutong Jiang, Renhao Li, et al.

CL-bench: A Benchmark for Context Learning

Intelligent Question Answering

Shihan Dou, Ming Zhang, Zhangyue Yin, et al.

Reinforcement Learning via Self-Distillation

Reinforcement Learning

Retrieval-Augmented Generation

Jonas Hübotter, Frederike Lübeck, Lejs Behric, et al.

Chatbots as social companions: How people perceive consciousness, human likeness, and social health benefits in machines

Human-Computer Interaction

Rose E. Guingrich, Michael S. A. Graziano

POPE: Learning to Reason on Hard Problems via Privileged On-Policy Exploration

Reinforcement Learning

Yuxiao Qu, Amrith Setlur, Virginia Smith, et al.

UniReason 1.0: A Unified Reasoning Framework for World Knowledge Aligned Image Generation and Editing

Dianyi Wang, Chaofan Ma, Feng Han, et al.

Closing the Loop: Universal Repository Representation with RPG-Encoder

Code Generation

Multimodal Representation

Jane Luo, Chengyu Yin, Xin Zhang, et al.

Vision-DeepResearch Benchmark: Rethinking Visual and Textual Search for Multimodal Large Language Models

Visual Question Answering

Yu Zeng, Wenxuan Huang, Zhen Fang, et al.

Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models

Retrieval-Augmented Generation

Visual Question Answering

Wenxuan Huang, Yu Zeng, Qiuchen Wang, et al.

Kimi K2.5: Visual Agentic Intelligence

Multimodal Representation

Kimi Team, Tongtong Bai, Yifan Bai, et al.

Green-VLA: Staged Vision-Language-Action Model for Generalist Robots

I. Apanasevich, M. Artemyev, R. Babakyan, et al.

PaperBanana: Automating Academic Illustration for AI Scientists

Dawei Zhu, Rui Meng, Yale Song, et al.

Semi-Autonomous Mathematics Discovery with Gemini: A Case Study on the Erdős Problems

Tony Feng, Trieu Trinh, Garrett Bingham, et al.

Latent Chain-of-Thought as Planning: Decoupling Reasoning from Verbalization

Jiecong Wang, Hao Peng, Chunyang Liu

Real-Time Aligned Reward Model beyond Semantics

Reinforcement Learning

Zixuan Huang, Xin Xia, Yuxi Ren, et al.

DenseGRPO: From Sparse to Dense Reward for Flow Matching Model Alignment

Diffusion Model

Supervised Fine-Tuning

Haoyou Deng, Keyu Yan, Chaojie Mao, et al.

DreamActor-M2: Universal Character Image Animation via Spatiotemporal In-Context Learning

Video Generation

Mingshuang Luo, Shuang Liang, Zhengkun Rong, et al.

TTCS: Test-Time Curriculum Synthesis for Self-Evolving

Chengyi Yang, Zhishang Xiang, Yunbo Tang, et al.

ASTRA: Automated Synthesis of agentic Trajectories and Reinforcement Arenas

Reinforcement Learning

Xiaoyu Tian, Haotian Wang, Shuaiting Chen, et al.

Self-Distillation Enables Continual Learning

Reinforcement Learning

Supervised Fine-Tuning

Idan Shenfeld, Mehul Damani, Jonas Hübotter, et al.

Towards Execution-Grounded Automated AI Research

Chenglei Si, Zitong Yang, Yejin Choi, et al.

DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation

Embodied Intelligence

Haozhe Xie, Beichen Wen, Jiarui Zheng, et al.

MMFineReason: Closing the Multimodal Reasoning Gap via Open Data-Centric Methods

Honglin Lin, Zheng Liu, Yun Zhu, et al.

OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models

Document Understanding

Yufeng Zhong, Lei Chen, Xuanle Zhao, et al.

Scaling Embeddings Outperforms Scaling Experts in Language Models

Retrieval-Augmented Generation

Hong Liu, Jiaqi Zhang, Chao Wang, et al.

Idea2Story: An Automated Pipeline for Transforming Research Concepts into Complete Scientific Narratives

Tengyue Xu, Zhuoyang Qian, Gaoge Liu, et al.

Everything in Its Place: Benchmarking Spatial Intelligence of Text-to-Image Models

Diffusion Model

Zengbin Wang, Xuecai Hu, Yong Wang, et al.

Qwen3-ASR Technical Report

Audio and Speech Processing

Xian Shi, Xiong Wang, Zhifang Guo, et al.

Insight Agents: An LLM-Based Multi-Agent System for Data Insights

Intelligent Question Answering

Jincheng Bai, Zhenyu Zhang, Jennifer Zhang, et al.

AOrchestra: Automating Sub-Agent Creation for Agentic Orchestration

AOrchestra: Automating Sub-Agent Creation for Agentic Orchestration

Intelligent Question Answering

Jianhao Ruan, Zhihao Xu, Yiran Peng, et al.

No Global Plan in Chain-of-Thought: Uncover the Latent Planning Horizon of LLMs

No Global Plan in Chain-of-Thought: Uncover the Latent Planning Horizon of LLMs

Liyan Xu, Mo Yu, Fandong Meng, et al.

CodeOCR: On the Effectiveness of Vision Language Models in Code Understanding

Yuling Shi, Chaoxiang Xie, Zhensu Sun, et al.

DeepPlanning: Benchmarking Long-Horizon Agentic Planning with Verifiable Constraints

Yinger Zhang, Shutong Jiang, Renhao Li, et al.

CL-bench: A Benchmark for Context Learning

Intelligent Question Answering

Shihan Dou, Ming Zhang, Zhangyue Yin, et al.

Reinforcement Learning via Self-Distillation

Reinforcement Learning

Retrieval-Augmented Generation

Jonas Hübotter, Frederike Lübeck, Lejs Behric, et al.

Chatbots as social companions: How people perceive consciousness, human likeness, and social health benefits in machines

Human-Computer Interaction

Rose E. Guingrich, Michael S. A. Graziano

POPE: Learning to Reason on Hard Problems via Privileged On-Policy Exploration

Reinforcement Learning

Yuxiao Qu, Amrith Setlur, Virginia Smith, et al.

UniReason 1.0: A Unified Reasoning Framework for World Knowledge Aligned Image Generation and Editing

Dianyi Wang, Chaofan Ma, Feng Han, et al.

Closing the Loop: Universal Repository Representation with RPG-Encoder

Code Generation

Multimodal Representation

Jane Luo, Chengyu Yin, Xin Zhang, et al.

Vision-DeepResearch Benchmark: Rethinking Visual and Textual Search for Multimodal Large Language Models

Visual Question Answering

Yu Zeng, Wenxuan Huang, Zhen Fang, et al.

Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models

Retrieval-Augmented Generation

Visual Question Answering

Wenxuan Huang, Yu Zeng, Qiuchen Wang, et al.

Kimi K2.5: Visual Agentic Intelligence

Multimodal Representation

Kimi Team, Tongtong Bai, Yifan Bai, et al.

Green-VLA: Staged Vision-Language-Action Model for Generalist Robots

I. Apanasevich, M. Artemyev, R. Babakyan, et al.

PaperBanana: Automating Academic Illustration for AI Scientists

Dawei Zhu, Rui Meng, Yale Song, et al.

Semi-Autonomous Mathematics Discovery with Gemini: A Case Study on the Erdős Problems

Tony Feng, Trieu Trinh, Garrett Bingham, et al.

Latent Chain-of-Thought as Planning: Decoupling Reasoning from Verbalization

Jiecong Wang, Hao Peng, Chunyang Liu

Real-Time Aligned Reward Model beyond Semantics

Reinforcement Learning

Zixuan Huang, Xin Xia, Yuxi Ren, et al.

DenseGRPO: From Sparse to Dense Reward for Flow Matching Model Alignment

Diffusion Model

Supervised Fine-Tuning

Haoyou Deng, Keyu Yan, Chaojie Mao, et al.

DreamActor-M2: Universal Character Image Animation via Spatiotemporal In-Context Learning

Video Generation

Mingshuang Luo, Shuang Liang, Zhengkun Rong, et al.

TTCS: Test-Time Curriculum Synthesis for Self-Evolving

Chengyi Yang, Zhishang Xiang, Yunbo Tang, et al.

ASTRA: Automated Synthesis of agentic Trajectories and Reinforcement Arenas

Reinforcement Learning

Xiaoyu Tian, Haotian Wang, Shuaiting Chen, et al.

Self-Distillation Enables Continual Learning

Reinforcement Learning

Supervised Fine-Tuning

Idan Shenfeld, Mehul Damani, Jonas Hübotter, et al.

Towards Execution-Grounded Automated AI Research

Chenglei Si, Zitong Yang, Yejin Choi, et al.

DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation

Embodied Intelligence

Haozhe Xie, Beichen Wen, Jiarui Zheng, et al.

MMFineReason: Closing the Multimodal Reasoning Gap via Open Data-Centric Methods

Honglin Lin, Zheng Liu, Yun Zhu, et al.

OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models

Document Understanding

Yufeng Zhong, Lei Chen, Xuanle Zhao, et al.

Scaling Embeddings Outperforms Scaling Experts in Language Models

Retrieval-Augmented Generation

Hong Liu, Jiaqi Zhang, Chao Wang, et al.

Idea2Story: An Automated Pipeline for Transforming Research Concepts into Complete Scientific Narratives

Tengyue Xu, Zhuoyang Qian, Gaoge Liu, et al.

Everything in Its Place: Benchmarking Spatial Intelligence of Text-to-Image Models

Diffusion Model

Zengbin Wang, Xuecai Hu, Yong Wang, et al.

Qwen3-ASR Technical Report

Audio and Speech Processing

Xian Shi, Xiong Wang, Zhifang Guo, et al.

Insight Agents: An LLM-Based Multi-Agent System for Data Insights

Intelligent Question Answering

Jincheng Bai, Zhenyu Zhang, Jennifer Zhang, et al.

CodeOCR: On the Effectiveness of Vision Language Models in Code Understanding

DeepPlanning: Benchmarking Long-Horizon Agentic Planning with Verifiable Constraints

CL-bench: A Benchmark for Context Learning

Reinforcement Learning via Self-Distillation

Chatbots as social companions: How people perceive consciousness, human likeness, and social health benefits in machines

POPE: Learning to Reason on Hard Problems via Privileged On-Policy Exploration

UniReason 1.0: A Unified Reasoning Framework for World Knowledge Aligned Image Generation and Editing

Closing the Loop: Universal Repository Representation with RPG-Encoder

Vision-DeepResearch Benchmark: Rethinking Visual and Textual Search for Multimodal Large Language Models

Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models

Kimi K2.5: Visual Agentic Intelligence

Green-VLA: Staged Vision-Language-Action Model for Generalist Robots

PaperBanana: Automating Academic Illustration for AI Scientists

Semi-Autonomous Mathematics Discovery with Gemini: A Case Study on the Erdős Problems

Latent Chain-of-Thought as Planning: Decoupling Reasoning from Verbalization

Real-Time Aligned Reward Model beyond Semantics

DenseGRPO: From Sparse to Dense Reward for Flow Matching Model Alignment

DreamActor-M2: Universal Character Image Animation via Spatiotemporal In-Context Learning

TTCS: Test-Time Curriculum Synthesis for Self-Evolving

ASTRA: Automated Synthesis of agentic Trajectories and Reinforcement Arenas

Self-Distillation Enables Continual Learning

Towards Execution-Grounded Automated AI Research

DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation

MMFineReason: Closing the Multimodal Reasoning Gap via Open Data-Centric Methods

OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models

Scaling Embeddings Outperforms Scaling Experts in Language Models

Idea2Story: An Automated Pipeline for Transforming Research Concepts into Complete Scientific Narratives

Everything in Its Place: Benchmarking Spatial Intelligence of Text-to-Image Models

Qwen3-ASR Technical Report

Insight Agents: An LLM-Based Multi-Agent System for Data Insights

CodeOCR: On the Effectiveness of Vision Language Models in Code Understanding

DeepPlanning: Benchmarking Long-Horizon Agentic Planning with Verifiable Constraints

CL-bench: A Benchmark for Context Learning

Reinforcement Learning via Self-Distillation

Chatbots as social companions: How people perceive consciousness, human likeness, and social health benefits in machines

POPE: Learning to Reason on Hard Problems via Privileged On-Policy Exploration

UniReason 1.0: A Unified Reasoning Framework for World Knowledge Aligned Image Generation and Editing

Closing the Loop: Universal Repository Representation with RPG-Encoder

Vision-DeepResearch Benchmark: Rethinking Visual and Textual Search for Multimodal Large Language Models

Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models

Kimi K2.5: Visual Agentic Intelligence

Green-VLA: Staged Vision-Language-Action Model for Generalist Robots

PaperBanana: Automating Academic Illustration for AI Scientists

Semi-Autonomous Mathematics Discovery with Gemini: A Case Study on the Erdős Problems

Latent Chain-of-Thought as Planning: Decoupling Reasoning from Verbalization

Real-Time Aligned Reward Model beyond Semantics

DenseGRPO: From Sparse to Dense Reward for Flow Matching Model Alignment

DreamActor-M2: Universal Character Image Animation via Spatiotemporal In-Context Learning

TTCS: Test-Time Curriculum Synthesis for Self-Evolving

ASTRA: Automated Synthesis of agentic Trajectories and Reinforcement Arenas

Self-Distillation Enables Continual Learning

Towards Execution-Grounded Automated AI Research

DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation

MMFineReason: Closing the Multimodal Reasoning Gap via Open Data-Centric Methods

OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models

Scaling Embeddings Outperforms Scaling Experts in Language Models

Idea2Story: An Automated Pipeline for Transforming Research Concepts into Complete Scientific Narratives

Everything in Its Place: Benchmarking Spatial Intelligence of Text-to-Image Models

Qwen3-ASR Technical Report

Insight Agents: An LLM-Based Multi-Agent System for Data Insights