HyperAI

Main

GPU

Console
Studio
Docs
Pricing

Pulse

News

Resources

Papers
Notebooks
Datasets
Wiki

Benchmarks

SOTA
LLM Models
GPU Leaderboard

Community

Events

Utility

About Terms of Service Privacy Policy
English

Papers

Cutting-edge AI research papers, updated daily, to help you keep up with the latest AI trends

Build the Future of Artificial Intelligence

About

About Us Support Dataset Help

Products

News Papers Notebooks Datasets Wiki

Links

© HyperAI

GitHub Discord X (formerly Twitter)

ASPIRE: Agentic /Skills Discovery for Robotics

Runyu Lu, Yubo Wu, Ethan Kou, et al.

AUTOMEM: Automated Learning of Memory as a Cognitive Skill

Shengguang Wu, Hao Zhu, Yuhui Zhang, et al.

The Decode-Work Law: Margin-Governed, Provably-Exact Spatial Joins over Compressed Geometry

Geographic Information

Madhulatha Mandarapu, Sandeep Kunkunuru

Neural Certificate Pricing for Combinatorial Optimization Problems

Jingyi Chen, Xinyuan Zhang, Xinwu Qian

Optimal Resource Utilization for Autonomous Laboratory Orchestrators

Austin McDannald, Julia Tisaranni, Howie Joress

TERA: A Unified Taylor Model Enabled Reachability Analysis Framework

Salma Iraky, Andrew Sogokon

Perceive-to-Reason: Decoupling Perception and Reasoning for Fine-Grained Visual Reasoning

Visual Question Answering

Hongxing Li, Xiufeng Huang, Dingming Li, et al.

Trie-based Experiment Plans for Efficient IR Pipeline Experiments

Irene Anu, Craig Macdonald

On the Nonlinearity of Learning Rate Scaling for LLM Training

Zaiwen Yang, Huaqing Zhang, Jing Xu, et al.

Scenes as Objects, Not Primitives: Instance-Structured 3D Tokenization from Unposed Views

3D Machine Vision

Mijin Yoo, In Cho, Subin Jeon, et al.

BlockPilot: Instance-Adaptive Policy Learning for Diffusion-based Speculative Decoding

Diffusion Model

Hao Zhang, Yiming Hu, Yong Wang, et al.

DOPD: Dual On-policy Distillation

Xinlei Yu, Gen Li, Qingyi Si, et al.

Dockerless: Environment-Free Program Verifier for Coding Agents

Supervised Fine-Tuning

Wenhao Zeng, Yuling Shi, Xiaodong Gu, et al.

Orca: The World is in Your Mind

Text Generation

Orca Team, Yihao Wang, Yuheng Ji, et al.

MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU

Zhengqing Yuan, Hanchi Sun, Lichao Sun, et al.

Finding the Time to Think: Learning Planning Budgets in Real-Time RL

Reinforcement Learning

Aneesh Muppidi, Firas Darwish, Dylan Cope, et al.

What do near-optimal learning rate schedules look like?

Hiroki Naganuma, Atish Agarwala, Priya Kasimbeg, et al.

Beyond IID: How General Are Tabular Foundation Models, Really?

Lennart Purucker, Andrej Tschalzev, Nick Erickson, et al.

ReFreeKV: Towards Threshold-Free KV Cache Compression

Xuanfan Ni, Liyan Xu, Chenyang Lyu, et al.

TUA-Bench: A Benchmark for General-Purpose Terminal-Use Agents

Shoufa Chen, Luyuan Wang, Xuan Yang, et al.

Scaling the Horizon, Not the Parameters: Reaching Trillion-Parameter Performance with a 35B Agent

Supervised Fine-Tuning

Agents-A1 Team, Zongsheng Cao, Bihao Zhan, et al.

LiveEdit: Towards Real-Time Diffusion-Based Streaming Video Editing

Diffusion Model

Video Processing

Xinyu Wang, Chongbo Zhao, Fangneng Zhan, et al.

Agentic Abstention: Do Agents Know When to Stop Instead of Act?

Han Luo, Bingbing Wen, Lucy Lu Wang

EVA-Bench: A New End-to-end Framework for Evaluating Voice Agents

Tara Bogavelli, Gabrielle Gauthier Melançon, Katrina Stankiewicz, et al.

SingGuard: A Policy-Adaptive Multimodal LLM Guardrail with Dynamic Reasoning

SingGuard Team, Yan Hong, Hongcheng Li, et al.

Formalizing Latent Thoughts: Four Axioms of Thought Representation in LLMs

Fahd Seddik, Fatemeh Fard

MultiHashFormer: Hash-based Generative Language Models

Text Generation

Huiyin Xue, Atsuki Yamaguchi, Nikolaos Aletras

Qwen-Image-2.0-RL Technical Report

Diffusion Model

Yixian Xu, Kaiyuan Gao, Yuxiang Chen, et al.

Translation as a Bridging Action: Transferring Manipulation Skills from Humans to Robots

Sijin Chen, Kaixuan Jiang, Haixin Shi, et al.

PhysisForcing: Physics Reinforced World Simulator for Robotic Manipulation

Video Generation

Diffusion Model

Peiwen Zhang, Yufan Deng, Shangkun Sun, et al.

OpenTME: An Open Dataset of AI-powered H&E Tumor Microenvironment Profiles from TCGA

Image Segmentation

Maaike Galama, Nina Kozar-Gillan, Christina Embacher, et al.

FlashAttention-4: Algorithm and Kernel Pipelining Co-Design for Asymmetric Hardware Scaling

Ted Zadouri, Markus Hoehnerbach, Jay Shah, et al.
