HyperAI

Main

GPU

Console
Studio
Docs
Pricing

Pulse

News

Resources

Papers
Notebooks
Datasets
Wiki

Benchmarks

SOTA
LLM Models
GPU Leaderboard

Community

Events

Utility

About Terms of Service Privacy Policy
English

Command Palette

Search for a command to run...

HyperAI
Papers

Papers

Daily updated cutting-edge AI research papers to help you keep up with the latest AI trends

Build the Future of Artificial Intelligence

About

About Us Support Dataset Help

Products

News Papers Notebooks Datasets Wiki

Links

© HyperAI

GitHub Discord X (formerly Twitter)

HyperAI

Main

GPU

Console
Studio
Docs
Pricing

Pulse

News

Resources

Papers
Notebooks
Datasets
Wiki

Benchmarks

SOTA
LLM Models
GPU Leaderboard

Community

Events

Utility

About Terms of Service Privacy Policy
English

Command Palette

Search for a command to run...

HyperAI
Papers

Papers

Daily updated cutting-edge AI research papers to help you keep up with the latest AI trends

Build the Future of Artificial Intelligence

About

About Us Support Dataset Help

Products

News Papers Notebooks Datasets Wiki

Links

© HyperAI

GitHub Discord X (formerly Twitter)

Generative AI Enables Structural Brain Network Construction from fMRI via Symmetric Diffusion Learning

Generative AI Enables Structural Brain Network Construction from fMRI via Symmetric Diffusion Learning

Diffusion Model

Medical Imaging

Qiankun Zuo, Bangjun Lei, Wanyu Qiu, et al.

Early Exiting Predictive Coding Neural Networks for Edge AI

Early Exiting Predictive Coding Neural Networks for Edge AI

Image Classification

Alaa Zniber, Mounir Ghogho, Ouassim Karrakchou, et al.

Quadratic Gradient: A Unified Framework Bridging Gradient Descent and Newton-Type Methods by Synthesizing Hessians and Gradients

The capacity region of classes of product broadcast channels

Yanlin Geng, Amin Gohari, Chandra Nair, et al.

Colon-Bench: An Agentic Workflow for Scalable Dense Lesion Annotation in Full-Procedure Colonoscopy Videos

Medical Imaging

Visual Question Answering

Abdullah Hamdi, Changchun Yang, Xin Gao

TOOLACE: WINNING THE POINTS OF LLM FUNCTION CALLING

Supervised Fine-Tuning

Weiwen Liu, Xu Huang, Xingshan Zeng, et al.

LightMover: Generative Light Movement with Color and Intensity Controls

Diffusion Model

Gengze Zhou, Tianyu Wang, Soo Ye Kim, et al.

Autonomous overtaking trajectory optimization using reinforcement learning and opponent pose estimation

Autonomous Driving

Reinforcement Learning

Matej Rene Cihlar, Luka Šiktar, Branimir Ćaran, et al.

Make It Up: Fake Images, Real Gains in Generalized Few-shot Semantic Segmentation

Diffusion Model

Semantic Segmentation

Guohuan Xie, Xin He, Dingying Fan, et al.

Two-Stage Acoustic Adaptation with Gated Cross-Attention Adapters for LLM-Based Multi-Talker Speech Recognition

Audio Recognition

Hao Shi, Yuan Gao, Xugang Lu, et al.

A Comparative Study in Surgical AI: Datasets, Foundation Models, and Barriers to Med-AGI

Medical Imaging

Kirill Skobelev, Eric Fithian, Yegor Baranovski, et al.

Text Data Integration

Natural Language Processing

Md Ataur Rahman, Dimitris Sacharidis, Oscar Romero, et al.

Unified Number-Free Text-to-Motion Generation Via Flow Matching

Diffusion Model

Guanhe Huang, Oya Celiktutan

SEAR: Schema-Based Evaluation and Routing for LLM Gateways

Text Generation

Zecheng Zhang, Han Zheng, Yue Xu

On-the-fly Repulsion in the Contextual Space for Rich Diversity in Diffusion Transformers

Diffusion Model

Omer Dahary, Benaya Koren, Daniel Garibi, et al.

EpochX: Building the Infrastructure for an Emergent Agent Civilization

Huacan Wang, Chaofa Yuan, Xialie Zhuang, et al.

TAPS: Task Aware Proposal Distributions for Speculative Sampling

Text Generation

Mohamad Zbib, Mohamad Bazzi, Ammar Mohanna, et al.

LongTail Driving Scenarios with Reasoning Traces: The KITScenes LongTail Dataset

Autonomous Driving

Royden Wagner, Omer Sahin Tas, Jaime Villa, et al.

RealChart2Code: Advancing Chart-to-Code Generation with Real Data and Multi-Task Evaluation

Code Generation

Jiajun Zhang, Yuying Li, Zhixun Li, et al.

Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills

Jingwei Ni, Yihao Liu, Xinpeng Liu, et al.

PackForcing: Short Video Training Suffices for Long Video Sampling and Long Context Inference

Video Generation

Diffusion Model

Xiaofeng Mao, Shaohao Rui, Kaining Ying, et al.

ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling

Video Generation

Yawen Luo, Xiaoyu Shi, Junhao Zhuang, et al.

Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models

Video Generation

Object Tracking

Kaijin Chen, Dingkang Liang, Xin Zhou, et al.

BeSafe-Bench: Unveiling Behavioral Safety Risks of Situated Agents in Functional Environments

Yuxuan Li, Yi Lin, Peng Wang, et al.

World Reasoning Arena

Qiyue Gao, Kun Zhou, Jiannan Xiang, et al.

MSA: Memory Sparse Attention for Efficient End-to-End Memory Model Scaling to 100M Tokens

Retrieval-Augmented Generation

Yu Chen, Runkai Chen, Sheng Yi, et al.

Voxtral TTS

Alexander H. Liu, Alexis Tacnet, Andy Ehrenberg, et al.

RealRestorer: Towards Generalizable Real-World Image Restoration with Large-Scale Image Editing Models

Diffusion Model

Yufeng Yang, Xianfang Zeng, Zhangqi Jiang, et al.

Calibri: Enhancing Diffusion Transformers via Parameter-Efficient Calibration

Diffusion Model

Danil Tokhchukov, Aysel Mirzoeva, Andrey Kuznetsov, et al.

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Yicheng Zou, Dongsheng Zhu, Lin Zhu, et al.

PixelSmile: Toward Fine-Grained Facial Expression Editing

Diffusion Model

Jiabin Hua, Hengyuan Xu, Aojie Li, et al.

Claudini: Autoresearch Discovers State-of-the-Art Adversarial Attack Algorithms for LLMs

Alexander Panfilov, Peter Romov, Igor Shilov, et al.

Generative AI Enables Structural Brain Network Construction from fMRI via Symmetric Diffusion Learning

Generative AI Enables Structural Brain Network Construction from fMRI via Symmetric Diffusion Learning

Diffusion Model

Medical Imaging

Qiankun Zuo, Bangjun Lei, Wanyu Qiu, et al.

Early Exiting Predictive Coding Neural Networks for Edge AI

Early Exiting Predictive Coding Neural Networks for Edge AI

Image Classification

Alaa Zniber, Mounir Ghogho, Ouassim Karrakchou, et al.

Quadratic Gradient: A Unified Framework Bridging Gradient Descent and Newton-Type Methods by Synthesizing Hessians and Gradients

The capacity region of classes of product broadcast channels

Yanlin Geng, Amin Gohari, Chandra Nair, et al.

Colon-Bench: An Agentic Workflow for Scalable Dense Lesion Annotation in Full-Procedure Colonoscopy Videos

Medical Imaging

Visual Question Answering

Abdullah Hamdi, Changchun Yang, Xin Gao

TOOLACE: WINNING THE POINTS OF LLM FUNCTION CALLING

Supervised Fine-Tuning

Weiwen Liu, Xu Huang, Xingshan Zeng, et al.

LightMover: Generative Light Movement with Color and Intensity Controls

Diffusion Model

Gengze Zhou, Tianyu Wang, Soo Ye Kim, et al.

Autonomous overtaking trajectory optimization using reinforcement learning and opponent pose estimation

Autonomous Driving

Reinforcement Learning

Matej Rene Cihlar, Luka Šiktar, Branimir Ćaran, et al.

Make It Up: Fake Images, Real Gains in Generalized Few-shot Semantic Segmentation

Diffusion Model

Semantic Segmentation

Guohuan Xie, Xin He, Dingying Fan, et al.

Two-Stage Acoustic Adaptation with Gated Cross-Attention Adapters for LLM-Based Multi-Talker Speech Recognition

Audio Recognition

Hao Shi, Yuan Gao, Xugang Lu, et al.

A Comparative Study in Surgical AI: Datasets, Foundation Models, and Barriers to Med-AGI

Medical Imaging

Kirill Skobelev, Eric Fithian, Yegor Baranovski, et al.

Text Data Integration

Natural Language Processing

Md Ataur Rahman, Dimitris Sacharidis, Oscar Romero, et al.

Unified Number-Free Text-to-Motion Generation Via Flow Matching

Diffusion Model

Guanhe Huang, Oya Celiktutan

SEAR: Schema-Based Evaluation and Routing for LLM Gateways

Text Generation

Zecheng Zhang, Han Zheng, Yue Xu

On-the-fly Repulsion in the Contextual Space for Rich Diversity in Diffusion Transformers

Diffusion Model

Omer Dahary, Benaya Koren, Daniel Garibi, et al.

EpochX: Building the Infrastructure for an Emergent Agent Civilization

Huacan Wang, Chaofa Yuan, Xialie Zhuang, et al.

TAPS: Task Aware Proposal Distributions for Speculative Sampling

Text Generation

Mohamad Zbib, Mohamad Bazzi, Ammar Mohanna, et al.

LongTail Driving Scenarios with Reasoning Traces: The KITScenes LongTail Dataset

Autonomous Driving

Royden Wagner, Omer Sahin Tas, Jaime Villa, et al.

RealChart2Code: Advancing Chart-to-Code Generation with Real Data and Multi-Task Evaluation

Code Generation

Jiajun Zhang, Yuying Li, Zhixun Li, et al.

Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills

Jingwei Ni, Yihao Liu, Xinpeng Liu, et al.

PackForcing: Short Video Training Suffices for Long Video Sampling and Long Context Inference

Video Generation

Diffusion Model

Xiaofeng Mao, Shaohao Rui, Kaining Ying, et al.

ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling

Video Generation

Yawen Luo, Xiaoyu Shi, Junhao Zhuang, et al.

Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models

Video Generation

Object Tracking

Kaijin Chen, Dingkang Liang, Xin Zhou, et al.

BeSafe-Bench: Unveiling Behavioral Safety Risks of Situated Agents in Functional Environments

Yuxuan Li, Yi Lin, Peng Wang, et al.

World Reasoning Arena

Qiyue Gao, Kun Zhou, Jiannan Xiang, et al.

MSA: Memory Sparse Attention for Efficient End-to-End Memory Model Scaling to 100M Tokens

Retrieval-Augmented Generation

Yu Chen, Runkai Chen, Sheng Yi, et al.

Voxtral TTS

Alexander H. Liu, Alexis Tacnet, Andy Ehrenberg, et al.

RealRestorer: Towards Generalizable Real-World Image Restoration with Large-Scale Image Editing Models

Diffusion Model

Yufeng Yang, Xianfang Zeng, Zhangqi Jiang, et al.

Calibri: Enhancing Diffusion Transformers via Parameter-Efficient Calibration

Diffusion Model

Danil Tokhchukov, Aysel Mirzoeva, Andrey Kuznetsov, et al.

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Yicheng Zou, Dongsheng Zhu, Lin Zhu, et al.

PixelSmile: Toward Fine-Grained Facial Expression Editing

Diffusion Model

Jiabin Hua, Hengyuan Xu, Aojie Li, et al.

Claudini: Autoresearch Discovers State-of-the-Art Adversarial Attack Algorithms for LLMs

Alexander Panfilov, Peter Romov, Igor Shilov, et al.

Quadratic Gradient: A Unified Framework Bridging Gradient Descent and Newton-Type Methods by Synthesizing Hessians and Gradients

The capacity region of classes of product broadcast channels

Colon-Bench: An Agentic Workflow for Scalable Dense Lesion Annotation in Full-Procedure Colonoscopy Videos

TOOLACE: WINNING THE POINTS OF LLM FUNCTION CALLING

LightMover: Generative Light Movement with Color and Intensity Controls

Autonomous overtaking trajectory optimization using reinforcement learning and opponent pose estimation

Make It Up: Fake Images, Real Gains in Generalized Few-shot Semantic Segmentation

Two-Stage Acoustic Adaptation with Gated Cross-Attention Adapters for LLM-Based Multi-Talker Speech Recognition

A Comparative Study in Surgical AI: Datasets, Foundation Models, and Barriers to Med-AGI

Text Data Integration

Unified Number-Free Text-to-Motion Generation Via Flow Matching

SEAR: Schema-Based Evaluation and Routing for LLM Gateways

On-the-fly Repulsion in the Contextual Space for Rich Diversity in Diffusion Transformers

EpochX: Building the Infrastructure for an Emergent Agent Civilization

TAPS: Task Aware Proposal Distributions for Speculative Sampling

LongTail Driving Scenarios with Reasoning Traces: The KITScenes LongTail Dataset

RealChart2Code: Advancing Chart-to-Code Generation with Real Data and Multi-Task Evaluation

Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills

PackForcing: Short Video Training Suffices for Long Video Sampling and Long Context Inference

ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling

Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models

BeSafe-Bench: Unveiling Behavioral Safety Risks of Situated Agents in Functional Environments

World Reasoning Arena

MSA: Memory Sparse Attention for Efficient End-to-End Memory Model Scaling to 100M Tokens

Voxtral TTS

RealRestorer: Towards Generalizable Real-World Image Restoration with Large-Scale Image Editing Models

Calibri: Enhancing Diffusion Transformers via Parameter-Efficient Calibration

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

PixelSmile: Toward Fine-Grained Facial Expression Editing

Claudini: Autoresearch Discovers State-of-the-Art Adversarial Attack Algorithms for LLMs

Quadratic Gradient: A Unified Framework Bridging Gradient Descent and Newton-Type Methods by Synthesizing Hessians and Gradients

The capacity region of classes of product broadcast channels

Colon-Bench: An Agentic Workflow for Scalable Dense Lesion Annotation in Full-Procedure Colonoscopy Videos

TOOLACE: WINNING THE POINTS OF LLM FUNCTION CALLING

LightMover: Generative Light Movement with Color and Intensity Controls

Autonomous overtaking trajectory optimization using reinforcement learning and opponent pose estimation

Make It Up: Fake Images, Real Gains in Generalized Few-shot Semantic Segmentation

Two-Stage Acoustic Adaptation with Gated Cross-Attention Adapters for LLM-Based Multi-Talker Speech Recognition

A Comparative Study in Surgical AI: Datasets, Foundation Models, and Barriers to Med-AGI

Text Data Integration

Unified Number-Free Text-to-Motion Generation Via Flow Matching

SEAR: Schema-Based Evaluation and Routing for LLM Gateways

On-the-fly Repulsion in the Contextual Space for Rich Diversity in Diffusion Transformers

EpochX: Building the Infrastructure for an Emergent Agent Civilization

TAPS: Task Aware Proposal Distributions for Speculative Sampling

LongTail Driving Scenarios with Reasoning Traces: The KITScenes LongTail Dataset

RealChart2Code: Advancing Chart-to-Code Generation with Real Data and Multi-Task Evaluation

Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills

PackForcing: Short Video Training Suffices for Long Video Sampling and Long Context Inference

ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling

Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models

BeSafe-Bench: Unveiling Behavioral Safety Risks of Situated Agents in Functional Environments

World Reasoning Arena

MSA: Memory Sparse Attention for Efficient End-to-End Memory Model Scaling to 100M Tokens

Voxtral TTS

RealRestorer: Towards Generalizable Real-World Image Restoration with Large-Scale Image Editing Models

Calibri: Enhancing Diffusion Transformers via Parameter-Efficient Calibration

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

PixelSmile: Toward Fine-Grained Facial Expression Editing

Claudini: Autoresearch Discovers State-of-the-Art Adversarial Attack Algorithms for LLMs