HyperAI

Main

GPU

Console
Studio
Docs
Pricing

Pulse

News

Resources

Papers
Notebooks
Datasets
Wiki

Benchmarks

SOTA
LLM Models
GPU Leaderboard

Community

Events

Utility

About Terms of Service Privacy Policy
English

Command Palette

Search for a command to run...

HyperAI
Papers

Papers

Daily updated cutting-edge AI research papers to help you keep up with the latest AI trends

Build the Future of Artificial Intelligence

About

About Us Support Dataset Help

Products

News Papers Notebooks Datasets Wiki

Links

© HyperAI

GitHub Discord X (formerly Twitter)

HyperAI

Main

GPU

Console
Studio
Docs
Pricing

Pulse

News

Resources

Papers
Notebooks
Datasets
Wiki

Benchmarks

SOTA
LLM Models
GPU Leaderboard

Community

Events

Utility

About Terms of Service Privacy Policy
English

Command Palette

Search for a command to run...

HyperAI
Papers

Papers

Daily updated cutting-edge AI research papers to help you keep up with the latest AI trends

Build the Future of Artificial Intelligence

About

About Us Support Dataset Help

Products

News Papers Notebooks Datasets Wiki

Links

© HyperAI

GitHub Discord X (formerly Twitter)

LMEnt: A Suite for Analyzing Knowledge in Language Models from
Pretraining Data to Representations

LMEnt: A Suite for Analyzing Knowledge in Language Models from Pretraining Data to Representations

Daniela Gottesman, Alon Gilae-Dotan, Ido Cohen, et al.

Open Data Synthesis For Deep Research

Open Data Synthesis For Deep Research

Ziyi Xia, Kun Luo, Hongjin Qian, et al.

Robix: A Unified Model for Robot Interaction, Reasoning and Planning

Embodied Intelligence

Huang Fang, Mengxi Zhang, Heng Dong, et al.

FusionProt: Fusing Sequence and Structural Information for Unified Protein Representation Learning

Multimodal Representation

Dan Kalifa, Uriel Singer, Kira Radinsky

LimiX: Unleashing Structured-Data Modeling Capability for Generalist Intelligence

Multi-Task Learning

Xingxuan Zhang, Gang Ren, Han Yu, et al.

epiGPTope: A machine learning-based epitope generator and classifier

Natalia Flechas Manrique, Alberto Martínez, Elena López-Martínez, et al.

GenCompositor: Generative Video Compositing with Diffusion Transformer

Video Generation

Video Processing

Shuzhou Yang, Xiaoyu Li, Xiaodong Cun, et al.

DCPO: Dynamic Clipping Policy Optimization

Reinforcement Learning

Shihui Yang, Chengfeng Dou, Peidong Guo, et al.

Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task
Arithmetic

Mohammad Zbeeb, Hasan Abed Al Kader Hammoud, Bernard Ghanem

Baichuan-M2: Scaling Medical Capability with Large Verifier System

Baichuan-M2 Team, Chengfeng Dou, Chong Liu, et al.

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

Dongfu Jiang, Yi Lu, Zhuofeng Li, et al.

ELV-Halluc: Benchmarking Semantic Aggregation Hallucinations in Long Video Understanding

Hao Lu, Jiahao Wang, Yaolun Zhang, et al.

AlphaEarth Foundations: An embedding field model for accurate and efficient global mapping from sparse label data

Christopher F. Brown, Michal R. Kazmierski, Valerie J. Pasquarella, et al.

AetherCode: Evaluating LLMs' Ability to Win In Premier Programming Competitions

Code Generation

Zihan Wang, Jiaze Chen, Zhicheng Liu, et al.

TileLang: A Composable Tiled Programming Model for AI Systems

Wang Lei, Cheng Yu, Shi Yining, et al.

DeepSeek-R1 Thoughtology: Let's think about LLM Reasoning

Sara Vera Marjanović, Arkil Patel, Vaibhav Adlakha, et al.

Multi-Ontology Integration with Dual-Axis Propagation for Medical Concept Representation

Retrieval-Augmented Generation

Mohsen Nayebi Kerdabadi, Arya Hadizadeh Moghaddam, Dongjie Wang, Zijun Yao

Automated Clinical Problem Detection from SOAP Notes using a Collaborative Multi-Agent LLM Architecture

Yeawon Lee, Xiaoyang Wang, Christopher C. Yang

SmolDocling: An ultra-compact vision-language model for end-to-end
multi-modal document conversion

Document Understanding

Ahmed Nassar, Andres Marafioti, Matteo Omenetti, et al.

olmOCR: Unlocking Trillions of Tokens in PDFs with Vision Language Models

Document Understanding

Luca Soldaini, Kyle Lo, Christopher Wilhelm, et al.

How Can Input Reformulation Improve Tool Usage Accuracy in a Complex Dynamic Environment? A Study on $τ$-bench

Venkatesh Mishra, Amir Saeidi, Satyam Raj, et al.

UI-Level Evaluation of ALLaM 34B: Measuring an Arabic-Centric LLM via
HUMAIN Chat

Natural Language Processing

From reactive to cognitive: brain-inspired spatial intelligence for
embodied agents

Embodied Intelligence

Shouwei Ruan, Liyuan Wang, Caixin Kang, et al.

No Label Left Behind: A Unified Surface Defect Detection Model for all Supervision Regimes

Computer Vision

Object Detection

Blaž Rolih, Matic Fučka, Danijel Skočaj

T2R-bench: A Benchmark for Generating Article-Level Reports from Real World Industrial Tables

Jie Zhang, Changzai Pan, Kaiwen Wei, et al.

PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning

Reinforcement Learning

Wenfeng Feng, Penghong Zhao, Guochao Jiang, et al.

Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback

Preference Modeling

Reinforcement Learning

Yuntao Bai, Andy Jones, Kamal Ndousse, et al.

UQ: Assessing Language Models on Unsolved Questions

Fan Nie, Ken Ziyu Liu, Zihao Wang, et al.

CARJAN: Agent-Based Generation and Simulation of Traffic Scenarios with AJAN

Autonomous Driving

Leonard Frank Neis, Andre Antakli, Matthias Klusch

TiKMiX: Take Data Influence into Dynamic Mixture for Language Model
Pre-training

Yifan Wang, Binbin Liu, Fengze Liu, et al.

TalkVid: A Large-Scale Diversified Dataset for Audio-Driven Talking Head Synthesis

Shunian Chen, Hejin Huang, Yexin Liu, et al.

Droplet3D: Commonsense Priors from Videos Facilitate 3D Generation

Video Understanding

Xiaochuan Li, Guoguang Du, Runze Zhang, et al.

LMEnt: A Suite for Analyzing Knowledge in Language Models from
Pretraining Data to Representations

LMEnt: A Suite for Analyzing Knowledge in Language Models from Pretraining Data to Representations

Daniela Gottesman, Alon Gilae-Dotan, Ido Cohen, et al.

Open Data Synthesis For Deep Research

Open Data Synthesis For Deep Research

Ziyi Xia, Kun Luo, Hongjin Qian, et al.

Robix: A Unified Model for Robot Interaction, Reasoning and Planning

Embodied Intelligence

Huang Fang, Mengxi Zhang, Heng Dong, et al.

FusionProt: Fusing Sequence and Structural Information for Unified Protein Representation Learning

Multimodal Representation

Dan Kalifa, Uriel Singer, Kira Radinsky

LimiX: Unleashing Structured-Data Modeling Capability for Generalist Intelligence

Multi-Task Learning

Xingxuan Zhang, Gang Ren, Han Yu, et al.

epiGPTope: A machine learning-based epitope generator and classifier

Natalia Flechas Manrique, Alberto Martínez, Elena López-Martínez, et al.

GenCompositor: Generative Video Compositing with Diffusion Transformer

Video Generation

Video Processing

Shuzhou Yang, Xiaoyu Li, Xiaodong Cun, et al.

DCPO: Dynamic Clipping Policy Optimization

Reinforcement Learning

Shihui Yang, Chengfeng Dou, Peidong Guo, et al.

Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task
Arithmetic

Mohammad Zbeeb, Hasan Abed Al Kader Hammoud, Bernard Ghanem

Baichuan-M2: Scaling Medical Capability with Large Verifier System

Baichuan-M2 Team, Chengfeng Dou, Chong Liu, et al.

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

Dongfu Jiang, Yi Lu, Zhuofeng Li, et al.

ELV-Halluc: Benchmarking Semantic Aggregation Hallucinations in Long Video Understanding

Hao Lu, Jiahao Wang, Yaolun Zhang, et al.

AlphaEarth Foundations: An embedding field model for accurate and efficient global mapping from sparse label data

Christopher F. Brown, Michal R. Kazmierski, Valerie J. Pasquarella, et al.

AetherCode: Evaluating LLMs' Ability to Win In Premier Programming Competitions

Code Generation

Zihan Wang, Jiaze Chen, Zhicheng Liu, et al.

TileLang: A Composable Tiled Programming Model for AI Systems

Wang Lei, Cheng Yu, Shi Yining, et al.

DeepSeek-R1 Thoughtology: Let's think about LLM Reasoning

Sara Vera Marjanović, Arkil Patel, Vaibhav Adlakha, et al.

Multi-Ontology Integration with Dual-Axis Propagation for Medical Concept Representation

Retrieval-Augmented Generation

Mohsen Nayebi Kerdabadi, Arya Hadizadeh Moghaddam, Dongjie Wang, Zijun Yao

Automated Clinical Problem Detection from SOAP Notes using a Collaborative Multi-Agent LLM Architecture

Yeawon Lee, Xiaoyang Wang, Christopher C. Yang

SmolDocling: An ultra-compact vision-language model for end-to-end
multi-modal document conversion

Document Understanding

Ahmed Nassar, Andres Marafioti, Matteo Omenetti, et al.

olmOCR: Unlocking Trillions of Tokens in PDFs with Vision Language Models

Document Understanding

Luca Soldaini, Kyle Lo, Christopher Wilhelm, et al.

How Can Input Reformulation Improve Tool Usage Accuracy in a Complex Dynamic Environment? A Study on $τ$-bench

Venkatesh Mishra, Amir Saeidi, Satyam Raj, et al.

UI-Level Evaluation of ALLaM 34B: Measuring an Arabic-Centric LLM via
HUMAIN Chat

Natural Language Processing

From reactive to cognitive: brain-inspired spatial intelligence for
embodied agents

Embodied Intelligence

Shouwei Ruan, Liyuan Wang, Caixin Kang, et al.

No Label Left Behind: A Unified Surface Defect Detection Model for all Supervision Regimes

Computer Vision

Object Detection

Blaž Rolih, Matic Fučka, Danijel Skočaj

T2R-bench: A Benchmark for Generating Article-Level Reports from Real World Industrial Tables

Jie Zhang, Changzai Pan, Kaiwen Wei, et al.

PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning

Reinforcement Learning

Wenfeng Feng, Penghong Zhao, Guochao Jiang, et al.

Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback

Preference Modeling

Reinforcement Learning

Yuntao Bai, Andy Jones, Kamal Ndousse, et al.

UQ: Assessing Language Models on Unsolved Questions

Fan Nie, Ken Ziyu Liu, Zihao Wang, et al.

CARJAN: Agent-Based Generation and Simulation of Traffic Scenarios with AJAN

Autonomous Driving

Leonard Frank Neis, Andre Antakli, Matthias Klusch

TiKMiX: Take Data Influence into Dynamic Mixture for Language Model
Pre-training

Yifan Wang, Binbin Liu, Fengze Liu, et al.

TalkVid: A Large-Scale Diversified Dataset for Audio-Driven Talking Head Synthesis

Shunian Chen, Hejin Huang, Yexin Liu, et al.

Droplet3D: Commonsense Priors from Videos Facilitate 3D Generation

Video Understanding

Xiaochuan Li, Guoguang Du, Runze Zhang, et al.

Robix: A Unified Model for Robot Interaction, Reasoning and Planning

FusionProt: Fusing Sequence and Structural Information for Unified Protein Representation Learning

LimiX: Unleashing Structured-Data Modeling Capability for Generalist Intelligence

epiGPTope: A machine learning-based epitope generator and classifier

GenCompositor: Generative Video Compositing with Diffusion Transformer

DCPO: Dynamic Clipping Policy Optimization

Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task Arithmetic

Baichuan-M2: Scaling Medical Capability with Large Verifier System

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

ELV-Halluc: Benchmarking Semantic Aggregation Hallucinations in Long Video Understanding

AlphaEarth Foundations: An embedding field model for accurate and efficient global mapping from sparse label data

AetherCode: Evaluating LLMs' Ability to Win In Premier Programming Competitions

TileLang: A Composable Tiled Programming Model for AI Systems

DeepSeek-R1 Thoughtology: Let's think about LLM Reasoning

Multi-Ontology Integration with Dual-Axis Propagation for Medical Concept Representation

Automated Clinical Problem Detection from SOAP Notes using a Collaborative Multi-Agent LLM Architecture

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

olmOCR: Unlocking Trillions of Tokens in PDFs with Vision Language Models

How Can Input Reformulation Improve Tool Usage Accuracy in a Complex Dynamic Environment? A Study on $τ$ -bench

UI-Level Evaluation of ALLaM 34B: Measuring an Arabic-Centric LLM via HUMAIN Chat

From reactive to cognitive: brain-inspired spatial intelligence for embodied agents

No Label Left Behind: A Unified Surface Defect Detection Model for all Supervision Regimes

T2R-bench: A Benchmark for Generating Article-Level Reports from Real World Industrial Tables

PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning

Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback

UQ: Assessing Language Models on Unsolved Questions

CARJAN: Agent-Based Generation and Simulation of Traffic Scenarios with AJAN

TiKMiX: Take Data Influence into Dynamic Mixture for Language Model Pre-training

TalkVid: A Large-Scale Diversified Dataset for Audio-Driven Talking Head Synthesis

Droplet3D: Commonsense Priors from Videos Facilitate 3D Generation

Robix: A Unified Model for Robot Interaction, Reasoning and Planning

FusionProt: Fusing Sequence and Structural Information for Unified Protein Representation Learning

LimiX: Unleashing Structured-Data Modeling Capability for Generalist Intelligence

epiGPTope: A machine learning-based epitope generator and classifier

GenCompositor: Generative Video Compositing with Diffusion Transformer

DCPO: Dynamic Clipping Policy Optimization

Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task Arithmetic

Baichuan-M2: Scaling Medical Capability with Large Verifier System

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

ELV-Halluc: Benchmarking Semantic Aggregation Hallucinations in Long Video Understanding

AlphaEarth Foundations: An embedding field model for accurate and efficient global mapping from sparse label data

AetherCode: Evaluating LLMs' Ability to Win In Premier Programming Competitions

TileLang: A Composable Tiled Programming Model for AI Systems

DeepSeek-R1 Thoughtology: Let's think about LLM Reasoning

Multi-Ontology Integration with Dual-Axis Propagation for Medical Concept Representation

Automated Clinical Problem Detection from SOAP Notes using a Collaborative Multi-Agent LLM Architecture

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

olmOCR: Unlocking Trillions of Tokens in PDFs with Vision Language Models

How Can Input Reformulation Improve Tool Usage Accuracy in a Complex Dynamic Environment? A Study on $τ$ -bench

UI-Level Evaluation of ALLaM 34B: Measuring an Arabic-Centric LLM via HUMAIN Chat

From reactive to cognitive: brain-inspired spatial intelligence for embodied agents

No Label Left Behind: A Unified Surface Defect Detection Model for all Supervision Regimes

T2R-bench: A Benchmark for Generating Article-Level Reports from Real World Industrial Tables

PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning

Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback

UQ: Assessing Language Models on Unsolved Questions

CARJAN: Agent-Based Generation and Simulation of Traffic Scenarios with AJAN

TiKMiX: Take Data Influence into Dynamic Mixture for Language Model Pre-training

TalkVid: A Large-Scale Diversified Dataset for Audio-Driven Talking Head Synthesis

Droplet3D: Commonsense Priors from Videos Facilitate 3D Generation