Command Palette
Search for a command to run...
Papers
Daily updated cutting-edge AI research papers to help you keep up with the latest AI trends

QuitoBench: A High-Quality Open Time Series Forecasting Benchmark

Vision2Web: A Hierarchical Benchmark for Visual Website Development with Agent Verification































QuitoBench: A High-Quality Open Time Series Forecasting Benchmark

Vision2Web: A Hierarchical Benchmark for Visual Website Development with Agent Verification






























ViGoR-Bench: How Far Are Visual Generative Models From Zero-Shot Visual Reasoners?
MiroEval: Benchmarking Multimodal Deep Research Agents in Process and Outcome
Terminal Agents Suffice for Enterprise Automation
ClawKeeper: Comprehensive Safety Protection for OpenClaw Agents Through Skills, Plugins, and Watchers
Cheap Bootstrap for Fast Uncertainty Quantification of Stochastic Gradient Descent
Generative AI Enables Structural Brain Network Construction from fMRI via Symmetric Diffusion Learning
Early Exiting Predictive Coding Neural Networks for Edge AI
Quadratic Gradient: A Unified Framework Bridging Gradient Descent and Newton-Type Methods by Synthesizing Hessians and Gradients
The capacity region of classes of product broadcast channels
Colon-Bench: An Agentic Workflow for Scalable Dense Lesion Annotation in Full-Procedure Colonoscopy Videos
TOOLACE: WINNING THE POINTS OF LLM FUNCTION CALLING
LightMover: Generative Light Movement with Color and Intensity Controls
Autonomous overtaking trajectory optimization using reinforcement learning and opponent pose estimation
Make It Up: Fake Images, Real Gains in Generalized Few-shot Semantic Segmentation
Two-Stage Acoustic Adaptation with Gated Cross-Attention Adapters for LLM-Based Multi-Talker Speech Recognition
A Comparative Study in Surgical AI: Datasets, Foundation Models, and Barriers to Med-AGI
Text Data Integration
Unified Number-Free Text-to-Motion Generation Via Flow Matching
SEAR: Schema-Based Evaluation and Routing for LLM Gateways
On-the-fly Repulsion in the Contextual Space for Rich Diversity in Diffusion Transformers
EpochX: Building the Infrastructure for an Emergent Agent Civilization
TAPS: Task Aware Proposal Distributions for Speculative Sampling
LongTail Driving Scenarios with Reasoning Traces: The KITScenes LongTail Dataset
RealChart2Code: Advancing Chart-to-Code Generation with Real Data and Multi-Task Evaluation
Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills
PackForcing: Short Video Training Suffices for Long Video Sampling and Long Context Inference
ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling
Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models
BeSafe-Bench: Unveiling Behavioral Safety Risks of Situated Agents in Functional Environments
World Reasoning Arena
ViGoR-Bench: How Far Are Visual Generative Models From Zero-Shot Visual Reasoners?
MiroEval: Benchmarking Multimodal Deep Research Agents in Process and Outcome
Terminal Agents Suffice for Enterprise Automation
ClawKeeper: Comprehensive Safety Protection for OpenClaw Agents Through Skills, Plugins, and Watchers
Cheap Bootstrap for Fast Uncertainty Quantification of Stochastic Gradient Descent
Generative AI Enables Structural Brain Network Construction from fMRI via Symmetric Diffusion Learning
Early Exiting Predictive Coding Neural Networks for Edge AI
Quadratic Gradient: A Unified Framework Bridging Gradient Descent and Newton-Type Methods by Synthesizing Hessians and Gradients
The capacity region of classes of product broadcast channels
Colon-Bench: An Agentic Workflow for Scalable Dense Lesion Annotation in Full-Procedure Colonoscopy Videos
TOOLACE: WINNING THE POINTS OF LLM FUNCTION CALLING
LightMover: Generative Light Movement with Color and Intensity Controls
Autonomous overtaking trajectory optimization using reinforcement learning and opponent pose estimation
Make It Up: Fake Images, Real Gains in Generalized Few-shot Semantic Segmentation
Two-Stage Acoustic Adaptation with Gated Cross-Attention Adapters for LLM-Based Multi-Talker Speech Recognition
A Comparative Study in Surgical AI: Datasets, Foundation Models, and Barriers to Med-AGI
Text Data Integration
Unified Number-Free Text-to-Motion Generation Via Flow Matching
SEAR: Schema-Based Evaluation and Routing for LLM Gateways
On-the-fly Repulsion in the Contextual Space for Rich Diversity in Diffusion Transformers
EpochX: Building the Infrastructure for an Emergent Agent Civilization
TAPS: Task Aware Proposal Distributions for Speculative Sampling
LongTail Driving Scenarios with Reasoning Traces: The KITScenes LongTail Dataset
RealChart2Code: Advancing Chart-to-Code Generation with Real Data and Multi-Task Evaluation
Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills
PackForcing: Short Video Training Suffices for Long Video Sampling and Long Context Inference
ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling
Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models
BeSafe-Bench: Unveiling Behavioral Safety Risks of Situated Agents in Functional Environments
World Reasoning Arena