Command Palette
Search for a command to run...
Papers
Daily updated cutting-edge AI research papers to help you keep up with the latest AI trends

The Hunger Game Debate: On the Emergence of Over-Competition in Multi-Agent Systems

Training AI Co-Scientists Using Rubric Rewards































The Hunger Game Debate: On the Emergence of Over-Competition in Multi-Agent Systems

Training AI Co-Scientists Using Rubric Rewards






























AdaGaR: Adaptive Gabor Representation for Dynamic Scene Reconstruction
Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation
SenseNova-MARS: Empowering Multimodal Agentic Reasoning and Search via Reinforcement Learning
Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation
NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos
Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization
IQuest-Coder-V1 Technical Report
Recursive Language Models
FlowBlending: Stage-Aware Multi-Model Sampling for Fast and High-Fidelity Video Generation
Dream2Flow: Bridging Video Generation and Open-World Manipulation with 3D Object Flow
On the Role of Discreteness in Diffusion LLMs
DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models
Dynamic Large Concept Models: Latent Reasoning in an Adaptive Semantic Space
Improving Multi-step RAG with Hypergraph-based Memory for Long-Context Complex Relational Modeling
AI Meets Brain: Memory Systems from Cognitive Neuroscience to Autonomous Agents
Scaling Open-Ended Reasoning to Predict the Future
GaMO: Geometry-aware Multi-view Diffusion Outpainting for Sparse-View 3D Reconstruction
mHC: Manifold-Constrained Hyper-Connections
Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem
Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models
GateBreaker: Gate-Guided Attacks on Mixture-of-Expert LLMs
GraphLocator: Graph-guided Causal Reasoning for Issue Localization
Evaluating Parameter Efficient Methods for RLVR
End-to-End Test-Time Training for Long Context
DreamOmni3: Scribble-based Editing and Generation
UltraShape 1.0: High-Fidelity 3D Shape Generation via Scalable Geometric Refinement
mimic-video: Video-Action Models for Generalizable Robot Control Beyond VLAs
HY-Motion 1.0: Scaling Flow Matching Models for Text-To-Motion Generation
SurgWorld: Learning Surgical Robot Policies from Videos via World Modeling
SpotEdit: Selective Region Editing in Diffusion Transformers
AdaGaR: Adaptive Gabor Representation for Dynamic Scene Reconstruction
Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation
SenseNova-MARS: Empowering Multimodal Agentic Reasoning and Search via Reinforcement Learning
Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation
NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos
Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization
IQuest-Coder-V1 Technical Report
Recursive Language Models
FlowBlending: Stage-Aware Multi-Model Sampling for Fast and High-Fidelity Video Generation
Dream2Flow: Bridging Video Generation and Open-World Manipulation with 3D Object Flow
On the Role of Discreteness in Diffusion LLMs
DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models
Dynamic Large Concept Models: Latent Reasoning in an Adaptive Semantic Space
Improving Multi-step RAG with Hypergraph-based Memory for Long-Context Complex Relational Modeling
AI Meets Brain: Memory Systems from Cognitive Neuroscience to Autonomous Agents
Scaling Open-Ended Reasoning to Predict the Future
GaMO: Geometry-aware Multi-view Diffusion Outpainting for Sparse-View 3D Reconstruction
mHC: Manifold-Constrained Hyper-Connections
Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem
Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models
GateBreaker: Gate-Guided Attacks on Mixture-of-Expert LLMs
GraphLocator: Graph-guided Causal Reasoning for Issue Localization
Evaluating Parameter Efficient Methods for RLVR
End-to-End Test-Time Training for Long Context
DreamOmni3: Scribble-based Editing and Generation
UltraShape 1.0: High-Fidelity 3D Shape Generation via Scalable Geometric Refinement
mimic-video: Video-Action Models for Generalizable Robot Control Beyond VLAs
HY-Motion 1.0: Scaling Flow Matching Models for Text-To-Motion Generation
SurgWorld: Learning Surgical Robot Policies from Videos via World Modeling
SpotEdit: Selective Region Editing in Diffusion Transformers