Command Palette
Search for a command to run...
Papers
Daily updated cutting-edge AI research papers to help you keep up with the latest AI trends

CMPhysBench: A Benchmark for Evaluating Large Language Models in Condensed Matter Physics

TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling































CMPhysBench: A Benchmark for Evaluating Large Language Models in Condensed Matter Physics

TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling






























Nemotron-CC-Math: A 133 Billion-Token-Scale High Quality Math Pretraining Dataset
Understanding Tool-Integrated Reasoning
Spacer: Towards Engineered Scientific Inspiration
Beyond Memorization: Extending Reasoning Depth with Recurrence, Memory and Test-Time Compute Scaling
VibeVoice Technical Report
MMTok: Multimodal Coverage Maximization for Efficient Inference of VLMs
MV-RAG: Retrieval Augmented Multiview Diffusion
Connecting metal-organic framework synthesis to applications using multimodal machine learning
Model Context Protocols in Adaptive Transport Systems: A Survey
Algorithmic Collective Action with Multiple Collectives
OpenCUA: Open Foundations for Computer-Use Agents
Spatial Policy: Guiding Visuomotor Robotic Manipulation with Spatial-Aware Modeling and Reasoning
Jet-Nemotron: Efficient Language Model with Post Neural Architecture Search
CRISP: Persistent Concept Unlearning via Sparse Autoencoders
Selective Contrastive Learning for Weakly Supervised Affordance Grounding
EgoTwin: Dreaming Body and View in First Person
Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR
ODYSSEY: Open-World Quadrupeds Exploration and Manipulation for Long-Horizon Tasks
AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs
Constraints-Guided Diffusion Reasoner for Neuro-Symbolic Learning
LLM-Based Agents for Competitive Landscape Mapping in Drug Asset Due Diligence
SceneGen: Single-Image 3D Scene Generation in One Feedforward Pass
A Survey on Large Language Model Benchmarks
Waver: Wave Your Way to Lifelike Video Generation
LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries
Deep Think with Confidence
Mobile-Agent-v3: Foundamental Agents for GUI Automation
Intern-S1: A Scientific Multimodal Foundation Model
Language-Guided Tuning: Enhancing Numeric Optimization with Textual Feedback
NiceWebRL: a Python library for human subject experiments with reinforcement learning environments
Nemotron-CC-Math: A 133 Billion-Token-Scale High Quality Math Pretraining Dataset
Understanding Tool-Integrated Reasoning
Spacer: Towards Engineered Scientific Inspiration
Beyond Memorization: Extending Reasoning Depth with Recurrence, Memory and Test-Time Compute Scaling
VibeVoice Technical Report
MMTok: Multimodal Coverage Maximization for Efficient Inference of VLMs
MV-RAG: Retrieval Augmented Multiview Diffusion
Connecting metal-organic framework synthesis to applications using multimodal machine learning
Model Context Protocols in Adaptive Transport Systems: A Survey
Algorithmic Collective Action with Multiple Collectives
OpenCUA: Open Foundations for Computer-Use Agents
Spatial Policy: Guiding Visuomotor Robotic Manipulation with Spatial-Aware Modeling and Reasoning
Jet-Nemotron: Efficient Language Model with Post Neural Architecture Search
CRISP: Persistent Concept Unlearning via Sparse Autoencoders
Selective Contrastive Learning for Weakly Supervised Affordance Grounding
EgoTwin: Dreaming Body and View in First Person
Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR
ODYSSEY: Open-World Quadrupeds Exploration and Manipulation for Long-Horizon Tasks
AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs
Constraints-Guided Diffusion Reasoner for Neuro-Symbolic Learning
LLM-Based Agents for Competitive Landscape Mapping in Drug Asset Due Diligence
SceneGen: Single-Image 3D Scene Generation in One Feedforward Pass
A Survey on Large Language Model Benchmarks
Waver: Wave Your Way to Lifelike Video Generation
LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries
Deep Think with Confidence
Mobile-Agent-v3: Foundamental Agents for GUI Automation
Intern-S1: A Scientific Multimodal Foundation Model
Language-Guided Tuning: Enhancing Numeric Optimization with Textual Feedback
NiceWebRL: a Python library for human subject experiments with reinforcement learning environments