Command Palette
Search for a command to run...
Papers
Daily updated cutting-edge AI research papers to help you keep up with the latest AI trends

PromptCoT 2.0: Scaling Prompt Synthesis for Large Language Model Reasoning

Extract-0: A Specialized Language Model for Document Information Extraction































PromptCoT 2.0: Scaling Prompt Synthesis for Large Language Model Reasoning

Extract-0: A Specialized Language Model for Document Information Extraction






























OmniRetarget: Interaction-Preserving Data Generation for Humanoid Whole-Body Loco-Manipulation and Scene Interaction
WildSpeech-Bench: Benchmarking End-to-End SpeechLLMs in the Wild
Token-Aware Editing of Internal Activations for Large Language Model Alignment
Looking to Learn: Token-wise Dynamic Gating for Low-Resource Vision-Language Modelling
Agent Learning via Early Experience
MATRIX: Mask Track Alignment for Interaction-aware Video Generation
RLinf-VLA: A Unified and Efficient Framework for VLA+RL Training
SHANKS: Simultaneous Hearing and Thinking for Spoken Language Models
Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding
Cache-to-Cache: Direct Semantic Communication Between Large Language Models
Ming-UniVision: Joint Image Understanding and Generation with a Unified Continuous Tokenizer
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Qwen2.5 Technical Report
Scientific Algorithm Discovery by Augmenting AlphaEvolve with Deep Research
ConstraintLLM: A Neuro-Symbolic Framework for Industrial-Level Constraint Programming
Scaling Code-Assisted Chain-of-Thoughts and Instructions for Model Reasoning
CoDA: Coding LM via Diffusion Adaptation
Fast-dLLM v2: Efficient Block-Diffusion LLM
Less is More: Recursive Reasoning with Tiny Networks
Fathom-DeepResearch: Unlocking Long Horizon Information Retrieval and Synthesis for SLMs
TaTToo: Tool-Grounded Thinking PRM for Test-Time Scaling in Tabular Reasoning
Hybrid Architectures for Language Models: Systematic Analysis and Design Insights
MITS: Enhanced Tree Search Reasoning for LLMs via Pointwise Mutual Information
Imperceptible Jailbreaking against Large Language Models
VChain: Chain-of-Visual-Thought for Reasoning in Video Generation
Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models
Paper2Video: Automatic Video Generation from Scientific Papers
Bridging the Gap Between Promise and Performance for Microscaling FP4 Quantization
Self-Improvement in Multimodal Large Language Models: A Survey
Compose Your Policies! Improving Diffusion-based or Flow-based Robot Policies via Test-time Distribution-level Composition
OmniRetarget: Interaction-Preserving Data Generation for Humanoid Whole-Body Loco-Manipulation and Scene Interaction
WildSpeech-Bench: Benchmarking End-to-End SpeechLLMs in the Wild
Token-Aware Editing of Internal Activations for Large Language Model Alignment
Looking to Learn: Token-wise Dynamic Gating for Low-Resource Vision-Language Modelling
Agent Learning via Early Experience
MATRIX: Mask Track Alignment for Interaction-aware Video Generation
RLinf-VLA: A Unified and Efficient Framework for VLA+RL Training
SHANKS: Simultaneous Hearing and Thinking for Spoken Language Models
Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding
Cache-to-Cache: Direct Semantic Communication Between Large Language Models
Ming-UniVision: Joint Image Understanding and Generation with a Unified Continuous Tokenizer
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Qwen2.5 Technical Report
Scientific Algorithm Discovery by Augmenting AlphaEvolve with Deep Research
ConstraintLLM: A Neuro-Symbolic Framework for Industrial-Level Constraint Programming
Scaling Code-Assisted Chain-of-Thoughts and Instructions for Model Reasoning
CoDA: Coding LM via Diffusion Adaptation
Fast-dLLM v2: Efficient Block-Diffusion LLM
Less is More: Recursive Reasoning with Tiny Networks
Fathom-DeepResearch: Unlocking Long Horizon Information Retrieval and Synthesis for SLMs
TaTToo: Tool-Grounded Thinking PRM for Test-Time Scaling in Tabular Reasoning
Hybrid Architectures for Language Models: Systematic Analysis and Design Insights
MITS: Enhanced Tree Search Reasoning for LLMs via Pointwise Mutual Information
Imperceptible Jailbreaking against Large Language Models
VChain: Chain-of-Visual-Thought for Reasoning in Video Generation
Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models
Paper2Video: Automatic Video Generation from Scientific Papers
Bridging the Gap Between Promise and Performance for Microscaling FP4 Quantization
Self-Improvement in Multimodal Large Language Models: A Survey
Compose Your Policies! Improving Diffusion-based or Flow-based Robot Policies via Test-time Distribution-level Composition