Papers
Cutting-edge AI research papers, updated daily, to help you keep up with the latest AI trends

Recurrent-Depth VLA: Implicit Test-Time Compute Scaling of Vision-Language-Action Models via Latent Iterative Reasoning

QuantaAlpha: An Evolutionary Framework for LLM-Driven Alpha Mining

Modality Gap-Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models
MOVA: Towards Scalable and Synchronized Video-Audio Generation
MemoryLLM: Plug-n-Play Interpretable Feed-Forward Memory for Transformers
DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos
F-GRPO: Don't Let Your Policy Learn the Obvious and Forget the Rare
MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration
AudioSAE: Towards Understanding of Audio-Processing Models with Sparse AutoEncoders
On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models
OdysseyArena: Benchmarking Large Language Models For Long-Horizon, Active and Inductive Interactions
Baichuan-M3: Modeling Clinical Inquiry for Reliable Medical Decision-Making
Generative Modeling via Drifting
AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models
Learning to Reason in 13 Parameters
DFlash: Block Diffusion for Flash Speculative Decoding
Context Forcing: Consistent Autoregressive Video Generation with Long Context
MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents
Length-Unbiased Sequence Policy Optimization: Revealing and Controlling Response Length Variation in RLVR
Spider-Sense: Intrinsic Risk Sensing for Efficient Agent Defense with Hierarchical Adaptive Screening
CAR-bench: Evaluating the Consistency and Limit-Awareness of LLM Agents under Real-World Uncertainty
Streaming Sequence-to-Sequence Learning with Delayed Streams Modeling
Kiss3DGen: Repurposing Image Diffusion Models for 3D Asset Generation
Stateful Conformer with Cache-Based Inference for Streaming Automatic Speech Recognition
Native and Compact Structured Latents for 3D Generation
Continuous Audio Language Models
Evolving Interactive Diagnostic Agents in a Virtual Clinical Environment
WeDLM: Reconciling Diffusion Language Models with Standard Causal Attention for Fast Inference
TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times
HunyuanVideo-Foley: Multimodal Diffusion with Representation Alignment for High-Fidelity Foley Audio Generation
Fara-7B: An Efficient Agentic Model for Computer Use
Fun-ASR Technical Report