Latest Papers
Daily updated cutting-edge AI research papers to help you keep up with the latest AI trends

LAPO: Internalizing Reasoning Efficiency via Length-Adaptive Policy Optimization
Xingyu Wu; Yuchen Yan; Shangke Lyu; Linjuan Wu; Yiwen Qiu; Yongliang Shen; Weiming Lu; Jian Shao; Jun Xiao; Yueting Zhuang
2 days ago

MUR: Momentum Uncertainty guided Reasoning for Large Language Models
Hang Yan; Fangzhi Xu; Rongman Xu; Yifei Li; Jian Zhang; Haoran Luo; Xiaobao Wu; Luu Anh Tuan; Haiteng Zhao; Qika Lin; Jun Liu
2 days ago

$\nabla$NABLA: Neighborhood Adaptive Block-Level Attention
Dmitrii Mikhailov; Aleksey Letunovskiy; Maria Kovaleva; Vladimir Arkhipkin; Vladimir Korviakov; Vladimir Polovnikov; Viacheslav Vasilev; Evelina Sidorova; Denis Dimitrov
2 days ago

Group Sequence Policy Optimization
Chujie Zheng, Shixuan Liu, Mingze Li, et al.
2 days ago

SafeWork-R1: Coevolving Safety and Intelligence under the AI-45 Law
Yicheng Bao, Guanxu Chen, Mingkang Chen, et al.
5 days ago

Decoupling Knowledge and Reasoning in LLMs: An Exploration Using Cognitive Dual-System Theory
Mutian Yang, Jiandong Gao, Ji Wu
5 days ago

Re:Form -- Reducing Human Priors in Scalable Formal Software Verification with RL in LLMs: A Preliminary Study on Dafny
Chuanhao Yan; Fengdi Che; Xuhan Huang; Xu Xu; Xin Li; Yizhi Li; Xingwei Qu; Jingzhe Shi; Zhuangzhuang He; Chenghua Lin; Yaodong Yang; Binhang Yuan; Hang Zhao; Yu Qiao; Bowen Zhou; Jie Fu
5 days ago

RAVine: Reality-Aligned Evaluation for Agentic Search
Yilong Xu; Xiang Long; Zhi Zheng; Jinhua Gao
5 days ago

Can One Domain Help Others? A Data-Centric Study on Multi-Domain
Reasoning via Reinforcement Learning
Yu Li, Zhuoshi Pan, Honglin Lin, et al.
5 days ago

DesignLab: Designing Slides Through Iterative Detection and Correction
Jooyeol Yun, Heng Wang, Yotaro Shimose, et al.
5 days ago