최신 연구 논문
매일 업데이트되는 최첨단 AI 연구 논문으로 최신 AI 트렌드를 파악하세요

SageAttention2++: A More Efficient Implementation of SageAttention2
Zhang, Jintao ; Xu, Xiaoming ; Wei, et al.
발행일: 5/29/2025

Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO
Lai Wei, Yuting Li, Chen Wang, et al.
발행일: 5/29/2025

SWE-rebench: An Automated Pipeline for Task Collection and
Decontaminated Evaluation of Software Engineering Agents
Badertdinov, Ibragim ; Golubev, Alexander ; Nekrashevich, et al.
발행일: 5/29/2025

Sherlock: Self-Correcting Reasoning in Vision-Language Models
Yi Ding, Ruqi Zhang
발행일: 5/29/2025

Guided by Gut: Efficient Test-Time Scaling with Reinforced Intrinsic
Confidence
Ghasemabadi, Amirhosein ; Mills, Keith G. ; Li, et al.
발행일: 5/29/2025

The Entropy Mechanism of Reinforcement Learning for Reasoning Language
Models
Ganqu Cui, Yuchen Zhang, Jiacheng Chen, et al.
발행일: 5/29/2025

PS4PRO: Pixel-to-pixel Supervision for Photorealistic Rendering and Optimization
Yezhi Shen, Qiuchen Zhai, Fengqing Zhu
발행일: 5/29/2025

3DLLM-Mem: Long-Term Spatial-Temporal Memory for Embodied 3D Large Language Model
Wenbo Hu, Yining Hong, Yanjun Wang, et al.
발행일: 5/29/2025

VerIPO: Cultivating Long Reasoning in Video-LLMs via Verifier-Gudied
Iterative Policy Optimization
Li, Yunxin ; Chen, Xinyu ; Li, et al.
발행일: 5/28/2025

SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning
Logical Reasoning and Beyond
Liu, Junteng ; Fan, Yuanxiang ; Jiang, et al.
발행일: 5/28/2025