最新論文
日々更新される最先端AI研究論文、人工知能の最新動向を把握

LookingGlass: Generative Anamorphoses via Laplacian Pyramid Warping
Pascal Chang, Sergio Sancho, Jingwei Tang, et al.
公開日: 4/23/2025

Complex-Edit: CoT-Like Instruction Generation for
Complexity-Controllable Image Editing Benchmark
Siwei Yang, Mude Hui, Bingchen Zhao, et al.
公開日: 4/23/2025

Thought Manipulation: External Thought Can Be Efficient for Large
Reasoning Models
Yule Liu, Jingyi Zheng, Zhen Sun, et al.
公開日: 4/23/2025

Efficient Pretraining Length Scaling
Bohong Wu, Shen Yan, Sijun Zhang, et al.
公開日: 4/23/2025

FocusedAD: Character-centric Movie Audio Description
Xiaojun Ye, Chun Wang, Yiren Song, et al.
公開日: 4/23/2025

LearnAct: Few-Shot Mobile GUI Agent with a Unified Demonstration
Benchmark
Guangyi Liu, Pengxiang Zhao, Liang Liu, et al.
公開日: 4/23/2025

IV-Bench: A Benchmark for Image-Grounded Video Perception and Reasoning
in Multimodal LLMs
David Ma, Yuanxing Zhang, Jincheng Ren, et al.
公開日: 4/23/2025

Tokenize Image Patches: Global Context Fusion for Effective Haze Removal
in Large Images
Jiuchen Chen, Xinyu Yan, Qizhi Xu, et al.
公開日: 4/23/2025

CheXWorld: Exploring Image World Modeling for Radiograph Representation
Learning
Yang Yue, Yulin Wang, Chenxin Tao, et al.
公開日: 4/23/2025

InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to
Deliberative Reasoners
Yuhang Liu, Pengxiang Li, Congkai Xie, et al.
公開日: 4/23/2025