Latest Papers
Daily updated cutting-edge AI research papers to help you keep up with the latest AI trends

SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis
Zijian Wu, Jinjie Ni, Xiangyan Liu, et al.
Release Date: 6/4/2025

VS-Bench: Evaluating VLMs for Strategic Reasoning and Decision-Making in
Multi-Agent Environments
Zelai Xu, Zhexuan Xu, Xiangmin Yi, et al.
Release Date: 6/4/2025

CSVQA: A Chinese Multimodal Benchmark for Evaluating STEM Reasoning
Capabilities of VLMs
Jian, Ai ; Qiu, Weijie ; Wang, et al.
Release Date: 6/4/2025

UniWorld: High-Resolution Semantic Encoders for Unified Visual
Understanding and Generation
Bin Lin, Zongjian Li, Xinhua Cheng, et al.
Release Date: 6/4/2025

EarthMind: Towards Multi-Granular and Multi-Sensor Earth Observation
with Large Multimodal Models
Yan Shu, Bin Ren, Zhitong Xiong, et al.
Release Date: 6/4/2025

SRPO: Enhancing Multimodal LLM Reasoning via Reflection-Aware
Reinforcement Learning
Zhongwei Wan, Zhihao Dou, Che Liu, et al.
Release Date: 6/4/2025

ShapeLLM-Omni: A Native Multimodal LLM for 3D Generation and
Understanding
Junliang Ye, Zhengyi Wang, Ruowen Zhao, et al.
Release Date: 6/4/2025

LoHoVLA: A Unified Vision-Language-Action Model for Long-Horizon
Embodied Tasks
Yang, Yi ; Sun, Jiaxuan ; Kou, et al.
Release Date: 6/4/2025

Temporal In-Context Fine-Tuning for Versatile Control of Video Diffusion
Models
Kinam Kim, Junha Hyung, Jaegul Choo
Release Date: 6/3/2025

Jigsaw-R1: A Study of Rule-based Visual Reinforcement Learning with
Jigsaw Puzzles
Wang, Zifu ; Zhu, Junyi ; Tang, et al.
Release Date: 6/3/2025