Latest Papers
Cutting-edge AI research papers, updated daily, to help you keep up with the latest AI trends

CodeDiffuser: Attention-Enhanced Diffusion Policy via VLM-Generated Code for Instruction Ambiguity
Guang Yin, Yitong Li, Yixuan Wang, et al.
16 days ago

Optimizing Multilingual Text-To-Speech with Accents & Emotions
Pawar, Pranav ; Dwivedi, Akshansh ; Boricha, et al.
16 days ago

Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition
Li, Jiaqi ; Tang, Junshu ; Xu, et al.
16 days ago

VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning
Kang, Li ; Song, Xiufeng ; Zhou, et al.
16 days ago

PAROAttention: Pattern-Aware ReOrdering for Efficient Sparse and Quantized Attention in Visual Generation Models
Zhao, Tianchen ; Hong, Ke ; Yang, et al.
16 days ago

Vision-Guided Chunking Is All You Need: Enhancing RAG with Multimodal Document Understanding
Tripathi, Vishesh ; Odapally, Tanmay ; Das, et al.
16 days ago

Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights
Liang, Zhiyuan ; Tang, Dongwen ; Zhou, et al.
16 days ago

Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model
Aggarwal, Anirud ; Shrivastava, Abhinav ; Gwilliam, et al.
17 days ago

RE-IMAGINE: Symbolic Benchmark Synthesis for Reasoning Evaluation
Xu, Xinnuo ; Lawrence, Rachel ; Dubey, et al.
17 days ago

SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning
Chopra, Anuradha ; Roy, Abhinaba ; Herremans, et al.
17 days ago