最新論文
日々更新される最先端AI研究論文、人工知能の最新動向を把握

LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision
Foundation Models
Haiwen Huang, Anpei Chen, Volodymyr Havrylov, et al.
公開日: 4/23/2025

LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale
Joya Chen, Ziyun Zeng, Yiqi Lin, et al.
公開日: 4/23/2025

SilVar-Med: A Speech-Driven Visual Language Model for Explainable
Abnormality Detection in Medical Imaging
Tan-Hanh Pham, Chris Ngo, Trong-Duong Bui, et al.
公開日: 4/23/2025

PROMPTEVALS: A Dataset of Assertions and Guardrails for Custom
Production Large Language Model Pipelines
Reya Vir, Shreya Shankar, Harrison Chase, et al.
公開日: 4/23/2025

BookWorld: From Novels to Interactive Agent Societies for Creative Story
Generation
Yiting Ran, Xintao Wang, Tian Qiu, et al.
公開日: 4/23/2025

Progent: Programmable Privilege Control for LLM Agents
Tianneng Shi, Jingxuan He, Zhun Wang, et al.
公開日: 4/23/2025

WALL-E 2.0: World Alignment by NeuroSymbolic Learning improves World
Model-based LLM Agents
Siyu Zhou, Tianyi Zhou, Yijun Yang, et al.
公開日: 4/23/2025

CoMotion: Concurrent Multi-person 3D Motion
Alejandro Newell, Peiyun Hu, Lahav Lipson, et al.
公開日: 4/23/2025

RF-DETR Object Detection vs YOLOv12 : A Study of Transformer-based and
CNN-based Architectures for Single-Class and Multi-Class Greenfruit Detection
in Complex Orchard Environments Under Label Ambiguity
Ranjan Sapkota, Rahul Harsha Cheppally, Ajay Sharda, et al.
公開日: 4/23/2025

From Reflection to Perfection: Scaling Inference-Time Optimization for
Text-to-Image Diffusion Models via Reflection Tuning
Le Zhuo, Liangbing Zhao, Sayak Paul, et al.
公開日: 4/23/2025