HyperAI超神経

Latest Papers

Cutting-edge AI research papers, updated daily, to help you keep up with the latest developments in artificial intelligence

LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models
Haiwen Huang, Anpei Chen, Volodymyr Havrylov, et al.
Published: 4/23/2025
LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale
Joya Chen, Ziyun Zeng, Yiqi Lin, et al.
Published: 4/23/2025
SilVar-Med: A Speech-Driven Visual Language Model for Explainable Abnormality Detection in Medical Imaging
Tan-Hanh Pham, Chris Ngo, Trong-Duong Bui, et al.
Published: 4/23/2025
PROMPTEVALS: A Dataset of Assertions and Guardrails for Custom Production Large Language Model Pipelines
Reya Vir, Shreya Shankar, Harrison Chase, et al.
Published: 4/23/2025
BookWorld: From Novels to Interactive Agent Societies for Creative Story Generation
Yiting Ran, Xintao Wang, Tian Qiu, et al.
Published: 4/23/2025
Progent: Programmable Privilege Control for LLM Agents
Tianneng Shi, Jingxuan He, Zhun Wang, et al.
Published: 4/23/2025
WALL-E 2.0: World Alignment by NeuroSymbolic Learning improves World Model-based LLM Agents
Siyu Zhou, Tianyi Zhou, Yijun Yang, et al.
Published: 4/23/2025
CoMotion: Concurrent Multi-person 3D Motion
Alejandro Newell, Peiyun Hu, Lahav Lipson, et al.
Published: 4/23/2025
RF-DETR Object Detection vs YOLOv12 : A Study of Transformer-based and CNN-based Architectures for Single-Class and Multi-Class Greenfruit Detection in Complex Orchard Environments Under Label Ambiguity
Ranjan Sapkota, Rahul Harsha Cheppally, Ajay Sharda, et al.
Published: 4/23/2025
From Reflection to Perfection: Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning
Le Zhuo, Liangbing Zhao, Sayak Paul, et al.
Published: 4/23/2025