Papers

Dian Zheng, Harry Lee, Manyuan Zhang, et al.

MaxProof: Scaling Mathematical Proof with Generative-Verifier RL and Population-Level Test-Time Scaling

Jiacheng Chen, Xinyu Zhang, Shunkai Zhang, et al.

Text Generation

SpatialClaw: Rethinking Action Interface for Agentic Spatial Reasoning

Seokju Cho, Ryo Hachiuma, Abhishek Badki, et al.

WeaveBench: A Long-Horizon, Real-World Benchmark for Computer-Use Agents with Hybrid Interfaces

Wanli Li, Bowen Zhou, Yunyao Yu, et al.

Benchmarks

MiniMax Sparse Attention

Xunhao Lai, Weiqi Xu, Yufeng Yang, et al.

EvoArena: Tracking Memory Evolution for Robust LLM Agents in Dynamic Environments

Jundong Xu, Qingchuan Li, Jiaying Wu, et al.

Flex4DHuman: Flexible Multi-view Video Diffusion for 4D Human Reconstruction

Jen-Hao Cheng, Yipeng Wang, Hao Zhang, et al.

Modality Forcing for Scalable Spatial Generation

Bardienus Pieter Duisterhof, Deva Ramanan, Jeffrey Ichnowski, et al.

Image Generation

From AGI to ASI

Artificial Intelligence

Tim Genewein, Matija Franklin, Alexander Lerchner, et al.

World Tracing: Generative Pixel-Aligned Geometry Beyond the Visible

3D Generation

Hao Zhang, Mohamed El Banani, Jen-Hao Cheng, et al.

Regularized f-Divergence Kernel Tests

Deep Learning

Mónica Ribero, Antonin Schrab, Arthur Gretton

Pretraining Recurrent Networks without Recurrence

Transformer

Model Training

Akarsh Kumar

Trajectory-Refined Distillation

Li Jiang, Haoran Xu, Yichuan Ding, et al.

Reinforcement Learning

MemDreamer: Decoupling Perception and Reasoning for Long Video Understanding via Hierarchical Graph Memory and Agentic Retrieval Mechanism

Video Understanding

Cong Chen, Guo Gan, Kaixiang Ji, et al.

SearchSwarm: Towards Delegation Intelligence in Agentic LLMs for Long-Horizon Deep Research

Pu Ning, Quan Chen, Kun Tao, et al.

Retrospective Harness Optimization: Improving LLM Agents via Self-Preference over Trajectory Rollouts

Wenbo Pan, Shujie Liu, Chin-Yew Lin, et al.

Role-Agent: Bootstrapping LLM Agents via Dual-Role Evolution

Xucong Wang, Ziyu Ma, Shidong Yang, et al.

ABot-Earth 0.5: Generative 3D Earth Model

3D Generation

3D Model

Ming Qian, Tianjian Ouyang, Mingchao Sun, et al.

Kwai Keye-VL-2.0 Technical Report

Video Understanding

Kwai Keye Team, Bin Wen, Changyi Liu, et al.

TESSERA: Temporal Embeddings of Surface Spectra for Earth Representation and Analysis

Multimodal Representation

Deep Learning

Zhengpeng Feng, Clement Atzberger, Sadiq Jaffer, et al.

If LLMs have human-like attributes, then so does Age of Empires II

Adrian de Wynter

The Last Human-Written Paper: Agent-Native Research Artifacts

Jiachen Liu, Jiaxin Pei, Jintao Huang, et al.

AI for Science

FlashMemory-DeepSeek-V4: Lightning Index Ultra-Long Context via Lookahead Sparse Attention

Yan Wang, Qifan Zhang, Jiachen Yu, et al.

DeepSeek

LatentSkill: From In-Context Textual Skills to In-Weight Latent Skills for LLM Agents

Aofan Yu, Chenyu Zhou, Tianyi Xu, et al.

CoVEBench: Can Video Editing Models Handle Complex Instructions?

Text-to-Video

Jiangtao Wu, Jiaming Wang, Yiwen He, et al.

Latent Spatial Memory for Video World Models

Weijie Wang, Haoyu Zhao, Yifan Yang, et al.

On the Geometry of On-Policy Distillation

Zhennan Shen, Yanshu Li, Qingyu Yin, et al.

Model Training

SWE-Explore: Benchmarking How Coding Agents Explore Repositories

Shaoqiu Zhang, Yuhang Wang, Jialiang Liang, et al.

Code Generation

VoxCPM2 Technical Report

Text-to-Speech

VoxCPM Team

LongCat-Video-Avatar 1.5 Technical Report

Meituan LongCat Team

ChartNet: A Million-Scale, High-Quality Multimodal Dataset for Robust Chart Understanding