AI SOTA Benchmarks
Latest AI model performance metrics, GPU benchmarks, and cutting-edge papers
Categories
Browse tasks by category
AI Model Performance Benchmarks
Performance metrics of mainstream AI models across various tasks, showcasing the state-of-the-art technology
Red Teaming
47 papers | 0 benchmarks
Backdoor Attack
36 papers | 0 benchmarks
Adversarial Defense
34 papers | 10 benchmarks
Handwritten Text Recognition
32 papers | 13 benchmarks
Open-Domain Question Answering
30 papers | 15 benchmarks
Target Speaker Extraction
49 papers | 0 benchmarks
Inference Optimization
48 papers | 0 benchmarks
Room Impulse Response (RIR)
46 papers | 0 benchmarks
Bandwidth Extension
45 papers | 6 benchmarks
Voice Cloning
44 papers | 0 benchmarks
Type prediction
44 papers | 3 benchmarks
Compiler Optimization
44 papers | 0 benchmarks
Chart Question Answering
41 papers | 3 benchmarks
Traffic Signal Control
40 papers | 0 benchmarks
Code Translation
37 papers | 2 benchmarks
Image Retrieval
50 papers | 56 benchmarks
Rgb-T Tracking
50 papers | 5 benchmarks
Colorization
50 papers | 2 benchmarks
Color Constancy
50 papers | 1 benchmarks
Human Dynamics
50 papers | 0 benchmarks
Graph Sampling
49 papers | 0 benchmarks
Graph Property Prediction
45 papers | 4 benchmarks
Jet Tagging
44 papers | 1 benchmarks
Triple Classification
44 papers | 1 benchmarks
Style Transfer
42 papers | 3 benchmarks
Ontology Matching
50 papers | 0 benchmarks
Explainable Artificial Intelligence (XAI)
49 papers | 1 benchmarks
Document Summarization
46 papers | 7 benchmarks
Knowledge Graphs
44 papers | 4 benchmarks
Knowledge Base Construction
44 papers | 0 benchmarks
multimodal
79 papers | 79 benchmarks
reasoning
61 papers | 57 benchmarks
understanding
47 papers | 48 benchmarks
other
35 papers | 33 benchmarks
knowledge
27 papers | 30 benchmarks
SSVEP
50 papers | 0 benchmarks
Pharmacovigilance
50 papers | 0 benchmarks
Skin Lesion Segmentation
48 papers | 3 benchmarks
Diabetic Retinopathy Detection
48 papers | 1 benchmarks
Metal Artifact Reduction
48 papers | 0 benchmarks
Bilevel Optimization
50 papers | 3 benchmarks
Classification
49 papers | 71 benchmarks
Computational Efficiency
49 papers | 1 benchmarks
Inductive Learning
49 papers | 0 benchmarks
Entity Embeddings
48 papers | 0 benchmarks
Deep Clustering
50 papers | 5 benchmarks
Physical Simulations
50 papers | 5 benchmarks
Multimodal Recommendation
50 papers | 5 benchmarks
Electrical Engineering
50 papers | 1 benchmarks
Misinformation
49 papers | 1 benchmarks
Music Classification
49 papers | 0 benchmarks
Music Information Retrieval
44 papers | 0 benchmarks
Voice Conversion
41 papers | 3 benchmarks
Music Transcription
40 papers | 6 benchmarks
Video Style Transfer
35 papers | 0 benchmarks
Word Alignment
50 papers | 7 benchmarks
Deep Clustering
50 papers | 5 benchmarks
Semantic Dependency Parsing
50 papers | 3 benchmarks
Sentence Ordering
49 papers | 1 benchmarks
Lemmatization
49 papers | 0 benchmarks
Offline RL
48 papers | 2 benchmarks
Car Racing
48 papers | 0 benchmarks
Real-Time Strategy Games
46 papers | 0 benchmarks
Game Design
43 papers | 0 benchmarks
Video Style Transfer
35 papers | 0 benchmarks
ARC
50 papers | 0 benchmarks
Discrete Choice Models
50 papers | 0 benchmarks
3D Human Reconstruction
48 papers | 10 benchmarks
Causal Identification
46 papers | 0 benchmarks
Common Sense Reasoning
45 papers | 24 benchmarks
Gesture Generation
47 papers | 4 benchmarks
Trajectory Planning
47 papers | 2 benchmarks
Robot Task Planning
46 papers | 3 benchmarks
Benchmarking
45 papers | 2 benchmarks
Visual Odometry
45 papers | 1 benchmarks
Spoken language identification
50 papers | 12 benchmarks
Speech Dereverberation
50 papers | 5 benchmarks
Acoustic Modelling
50 papers | 0 benchmarks
Speech Separation
49 papers | 19 benchmarks
Spoken Dialogue Systems
47 papers | 0 benchmarks
Time Series Prediction
50 papers | 2 benchmarks
Time Series Forecasting
49 papers | 86 benchmarks
Computational Efficiency
49 papers | 1 benchmarks
Activity Prediction
48 papers | 1 benchmarks
Predictive Process Monitoring
48 papers | 0 benchmarks
GPU Benchmarks
Latest GPU hardware and software performance evaluations to help you make informed hardware choices
Software Performance
DeepSeek-R1-Distill-Qwen-7B
Environment: vllm
DeepSeek-R1-Distill-Llama-8B
Environment: vllm
DeepSeek-R1-Distill-Qwen-14B
Environment: vllm
DeepSeek-R1-Distill-Qwen-32B
Environment: vllm
DeepSeek-R1-Distill-Llama-70B
Environment: vllm
DeepSeek-R1-Distill-Qwen-7B
Environment: sglang
DeepSeek-R1-Distill-Llama-8B
Environment: sglang
DeepSeek-R1-Distill-Qwen-14B
Environment: sglang
DeepSeek-R1-Distill-Qwen-32B
Environment: sglang
DeepSeek-R1-Distill-Llama-70B
Environment: sglang