HyperAI

AI SOTA Benchmarks

Latest AI model performance metrics, GPU benchmarks, and cutting-edge papers

AI Model Performance Benchmarks

Performance metrics of mainstream AI models across a range of tasks, showing the current state of the art

Red Teaming

47 papers | 0 benchmarks

Backdoor Attack

36 papers | 0 benchmarks

Adversarial Defense

34 papers | 10 benchmarks

Handwritten Text Recognition

32 papers | 13 benchmarks

Open-Domain Question Answering

30 papers | 15 benchmarks

Target Speaker Extraction

49 papers | 0 benchmarks

Inference Optimization

48 papers | 0 benchmarks

Room Impulse Response (RIR)

46 papers | 0 benchmarks

Bandwidth Extension

45 papers | 6 benchmarks

Voice Cloning

44 papers | 0 benchmarks

Type Prediction

44 papers | 3 benchmarks

Compiler Optimization

44 papers | 0 benchmarks

Chart Question Answering

41 papers | 3 benchmarks

Traffic Signal Control

40 papers | 0 benchmarks

Code Translation

37 papers | 2 benchmarks

Image Retrieval

50 papers | 56 benchmarks

RGB-T Tracking

50 papers | 5 benchmarks

Colorization

50 papers | 2 benchmarks

Color Constancy

50 papers | 1 benchmark

Human Dynamics

50 papers | 0 benchmarks

Graph Sampling

49 papers | 0 benchmarks

Graph Property Prediction

45 papers | 4 benchmarks

Jet Tagging

44 papers | 1 benchmark

Triple Classification

44 papers | 1 benchmark

Style Transfer

42 papers | 3 benchmarks

Ontology Matching

50 papers | 0 benchmarks

Explainable Artificial Intelligence (XAI)

49 papers | 1 benchmark

Document Summarization

46 papers | 7 benchmarks

Knowledge Graphs

44 papers | 4 benchmarks

Knowledge Base Construction

44 papers | 0 benchmarks

Multimodal

79 papers | 79 benchmarks

Reasoning

61 papers | 57 benchmarks

Understanding

47 papers | 48 benchmarks

Other

35 papers | 33 benchmarks

Knowledge

27 papers | 30 benchmarks

SSVEP

50 papers | 0 benchmarks

Pharmacovigilance

50 papers | 0 benchmarks

Skin Lesion Segmentation

48 papers | 3 benchmarks

Diabetic Retinopathy Detection

48 papers | 1 benchmark

Metal Artifact Reduction

48 papers | 0 benchmarks

Bilevel Optimization

50 papers | 3 benchmarks

Classification

49 papers | 71 benchmarks

Computational Efficiency

49 papers | 1 benchmark

Inductive Learning

49 papers | 0 benchmarks

Entity Embeddings

48 papers | 0 benchmarks

Deep Clustering

50 papers | 5 benchmarks

Physical Simulations

50 papers | 5 benchmarks

Multimodal Recommendation

50 papers | 5 benchmarks

Electrical Engineering

50 papers | 1 benchmark

Misinformation

49 papers | 1 benchmark

Music Classification

49 papers | 0 benchmarks

Music Information Retrieval

44 papers | 0 benchmarks

Voice Conversion

41 papers | 3 benchmarks

Music Transcription

40 papers | 6 benchmarks

Video Style Transfer

35 papers | 0 benchmarks

Word Alignment

50 papers | 7 benchmarks

Deep Clustering

50 papers | 5 benchmarks

Semantic Dependency Parsing

50 papers | 3 benchmarks

Sentence Ordering

49 papers | 1 benchmark

Lemmatization

49 papers | 0 benchmarks

Offline RL

48 papers | 2 benchmarks

Car Racing

48 papers | 0 benchmarks

Real-Time Strategy Games

46 papers | 0 benchmarks

Game Design

43 papers | 0 benchmarks

Video Style Transfer

35 papers | 0 benchmarks

ARC

50 papers | 0 benchmarks

Discrete Choice Models

50 papers | 0 benchmarks

3D Human Reconstruction

48 papers | 10 benchmarks

Causal Identification

46 papers | 0 benchmarks

Common Sense Reasoning

45 papers | 24 benchmarks

Gesture Generation

47 papers | 4 benchmarks

Trajectory Planning

47 papers | 2 benchmarks

Robot Task Planning

46 papers | 3 benchmarks

Benchmarking

45 papers | 2 benchmarks

Visual Odometry

45 papers | 1 benchmark

Spoken Language Identification

50 papers | 12 benchmarks

Speech Dereverberation

50 papers | 5 benchmarks

Acoustic Modelling

50 papers | 0 benchmarks

Speech Separation

49 papers | 19 benchmarks

Spoken Dialogue Systems

47 papers | 0 benchmarks

Time Series Prediction

50 papers | 2 benchmarks

Time Series Forecasting

49 papers | 86 benchmarks

Computational Efficiency

49 papers | 1 benchmark

Activity Prediction

48 papers | 1 benchmark

Predictive Process Monitoring

48 papers | 0 benchmarks

GPU Benchmarks

Latest GPU hardware and software performance evaluations to help you make informed hardware choices

Software Performance

DeepSeek-R1-Distill-Qwen-7B
Environment: vllm

DeepSeek-R1-Distill-Llama-8B
Environment: vllm

DeepSeek-R1-Distill-Qwen-14B
Environment: vllm

DeepSeek-R1-Distill-Qwen-32B
Environment: vllm

DeepSeek-R1-Distill-Llama-70B
Environment: vllm

DeepSeek-R1-Distill-Qwen-7B
Environment: sglang

DeepSeek-R1-Distill-Llama-8B
Environment: sglang

DeepSeek-R1-Distill-Qwen-14B
Environment: sglang

DeepSeek-R1-Distill-Qwen-32B
Environment: sglang

DeepSeek-R1-Distill-Llama-70B
Environment: sglang