Llm
Performance metrics of mainstream AI models across various tasks, showcasing the state-of-the-art technology
AI Model Performance Benchmarks
Performance metrics of mainstream AI models across various tasks, showcasing the state-of-the-art technology
multimodal
78 papers | 0 benchmarks
reasoning
60 papers | 0 benchmarks
understanding
47 papers | 0 benchmarks
other
35 papers | 0 benchmarks
knowledge
27 papers | 0 benchmarks
agent
24 papers | 0 benchmarks
code
20 papers | 0 benchmarks
math
20 papers | 0 benchmarks
language
19 papers | 0 benchmarks
examination
17 papers | 0 benchmarks
safety
17 papers | 0 benchmarks
strong reasoning
15 papers | 0 benchmarks
long-context
10 papers | 0 benchmarks
creation
6 papers | 0 benchmarks
instruct
5 papers | 0 benchmarks