HyperAI
HyperAI
Main
Home
GPU
Console
Docs
Pricing
Pulse
News
Resources
Papers
Notebooks
Datasets
Wiki
Benchmarks
SOTA
LLM Models
GPU Leaderboard
Community
Events
Utility
Search
About
Terms of Service
Privacy Policy
English
HyperAI
HyperAI
Toggle Sidebar
⌘
K
Command Palette
Search for a command to run...
Sign In
HyperAI
Papers
Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
6 months ago
Benchmarks
Preference Modeling
Reasoning
AI Infra
Method/Architecture
Natural Language Processing
Task/Problem
Summary
Paper
Benchmarks
Resources
opengvlab/multi-modality-arena
pytorch
lm-sys/routellm
pytorch
formulamonks/llm-benchmarker-suite
pytorch
ojiyumm/mt_bench_rwkv
pytorch
lm-sys/fastchat
Official
pytorch
ilyagusev/ping_pong_bench
theoremone/llm-benchmarker-suite
pytorch
PAIR-code/llm-comparator
tf
kuk/rulm-sbs2
dongping-chen/mllm-as-a-judge
pytorch
bjoernpl/fasteval
HyperAI
HyperAI
Main
Home
GPU
Console
Docs
Pricing
Pulse
News
Resources
Papers
Notebooks
Datasets
Wiki
Benchmarks
SOTA
LLM Models
GPU Leaderboard
Community
Events
Utility
Search
About
Terms of Service
Privacy Policy
English
HyperAI
HyperAI
Toggle Sidebar
⌘
K
Command Palette
Search for a command to run...
Sign In
HyperAI
Papers
Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
6 months ago
Benchmarks
Preference Modeling
Reasoning
AI Infra
Method/Architecture
Natural Language Processing
Task/Problem
Summary
Paper
Benchmarks
Resources
opengvlab/multi-modality-arena
pytorch
lm-sys/routellm
pytorch
formulamonks/llm-benchmarker-suite
pytorch
ojiyumm/mt_bench_rwkv
pytorch
lm-sys/fastchat
Official
pytorch
ilyagusev/ping_pong_bench
theoremone/llm-benchmarker-suite
pytorch
PAIR-code/llm-comparator
tf
kuk/rulm-sbs2
dongping-chen/mllm-as-a-judge
pytorch
bjoernpl/fasteval
556
556
4.6k
4.6k
48
48
0
0
39.4k
39.4k
115
115
48
48
520
520
61
61
89
89
1
1