SuperGPQA Subject Area Assessment Benchmark Dataset

Date

8 months ago

Organization

Paper URL

arxiv.org

License

Apache 2.0

SuperGPQA is a benchmark dataset for evaluating the performance of advanced question-answering systems. It was developed by the Multimodal Art Projection team in 2025 and accompanies the paper "SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines". The dataset is intended for evaluating natural language processing and machine learning models, and aims to test a model's reasoning ability and depth of knowledge through complex interdisciplinary problems.

The dataset covers 285 graduate-level subject areas, including biology, physics, chemistry, and other scientific fields, and contains diverse question types.
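
Because the benchmark spans many subject areas, results are often easier to read when broken down per subject rather than reported as a single number. The following is a minimal sketch of such a breakdown, not the benchmark's official scoring code. The record keys `subject`, `prediction`, and `answer` are hypothetical names chosen for illustration, and exact-match comparison is an assumption that may not suit every question type.

```python
from collections import defaultdict


def accuracy_by_subject(records):
    """Return {subject: fraction of records whose prediction matches the answer}.

    Each record is a dict with the hypothetical keys "subject",
    "prediction", and "answer" (not taken from the dataset's schema).
    """
    totals = defaultdict(int)
    correct = defaultdict(int)
    for rec in records:
        subject = rec["subject"]
        totals[subject] += 1
        # Assumed scoring rule: case-insensitive exact match after trimming whitespace.
        if rec["prediction"].strip().upper() == rec["answer"].strip().upper():
            correct[subject] += 1
    return {subject: correct[subject] / totals[subject] for subject in totals}


# Made-up records for illustration only; these are not real benchmark items.
sample = [
    {"subject": "physics", "prediction": "A", "answer": "A"},
    {"subject": "physics", "prediction": "b", "answer": "C"},
    {"subject": "biology", "prediction": " d", "answer": "D"},
]
scores = accuracy_by_subject(sample)
```

Reporting one score per subject makes it visible when a model does well in some fields and poorly in others, which a single aggregate accuracy would hide.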
