Date

2 years ago

Size

3.09 MB

Organization

Publish URL

github.com

Tags

LLM

Intelligent Question Answering

Reasoning

The Pinocchio dataset was jointly created by researchers from Tsinghua University, University of Illinois at Chicago, and University of Cambridge. Its purpose is to comprehensively evaluate the performance of large language models (LLMs) in factual knowledge storage and reasoning capabilities. **This dataset covers 20,000 diverse factual questions covering different sources, timelines, domains, regions, and languages.**The dataset contains 7 different tasks to test LLMs’ ability to reason over multiple facts, handle structured and unstructured knowledge, identify subtle factual differences, and resist adversarial examples. Pinocchio provides researchers with a powerful tool to understand the capabilities of models at multiple levels while pushing the boundaries of LLMs’ ability to advance factual knowledge.

Pinocchio.torrent

Seeding 1Downloading 0Completed 130Total Downloads 193

Pinocchio/
- README.md
  1.46 KB
- README.txt
  2.92 KB

This dataset is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at [email protected] for prompt review and removal.

Related Datasets

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

Use this Dataset

Discuss on Discord

Date

2 years ago

Size

3.09 MB

Organization

Publish URL

github.com

Related Datasets

Nemotron Personas France (French Synthetic Personas Dataset)

2 months ago

Groundsource Global Flood Events Dataset

3 months ago

CHIMERA General Inference Synthetic Dataset

4 months ago

Nemotron-Personas-Brazil Brazilian Synthetic Character Dataset

5 months ago

RoVid-X Robot Video Generation Dataset

2 months ago

Patient Segmentation Dataset

5 months ago

TxT360-3efforts Multi-Task Inference Dataset

5 months ago

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

Pinocchio Pinocchio Factual Knowledge Evaluation Dataset

Build AI with AI

HyperAI Newsletters

Command Palette

Pinocchio Pinocchio Factual Knowledge Evaluation Dataset

Related Datasets

Nemotron Personas France (French Synthetic Personas Dataset)

Groundsource Global Flood Events Dataset

CHIMERA General Inference Synthetic Dataset

Nemotron-Personas-Brazil Brazilian Synthetic Character Dataset

RoVid-X Robot Video Generation Dataset

Patient Segmentation Dataset

TxT360-3efforts Multi-Task Inference Dataset

Build AI with AI

HyperAI Newsletters

Command Palette

Pinocchio Pinocchio Factual Knowledge Evaluation Dataset

Related Datasets

Nemotron Personas France (French Synthetic Personas Dataset)

Groundsource Global Flood Events Dataset

CHIMERA General Inference Synthetic Dataset

Nemotron-Personas-Brazil Brazilian Synthetic Character Dataset

RoVid-X Robot Video Generation Dataset

Patient Segmentation Dataset

TxT360-3efforts Multi-Task Inference Dataset

Build AI with AI

HyperAI Newsletters

Related Datasets

Nemotron Personas France (French Synthetic Personas Dataset)

Groundsource Global Flood Events Dataset

CHIMERA General Inference Synthetic Dataset

Nemotron-Personas-Brazil Brazilian Synthetic Character Dataset

RoVid-X Robot Video Generation Dataset

Patient Segmentation Dataset

TxT360-3efforts Multi-Task Inference Dataset

Related Datasets

Nemotron Personas France (French Synthetic Personas Dataset)

Groundsource Global Flood Events Dataset

CHIMERA General Inference Synthetic Dataset

Nemotron-Personas-Brazil Brazilian Synthetic Character Dataset

RoVid-X Robot Video Generation Dataset

Patient Segmentation Dataset

TxT360-3efforts Multi-Task Inference Dataset