6 months ago

Jakob Uszkoreit Andrew M. Dai Ming-Wei Chang Chris Alberti Tom Kwiatkowski Slav Petrov Michael Collins Matthew Kelcey Llion Jones Jennimaria Palomaki

Abstract

We present the Natural Questions corpus, a question answering dataset. Questions consist of real anonymized, aggregated queries issued to the Google search engine. An annotator is presented with a question along with a Wikipedia page from the top 5 search results, and annotates a long answer (typically a paragraph) and a short answer (one or more entities) if present on the page, or marks null if no long/short answer is present. The public release consists of 307,373 training examples with single annotations, 7,830 examples with 5-way annotations for development data, and a further 7,842 examples 5-way annotated sequestered as test data. We present experiments validating quality of the data. We also describe analysis of 25-way annotations on 302 examples, giving insights into human variability on the annotation task. We introduce robust metrics for the purposes of evaluating question answering systems; demonstrate high human upper bounds on these metrics; and establish baseline results using competitive methods drawn from related literature.

Source PDF View Code

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

6 months ago

Intelligent Question Answering

Dataset

Benchmarks

AI Infra

Natural Language Processing

Task/Problem

Jakob Uszkoreit Andrew M. Dai Ming-Wei Chang Chris Alberti Tom Kwiatkowski Slav Petrov Michael Collins Matthew Kelcey Llion Jones Jennimaria Palomaki

Abstract

Source PDF View Code

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

6 months ago

Intelligent Question Answering

Dataset

Benchmarks

AI Infra

Natural Language Processing

Task/Problem

Jakob Uszkoreit Andrew M. Dai Ming-Wei Chang Chris Alberti Tom Kwiatkowski Slav Petrov Michael Collins Matthew Kelcey Llion Jones Jennimaria Palomaki

Abstract

Source PDF View Code

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

Natural Questions: a Benchmark for Question Answering Research

Jakob Uszkoreit Andrew M. Dai Ming-Wei Chang Chris Alberti Tom Kwiatkowski Slav Petrov Michael Collins Matthew Kelcey Llion Jones Jennimaria Palomaki8 more

Abstract

Build AI with AI

HyperAI Newsletters

Command Palette

Natural Questions: a Benchmark for Question Answering Research

Jakob Uszkoreit Andrew M. Dai Ming-Wei Chang Chris Alberti Tom Kwiatkowski Slav Petrov Michael Collins Matthew Kelcey Llion Jones Jennimaria Palomaki8 more

Abstract

Build AI with AI

HyperAI Newsletters

Command Palette

Natural Questions: a Benchmark for Question Answering Research

Jakob Uszkoreit Andrew M. Dai Ming-Wei Chang Chris Alberti Tom Kwiatkowski Slav Petrov Michael Collins Matthew Kelcey Llion Jones Jennimaria Palomaki8 more

Abstract

Build AI with AI

HyperAI Newsletters

Jakob Uszkoreit Andrew M. Dai Ming-Wei Chang Chris Alberti Tom Kwiatkowski Slav Petrov Michael Collins Matthew Kelcey Llion Jones Jennimaria Palomaki

Jakob Uszkoreit Andrew M. Dai Ming-Wei Chang Chris Alberti Tom Kwiatkowski Slav Petrov Michael Collins Matthew Kelcey Llion Jones Jennimaria Palomaki

Jakob Uszkoreit Andrew M. Dai Ming-Wei Chang Chris Alberti Tom Kwiatkowski Slav Petrov Michael Collins Matthew Kelcey Llion Jones Jennimaria Palomaki