DuReader Large-scale open-domain Chinese Machine Reading Comprehension Dataset
Date
3 years ago
Size
4.11 GB
Publish URL
Paper URL
License
Other
Tags

DuReader is a large-scale open-domain Chinese dataset for machine reading comprehension, which can be used to train or evaluate machine reading comprehension models and systems.
The dataset consists of 200,000 questions, 420,000 answers, and 1 million documents. The questions and documents are based on Baidu Search and Baidu Knows, and the answers are manually generated. The dataset also provides annotations on the question type, and each question is manually labeled with its classification: Entity, Description, YesNo, Fact or Opinion.
DuReader.torrent
Seeding 2Downloading 0Completed 423Total Downloads 747
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.
AI Co-coding
Ready-to-use GPUs
Best Pricing
Hyper Newsletters
Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp