XQuAD Cross-Lingual Question Answering Dataset
Date
a year ago
Size
14 MB
License
CC BY 4.0
XQuAD (Cross-Lingual Question Answering Dataset) is a benchmark dataset for evaluating cross-lingual question answering performance. SQuAD v1.1 (Rajpurkar et al., 2016)The dataset consists of a subset of 240 passages and 1,190 question-answer pairs from the development set, which have been professionally translated into ten languages: Spanish, German, Greek, Russian, Turkish, Arabic, Vietnamese, Thai, Chinese, and Hindi. Thus, the dataset is fully parallelizable across 11 languages.
For details on how the dataset was created, please refer to the paper "On the Cross-lingual Transferability of Monolingual Representations".
XQuAD.torrent
Seeding 2Downloading 0Completed 121Total Downloads 287