Command Palette
Search for a command to run...
XQuAD Cross-Lingual Question Answering Dataset
Date
Size
Paper URL
License
CC BY 4.0
XQuAD (Cross-Lingual Question Answering Dataset) is a benchmark dataset for evaluating cross-lingual question answering performance. SQuAD v1.1 (Rajpurkar et al., 2016)The dataset consists of a subset of 240 passages and 1,190 question-answer pairs from the development set, which have been professionally translated into ten languages: Spanish, German, Greek, Russian, Turkish, Arabic, Vietnamese, Thai, Chinese, and Hindi. Thus, the dataset is fully parallelizable across 11 languages.
For details on how the dataset was created, please refer to the paper "On the Cross-lingual Transferability of Monolingual Representations".
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.