4 months ago
GenWiki: A Dataset of 1.3 Million Content-Sharing Text and Graphs for Unsupervised Graph-to-Text Generation
{Zheng Zhang Xipeng Qiu Qipeng Guo Zhijing Jin}

Abstract
Data collection for the knowledge graph-to-text generation is expensive. As a result, research on unsupervised models has emerged as an active field recently. However, most unsupervised models have to use non-parallel versions of existing small supervised datasets, which largely constrain their potential. In this paper, we propose a large-scale, general-domain dataset, GenWiki. Our unsupervised dataset has 1.3M text and graph examples, respectively. With a human-annotated test set, we provide this new benchmark dataset for future research on unsupervised text generation from knowledge graphs.
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| unsupervised-kg-to-text-generation-on-genwiki | CycleGT_Warm | BLEU: 41.35 CIDEr: 3.45 METEOR: 35.20 ROUGE-L: 63.01 |
| unsupervised-kg-to-text-generation-on-genwiki | Rule-Based | BLEU: 13.45 CIDEr: 1.26 METEOR: 30.72 ROUGE-L: 40.93 |
| unsupervised-kg-to-text-generation-on-genwiki | NoisySupervised | BLEU: 30.12 CIDEr: 2.52 METEOR: 28.12 ROUGE-L: 56.96 |
| unsupervised-kg-to-text-generation-on-genwiki | CycleGT_Base | BLEU: 41.59 CIDEr: 3.57 METEOR: 35.72 ROUGE-L: 63.31 |
| unsupervised-kg-to-text-generation-on-genwiki | DirectTransfer | BLEU: 13.89 CIDEr: 1.26 METEOR: 25.76 ROUGE-L: 39.75 |
| unsupervised-kg-to-text-generation-on-genwiki-1 | CycleGT_Warm | BLEU: 40.47 CIDEr: 3.48 METEOR: 34.84 ROUGE-L: 63.40 |
| unsupervised-kg-to-text-generation-on-genwiki-1 | CycleGT_Base | BLEU: 41.29 CIDEr: 3.53 METEOR: 35.39 ROUGE-L: 63.73 |
| unsupervised-kg-to-text-generation-on-genwiki-1 | DirectTransfer | BLEU: 13.89 CIDEr: 1.26 METEOR: 25.76 ROUGE-L: 39.75 |
| unsupervised-kg-to-text-generation-on-genwiki-1 | Rule-Based | BLEU: 13.45 CIDEr: 1.26 METEOR: 30.72 ROUGE-L: 40.93 |
| unsupervised-kg-to-text-generation-on-genwiki-1 | NoisySupervised | BLEU: 35.03 CIDEr: 2.63 METEOR: 33.45 ROUGE-L: 58.14 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.
AI Co-coding
Ready-to-use GPUs
Best Pricing
Hyper Newsletters
Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp