Date

7 years ago

Size

2.42 GB

Organization

Publish URL

s3.amazonaws.com

Tags

Natural Language Processing

WMT 2015 French/English parallel texts is a French/English parallel text dataset used to train translation models. It has more than 20 million French and English sentences. This dataset was created by Chris Callison-Burch, who crawled millions of web pages and converted French URLs to English URLs using a simple set of heuristics, assuming that these documents are translations of each other. The dataset was jointly released in 2009 by Johns Hopkins University, the University of Edinburgh, and the University of Amsterdam.

WMT 2015 French-English parallel texts.torrent

Seeding 2Downloading 0Completed 1,061Total Downloads 1,714

WMT 2015 French-English parallel texts/
- README.md
  1.15 KB
- README.txt
  2.31 KB

This dataset is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at [email protected] for prompt review and removal.