Published November 26, 2017 | Version v1
Dataset
Open
EuroparlExtract - Directional Parallel Corpora Extracted from the European Parliament Proceedings Parallel Corpus
Description
This dataset contains directional parallel corpora extracted from the European Parliament Proceedings Parallel Corpus (Europarl) v7, created by Philipp Koehn (see http://www.statmt.org/europarl/). The corpora were extracted with the EuroparlExtract corpus processing toolkit by Michael Ustaszewski (2017). EuroparlExtract is freely available under the MIT License (see https://github.com/mustaszewski/europarl-extract).
Files
(2.6 GB)
Name | Size |
---|---|
md5:4e7d73c4f690faac09870310b343638a | 2.6 GB |
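
The MD5 checksum listed for the file can be used to confirm that a 2.6 GB download completed without corruption. The following is a minimal sketch, not part of the dataset or of the EuroparlExtract toolkit: the function names and the example file name are hypothetical, and only the checksum value is taken from this record.

```python
import hashlib

# Checksum as listed on this record (without the "md5:" prefix).
EXPECTED_MD5 = "4e7d73c4f690faac09870310b343638a"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks
    so that a multi-gigabyte archive is never loaded into memory at once."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest matches the expected value."""
    return md5_of_file(path) == expected.lower()
```

Usage would be along the lines of `verify_download("downloaded_archive")`, where `downloaded_archive` stands for whatever local name the downloaded file was saved under. A `False` result suggests the download should be repeated.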