Published November 26, 2017 | Version v1
Dataset Open

EuroparlExtract - Directional Parallel Corpora Extracted from the European Parliament Proceedings Parallel Corpus

  • 1. University of Innsbruck

Description

This dataset contains directional parallel corpora extracted from the European Parliament Proceedings Parallel Corpus (Europarl) v7, created by Philipp Koehn (see http://www.statmt.org/europarl/). The corpora were extracted with the EuroparlExtract corpus processing toolkit by Michael Ustaszewski (2017). EuroparlExtract is freely available under the MIT License (see https://github.com/mustaszewski/europarl-extract).

Files

Name Size
md5:4e7d73c4f690faac09870310b343638a 2.6 GB
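
The MD5 checksum listed above can be used to check that a download completed without corruption. The following is a minimal sketch in Python, not part of the dataset or of EuroparlExtract. The file name in the usage section is hypothetical, since the record does not show the file's actual name, and the helper names `md5_of_file` and `matches_expected` are introduced here for illustration only.

```python
import hashlib
from pathlib import Path

# Checksum as listed in the Files section of this record.
EXPECTED_MD5 = "4e7d73c4f690faac09870310b343638a"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest equals the expected hex string."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Hypothetical local file name (an assumption); replace it with the
    # path of the file actually downloaded from this record.
    archive = Path("europarl_extract_download.bin")
    if archive.exists():
        print("checksum OK" if matches_expected(archive) else "checksum MISMATCH")
    else:
        print(f"{archive} not found")
```

Reading the file in chunks keeps memory use low for a 2.6 GB download. A mismatch usually means the transfer was incomplete and the file should be downloaded again.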