Dataset Open Access

Catalan United Nations v1.0 test set

Marta R. Costa-jussà

JSON-LD ( Export

  "inLanguage": {
    "alternateName": "cat", 
    "@type": "Language", 
    "name": "Catalan"
  "description": "<p>Catalan version [1] of the test set from the United Nations v1.0 [2]. The translation was performed in two steps: we did a first automatic translation from the Spanish test set version into Catalan and then a professional translator post-edited the output.</p>\n\n<p><br>\n[1] Marta R. Costa-Juss&agrave;, No&eacute; Casas, Carlos Escolano, and Jos&eacute; A. R. Fonollosa. 2019. Chinese-Catalan: A Neural Machine Translation Approach Based on Pivoting and Attention Mechanisms. <em>ACM Trans. Asian Low-Resour. Lang. Inf. Process.</em> 18, 4, Article 43 (August 2019), 8 pages. DOI:</p>\n\n<p>[2] Michal Ziemski, Marcin Junczys-Dowmunt, and Bruno Pouliquen. 2016. The United Nations parallel corpus v1.0. In<br>\nProceedings of the LREC, 2016</p>", 
  "license": "", 
  "creator": [
      "affiliation": "Universitat Polit\u00e8cnica de Catalunya", 
      "@id": "", 
      "@type": "Person", 
      "name": "Marta R. Costa-juss\u00e0"
  "url": "", 
  "citation": [
      "@id": "", 
      "@type": "CreativeWork"
  "datePublished": "2020-06-10", 
  "keywords": [
    "Multilingual Parallel Data", 
    "United Nations"
  "@context": "", 
  "distribution": [
      "contentUrl": "", 
      "encodingFormat": "txt", 
      "@type": "DataDownload"
  "identifier": "", 
  "@id": "", 
  "@type": "Dataset", 
  "name": "Catalan United Nations v1.0 test set"
All versions This version
Views 4040
Downloads 1010
Data volume 8.2 MB8.2 MB
Unique views 3838
Unique downloads 99


Cite as