Dataset Open Access

Catalan United Nations v1.0 test set

Marta R. Costa-jussà

Citation Style Language JSON Export

  "publisher": "Zenodo", 
  "DOI": "10.5281/zenodo.3888414", 
  "container_title": "ACM Transactions on Asian and Low-Resource Language Information Processing", 
  "language": "cat", 
  "title": "Catalan United Nations v1.0 test set", 
  "issued": {
    "date-parts": [
  "abstract": "<p>Catalan version [1] of the test set from the United Nations v1.0 [2]. The translation was performed in two steps: we did a first automatic translation from the Spanish test set version into Catalan and then a professional translator post-edited the output.</p>\n\n<p><br>\n[1] Marta R. Costa-Juss&agrave;, No&eacute; Casas, Carlos Escolano, and Jos&eacute; A. R. Fonollosa. 2019. Chinese-Catalan: A Neural Machine Translation Approach Based on Pivoting and Attention Mechanisms. <em>ACM Trans. Asian Low-Resour. Lang. Inf. Process.</em> 18, 4, Article 43 (August 2019), 8 pages. DOI:</p>\n\n<p>[2] Michal Ziemski, Marcin Junczys-Dowmunt, and Bruno Pouliquen. 2016. The United Nations parallel corpus v1.0. In<br>\nProceedings of the LREC, 2016</p>", 
  "author": [
      "family": "Marta R. Costa-juss\u00e0"
  "volume": "18", 
  "note": "This work is supported by the Spanish Ministerio de Econom\u00eda y Competitividad and European Regional Development\nFund, through the postdoctoral senior grant Ram\u00f3n y Cajal.", 
  "type": "dataset", 
  "issue": "4", 
  "id": "3888414"
All versions This version
Views 4040
Downloads 1010
Data volume 8.2 MB8.2 MB
Unique views 3838
Unique downloads 99


Cite as