Dataset Open Access

Catalan United Nations v1.0 test set

Marta R. Costa-jussà

Catalan version [1] of the test set from the United Nations v1.0 [2]. The translation was performed in two steps: we did a first automatic translation from the Spanish test set version into Catalan and then a professional translator post-edited the output.


[1] Marta R. Costa-Jussà, Noé Casas, Carlos Escolano, and José A. R. Fonollosa. 2019. Chinese-Catalan: A Neural Machine Translation Approach Based on Pivoting and Attention Mechanisms. ACM Trans. Asian Low-Resour. Lang. Inf. Process. 18, 4, Article 43 (August 2019), 8 pages. DOI:https://doi.org/10.1145/3312575

[2] Michal Ziemski, Marcin Junczys-Dowmunt, and Bruno Pouliquen. 2016. The United Nations parallel corpus v1.0. In
Proceedings of the LREC, 2016

This work is supported by the Spanish Ministerio de Economía y Competitividad and European Regional Development Fund, through the postdoctoral senior grant Ramón y Cajal.
Files (820.4 kB)
Name Size
UN_test_ca.txt
md5:71dfb8ec7ea6a2bdb4615083f2eb3b71
820.4 kB Download
  • Costa-jussà, M.R., Casas, N., Escolano, C. and Fonollosa, J.A.R., Chinese-Catalan: A Neural Machine Translation Approach based on Pivoting and Attention Mechanisms, ACM Transactions on Asian and Low-Resource Language Information Processing, Vol 18, No 4, Art. 43, 2019

  • Michal Ziemski, Marcin Junczys-Dowmunt, and Bruno Pouliquen. 2016. The United Nations parallel corpus v1.0. In Proceedings of the LREC, 2016

8
5
views
downloads
All versions This version
Views 88
Downloads 55
Data volume 4.1 MB4.1 MB
Unique views 77
Unique downloads 55

Share

Cite as