Published June 10, 2020 | Version v1
Dataset Open

Catalan United Nations v1.0 test set

  • 1. Universitat Politècnica de Catalunya

Description

Catalan version [1] of the test set from the United Nations Parallel Corpus v1.0 [2]. The translation was performed in two steps: the Spanish version of the test set was first machine-translated into Catalan, and a professional translator then post-edited the output.


[1] Marta R. Costa-Jussà, Noé Casas, Carlos Escolano, and José A. R. Fonollosa. 2019. Chinese-Catalan: A Neural Machine Translation Approach Based on Pivoting and Attention Mechanisms. ACM Trans. Asian Low-Resour. Lang. Inf. Process. 18, 4, Article 43 (August 2019), 8 pages. DOI:https://doi.org/10.1145/3312575

[2] Michal Ziemski, Marcin Junczys-Dowmunt, and Bruno Pouliquen. 2016. The United Nations Parallel Corpus v1.0. In Proceedings of LREC 2016.

Notes

This work is supported by the Spanish Ministerio de Economía y Competitividad and the European Regional Development Fund through the Ramón y Cajal senior postdoctoral grant.

Files

Files (820.4 kB)

Name: UN_test_ca.txt
Size: 820.4 kB
md5:71dfb8ec7ea6a2bdb4615083f2eb3b71
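
The MD5 checksum listed for UN_test_ca.txt can be used to confirm that a downloaded copy is intact before it is used for evaluation. The following is a minimal sketch, not part of the record itself. The helper names md5_of_file and load_sentences are hypothetical, and the code assumes the file is UTF-8 text with one sentence per line, which the record does not state.

```python
import hashlib

# Checksum published on the record for UN_test_ca.txt.
EXPECTED_MD5 = "71dfb8ec7ea6a2bdb4615083f2eb3b71"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to bound memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def load_sentences(path, expected_md5=EXPECTED_MD5):
    """Verify the checksum, then return the file's lines as a list of strings.

    Assumption: UTF-8 encoding and one sentence per line.
    """
    actual = md5_of_file(path)
    if actual != expected_md5:
        raise ValueError(
            f"checksum mismatch for {path}: expected {expected_md5}, got {actual}"
        )
    with open(path, encoding="utf-8") as handle:
        return [line.rstrip("\n") for line in handle]
```

With a local copy of the file, load_sentences("UN_test_ca.txt") would return the sentences, or raise ValueError if the download is corrupted or has been modified.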

Additional details

Related works

Cites
Conference paper: https://www.aclweb.org/anthology/L16-1561 (URL)
Is compiled by
Journal article: 10.1145/3312575 (DOI)

References

  • Marta R. Costa-jussà, Noé Casas, Carlos Escolano, and José A. R. Fonollosa. 2019. Chinese-Catalan: A Neural Machine Translation Approach Based on Pivoting and Attention Mechanisms. ACM Transactions on Asian and Low-Resource Language Information Processing 18, 4, Article 43.
  • Michal Ziemski, Marcin Junczys-Dowmunt, and Bruno Pouliquen. 2016. The United Nations Parallel Corpus v1.0. In Proceedings of LREC 2016.