Dataset Open Access
Catalan version [1] of the test set from the United Nations Parallel Corpus v1.0 [2]. The translation was produced in two steps: we first translated the Spanish version of the test set into Catalan automatically, and a professional translator then post-edited the output.
[1] Marta R. Costa-Jussà, Noé Casas, Carlos Escolano, and José A. R. Fonollosa. 2019. Chinese-Catalan: A Neural Machine Translation Approach Based on Pivoting and Attention Mechanisms. ACM Trans. Asian Low-Resour. Lang. Inf. Process. 18, 4, Article 43 (August 2019), 8 pages. DOI:https://doi.org/10.1145/3312575
[2] Michal Ziemski, Marcin Junczys-Dowmunt, and Bruno Pouliquen. 2016. The United Nations parallel corpus v1.0. In Proceedings of the LREC, 2016
Name | MD5 checksum | Size |
---|---|---|
UN_test_ca.txt | 71dfb8ec7ea6a2bdb4615083f2eb3b71 | 820.4 kB |
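The MD5 checksum listed for UN_test_ca.txt can be used to confirm that a downloaded copy is intact. The following is a minimal sketch in Python, not part of the dataset itself: the function names `md5_of_file` and `matches_checksum` are hypothetical, and the script assumes the file has been saved as `UN_test_ca.txt` in the current working directory.

```python
import hashlib
from pathlib import Path

# Checksum as published in the file listing for UN_test_ca.txt.
EXPECTED_MD5 = "71dfb8ec7ea6a2bdb4615083f2eb3b71"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 equals the expected hex digest (case-insensitive)."""
    return md5_of_file(path) == expected.strip().lower()


if __name__ == "__main__":
    # Assumed local download location; adjust to wherever the file was saved.
    local_copy = Path("UN_test_ca.txt")
    if local_copy.exists():
        print("checksum OK" if matches_checksum(local_copy) else "checksum MISMATCH")
    else:
        print(f"{local_copy} not found; download it first")
```

A mismatch usually indicates an incomplete or corrupted download, in which case fetching the file again is the simplest remedy.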