Dataset Open Access
Mónica Marrero; Antoine Isaac; Nuno Freire
The dataset contains all the data required to reproduce the experiments reported in the paper "Automatic translation and multilingual cultural heritage retrieval: a case study with transcriptions in Europeana", published at the 25th International Conference on Theory and Practice of Digital Libraries (TPDL'21). In that work we ran an experiment using the Europeana cultural heritage (CH) digital library as a use case, and we evaluated the effectiveness of a multilingual information retrieval strategy that uses machine translation into English as a pivot language. We used the CEF translation service (eTranslation) to translate both queries and content into English (https://ec.europa.eu/cefdigital/wiki/display/CEFDIGITAL/eTranslation).
The dataset is also available at https://rnd-2.eanadev.org/share/crosslingual-search/, and it is organized in four main folders:
Name | Size | Checksum |
---|---|---|
crosslingual-search.zip | 34.2 MB | md5:bae6701105224fddc33da9907accd27e |
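The published MD5 checksum can be used to confirm that the archive downloaded intact. The following is a minimal sketch, assuming the file was saved locally as `crosslingual-search.zip`; the function names `md5_of_file` and `verify_archive` are illustrative and not part of the dataset.

```python
import hashlib

# Checksum published alongside crosslingual-search.zip on this page.
EXPECTED_MD5 = "bae6701105224fddc33da9907accd27e"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hexadecimal MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large archives are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_archive(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest matches the expected value."""
    return md5_of_file(path) == expected.lower()
```

Calling `verify_archive("crosslingual-search.zip")` (the local path is an assumption) returns `True` only when the digest matches the published value. MD5 is suitable here for detecting a corrupted or truncated download, not as a security guarantee.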