Published May 13, 2020 | Version v1
Conference paper Open

Dataset for Temporal Analysis of English-French Cognates

  • 1. University of La Rochelle, L3i Laboratory,
  • 2. Kyoto University
  • 3. University of Helsinki

Description

Languages change over time and, thanks to abundance of digital corpora, their evolutionary analysis using computational techniques has recently gained much research attention. In this paper, we focus on creating a database to investigate the similarity in evolution between different languages. We look in particular into the similarities and differences between the use of corresponding words across time in English and French, two languages from different linguistic families yet with shared syntax and close contact. To analyze this evolution, we select a set of cognates in both languages and study their temporal changes and correlations. We propose a new database for computational approaches of synchronized diachronic investigation of language pairs, and subsequent novel findings stemming from the cognates temporal comparison of the two chosen languages. To the best of our knowledge, the present study is the first in the literature to use computational approaches and large data to make a cross-language temporal analysis.

Files

LREC_Temporal_Study.pdf

Files (649.6 kB)

Name Size Download all
md5:dfa800d83f8db46c2e7a75944c723980
649.6 kB Preview Download

Additional details

Funding

NewsEye – NewsEye: A Digital Investigator for Historical Newspapers 770299
European Commission
EMBEDDIA – Cross-Lingual Embeddings for Less-Represented Languages in European News Media 825153
European Commission