Published July 19, 2021 | Version v1
Report Open

Status of NLP for Vertical Mongolian

  • 1. MIASU, Cambridge University
  • 2. SOAS, University of London
  • 1. SOAS, University of London
  • 2. Theoretical and Applied Linguistics, Cambridge
  • 3. MIASU, Cambridge University

Description

In June 2019 the Mongolian and Inner Asian Studies Unit at the Department of Social Anthropology, Cambridge University, carried out a six-month incubator project to assess the state of available NLP tools for use with vertical Mongolian (Uyghur script). The project, funded by Cambridge Language Sciences, was interested primarily in the use of these tools for the study of contemporary newspapers and media published in vertical Mongolian inside the PRC.  

The project produced two reports on the status of NLP and NER for Vertical Mongolian as of November 2019. The reports were written by Sargai Yunshaab ("Survey of Natural Language Processing for Vertical Mongolian: Current Situation,) and Christian Faggionato ("Testing Available Mongolian NER tools and Future Perspectives"), with support from Robert Barnett (Project Coordinator). Project advisors were Hildegard Diemberger (MIASU, Cambridge), Nathan Hill  (SOAS, London), Marieke Meelen (Cambridge), and Thomas White (MIASU, Cambridge).  

The project's findings and data regarding the development of NER for modern Tibetan are available at DOI: 10.5281/zenodo.4536516.. 

Files

Report 1 - Survey of NLP for Vertical Mongolian-Current Situation.pdf