Published October 1, 2019 | Version v1
Conference paper Open

Automating Dictionary Production: a Tagalog-English-Korean Dictionary from Scratch

Description

In this paper we present lexicographic work on a Tagalog-English-Korean dictionary. The dictionary is created entirely from scratch and all of its content (besides audio pronunciation) is initially generated fully automatically from a large web corpus that we built for these purposes, and then post-edited by human editors. The full size of the dictionary is 45,000 entries, out of which 15,000 most frequent entries are manually post-edited, while the remaining 30,000 entries are left only as automated. The project is currently ongoing and will be finished in December 2019. The dictionary will be part of the online platform run by the Naver Corporation1 and freely available.

Files

eLex_2019_45.pdf

Files (627.3 kB)

Name Size Download all
md5:430f58ee1337f4635e762ebe81b73640
627.3 kB Preview Download

Additional details

Funding

ELEXIS – European Lexicographic Infrastructure 731015
European Commission