Conference paper Open Access

Machine Translation as an Alternative to Language-Specific Dictionaries for LIWC

Yuying Ye; Peter Boot

 

This research presents an alternative approach for applying the text analysis program, Linguistic Inquiry and Word Count (LIWC) to non-English text: using machine translation (MT) to translate the text into English and then apply the English LIWC dictionary, rather than applying a translated LIWC dictionary to the original texts. We evaluate the alternative method using the output from two open-source MT systems and Google Translation on a set of TED Talk subtitle data. The method is tested on three language pairs: Dutch-English, German-English and Spanish-English. Provisionally, MT seems to lead to better results in the three language pairs than language-specific dictionaries.

Link to the slides: https://slides.com/yuyingye/deck-bc15f3/
Files (377.4 kB)
Name Size
Machine Translation as an Alternative to Language-Specific Dictionaries for LIWC.pdf
md5:70aabb0e9974239d157d30397f467bca
377.4 kB Download
164
225
views
downloads
All versions This version
Views 164164
Downloads 225225
Data volume 84.9 MB84.9 MB
Unique views 147147
Unique downloads 212212

Share

Cite as