Conference paper Open Access

A Case Study of Machine Translation in Financial Sentiment Analysis

Zhang, Chong; Capelletti, Matteo; Poulis, Alexandros; Stemann, Thorben; Nemcova, Jane

The European research project Social Sentiment Indices powered by X-Scores (SSIX) intends to allow Small and Medium-sized Enterprises (SMEs) to take advantage of social media sentiment data for the finance domain. The project aims to overcome language barriers and realize a financial sentiment platform capable of scoring textual data in different languages.

Our approach to achieve this goal takes maximum advantage of human translation while keeping costs low by incorporating machine translation. In the long run, we intend to provide a tool that helps SMEs to expand into new markets by analyzing multilingual social contents.

In this paper, we investigate how sentiment is preserved after machine translation. We built a sentiment gold standard corpus in English annotated by native financial experts, and then we translated the gold standard corpus into a target corpus (German) using one human translator and three machine translation engines (Microsoft, Google, and Google Neural Network) which are integrated in Geofluent to allow pre-/post-processing. We then conducted two experiments. One meant to evaluate the overall translation quality using the BLEU algorithm. The other intended to investigate which machine translation engines produce translations that preserve sentiment best.

Results suggest that sentiment transfer can be successful through machine translation if using Google and Google Neural Network in Geofluent. This is a crucial step towards achieving a multilingual sentiment platform in the domain of finance. Next, we plan to integrate language-specific processing rules to further enhance the performance of machine translation.

Files (478.0 kB)
Name Size
A-Case-Study-of-Machine-Translation-in-Financial-Sentiment-Analysis.pdf
md5:21ee95430886abee8811a0461646204b
478.0 kB Download
72
150
views
downloads
All versions This version
Views 7272
Downloads 150150
Data volume 71.7 MB71.7 MB
Unique views 6262
Unique downloads 139139

Share

Cite as