There is a newer version of the record available.

Published October 14, 2021 | Version v1
Dataset Open

Financial News dataset for text mining

  • 1. INRAE

Description

Please cite this dataset as:

Nicolas Turenne, Ziwei Chen, Guitao Fan, Jianlong Li, Yiwen Li, Siyuan Wang, Jiaqi Zhou (2021) Mining an English-Chinese parallel Corpus of Financial News, BNU HKBU UIC, technical report

 

The dataset comes from the Financial Times news website (https://www.ft.com/).

The news articles are written in both Chinese and English.

The dataset contains 60,473 bilingual documents.

The time range is from 2007 to 2020.

This dataset has been used for parallel bilingual news mining in the finance domain.

Notes

Turenne N et al (2021) Mining an English-Chinese parallel Corpus of Financial News

Files

FTIE.zip

Files (104.6 MB)

Name: FTIE.zip
Size: 104.6 MB
md5:25479a51e5749a2f462461c29e850844
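
The md5 value listed for FTIE.zip can be used to check that a downloaded copy is complete and uncorrupted. The following is a minimal sketch in Python, assuming the archive has been saved as FTIE.zip in the current working directory. The function names md5_of_file and archive_is_intact and the local path are illustrative choices, not part of the dataset.

```python
import hashlib
from pathlib import Path

# Checksum published on this record for FTIE.zip.
EXPECTED_MD5 = "25479a51e5749a2f462461c29e850844"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read 1 MiB at a time so the whole archive is never held in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def archive_is_intact(path, expected=EXPECTED_MD5):
    """Return True when the file's MD5 digest matches the expected value."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    archive = Path("FTIE.zip")  # assumed local download location
    if archive.exists():
        print("checksum OK" if archive_is_intact(archive) else "checksum MISMATCH")
    else:
        print(f"{archive} not found; download it first")
```

A mismatch usually means the download was interrupted or the file belongs to a different version of the record, so it is worth downloading again before unpacking.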

Additional details

References

  • Turenne N et al (2021) Mining an English-Chinese parallel Corpus of Financial News