Published November 8, 2021 | Version v1.0
Software Open

Source code for: Predicting standardized absolute returns using rolling-sample textual modelling

Description

Abstract

Understanding how textual information impacts financial market volatility has been one of the growing topics in financial econometric research. In this paper, we aim to examine the relationship between the volatility measure that is extracted from GARCH modeling and textual news information that is both publicly available and from subscription and the performances of the two datasets are also brought into comparison. We utilize latent Dirichlet allocation method to capture the dynamic features of the textual data overtime by summarizing their statistical outputs, such as topic distributions in documents and word distributions in topics. In addition, we transform various measures representing the popularity and diversity of topics to form predictors for rolling regression model to assess the usefulness of textual information. The proposed method captures the statistical properties of textual information from different time periods and its performance is evaluated in an out-of-sample analysis.

Our results show that the topic measures can be more useful for predicting our volatility proxy, the unexplained variance from GARCH model than the simple moving average. The finding indicates that our method can be helpful in extracting significant textual information to improve the prediction of stock market volatility.

 

This work is licensed under CC0 1.0 Universal (CC0 1.0).

 

Files

fintech-analytics/text-analytics-v1.0.zip

Files (25.9 MB)

Name Size Download all
md5:80600f7ea2a2149d7db8dae71a8b2fcd
25.9 MB Preview Download

Additional details