Source code for: Predicting standardized absolute returns using rolling-sample textual modelling
Authors/Creators
Description
Abstract
Understanding how textual information impacts financial market volatility has been one of the growing topics in financial econometric research. In this paper, we aim to examine the relationship between the volatility measure that is extracted from GARCH modeling and textual news information that is both publicly available and from subscription and the performances of the two datasets are also brought into comparison. We utilize latent Dirichlet allocation method to capture the dynamic features of the textual data overtime by summarizing their statistical outputs, such as topic distributions in documents and word distributions in topics. In addition, we transform various measures representing the popularity and diversity of topics to form predictors for rolling regression model to assess the usefulness of textual information. The proposed method captures the statistical properties of textual information from different time periods and its performance is evaluated in an out-of-sample analysis.
Our results show that the topic measures can be more useful for predicting our volatility proxy, the unexplained variance from GARCH model than the simple moving average. The finding indicates that our method can be helpful in extracting significant textual information to improve the prediction of stock market volatility.
This work is licensed under CC0 1.0 Universal (CC0 1.0).
Files
fintech-analytics/text-analytics-v1.0.zip
Files
(25.9 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:80600f7ea2a2149d7db8dae71a8b2fcd
|
25.9 MB | Preview Download |
Additional details
Related works
- Is supplement to
- https://github.com/fintech-analytics/text-analytics/tree/v1.0 (URL)