Published December 29, 2025 | Version v1
Publication Open

Investigating causal networks of dementia using causal discovery and natural language processing models

  • 1. ROR icon Imperial College London
  • 2. ROR icon University of Manchester

Description

Comprehensively studying modifiable risk factors to understand their contributions to dementia mechanisms is imperative. This study used natural language processing (NLP) models to pre-select candidate risk factors for dementia from 5505 baseline variables in the UK Biobank. We then applied causal discovery approaches to examine the relationships among the selected variables and their links to dementia in later life, presenting these connections in a causal network. We identified eight risk factors that directly or indirectly influence dementia, with mental disorders due to brain dysfunction (ICD-10 F06) acting as direct causes and mediators in pathways from other neurological disorders to dementia. Although evidence for the direct link between biological age and dementia was less pronounced, its potential value in dementia management remains non-negligible. This study advances our understanding of dementia mechanisms and highlights the potential of NLP and machine learning for the causal discovery of complex diseases from high-dimensional data.

Files

Zenodo.png

Files (86.5 kB)

Name Size Download all
md5:ab6715947e62ea4dce1dc2a9e1c0ba50
86.5 kB Preview Download

Additional details

Related works

Is identical to
Publication: 10.1038/s44400-025-00006-2 (DOI)

Dates

Submitted
2025-05