Published June 5, 2026 | Version v3

The intrinsic geometry of reading

  • 1. ROR icon Centre National de la Recherche Scientifique
  • 2. ROR icon Wellcome Centre for Integrative Neuroimaging
  • 1. EDMO icon University of Leipzig
  • 2. Max Planck School of Cognition, Leipzig, Germany.
  • 3. ROR icon Wellcome Centre for Integrative Neuroimaging
  • 4. ROR icon Centre National de la Recherche Scientifique
  • 5. ROR icon Inria Saclay - Île de France
  • 6. ROR icon University of Surrey
  • 7. Fundación para la Lucha contra las Enfermedades Neurológicas de la Infancia
  • 8. Kings College London
  • 9. ROR icon Queen's University
  • 10. ROR icon University of York
  • 11. Full brain picture Analytics, Leiden, The Netherlands

Description

Data Repository: The Intrinsic Geometry of Reading

Data files associated with the manuscript "The intrinsic geometry of reading."

The analysis code and notebooks are available in the associated GitHub repository: https://github.com/NeuroanatomyAndConnectivity/IntrinsicGeometryOfReading

Contents

Model results for Notebooks 2–5

data_nb_2_5.tar.gz contains the outputs of pre-run CPM models needed to reproduce the model performance plots in Notebooks 2–5. To use:

  1. Unpack the archive into the data/ folder inside your local clone of the code repository:
tar -xzf data_nb_2_5.tar.gz -C /path/to/IntrinsicGeometryOfReading/data/
  1. Update the data_dir path variable at the top of each notebook to point to the unpacked location:
data_dir = '/path/to/IntrinsicGeometryOfReading/data/'

Note: Data for Notebook 6 (Supplementary Figures S2–S5) is not included here due to file size. It is available upon request.

Raw data for recreating datasets (Schaefer 400 parcellation)

All raw data needed to recreate the analysis datasets from scratch are provided as CSVs for the Schaefer 400 parcellation. These can be used with Notebook 1 (nb1_make_a_dataset.ipynb) to rebuild subject-level dataset objects. The discovery_data_ordered.tar.gz also contains the parquet files from other parcellations. These can be used to reconstruct datasets when bevahioural data is available. Once datasets are reconstructed, full CPM analysiss can be rerun. 

 

Files

discovery_confound_total_surface_area.csv

Files (12.4 GB)

Name Size
md5:a0601895d01960773e7589d54ff74af1
620.1 MB Download
md5:a957c06c5e8452f0423500dce9c1c743
9.6 GB Download
md5:88acbc917fdfa596c3554784e9c2289d
25.4 kB Preview Download
md5:af988bf3daacff7ec9b732a6a3ec949b
3.9 MB Preview Download
md5:cd96bd66d11765fea4ef88ce54f1671d
734.7 MB Preview Download
md5:ad9371fd472316efa3b7e8fad16b041b
298.4 MB Preview Download
md5:601214cf585040e292514904d5ab2266
787.7 MB Preview Download
md5:43e08f7dcac16a1f5e7c372b1b605e50
322.9 MB Preview Download
md5:62a4dba7089798e5966bcdfd68a53b46
3.8 MB Preview Download
md5:c0db10c3f435e7d4621a96497cf59f2c
1.6 MB Preview Download
md5:c769b3a0ec81b4e3b70df0f46b462ff9
10.6 kB Preview Download

Additional details

Funding

European Commission
CORTIGRAD - Cortical gradients of functional integration 866533