There is a newer version of this record available.

Dataset Open Access

Supplemental Materials for "How to Read a Million Books: An Introduction to Data Analysis for the Humanities"

Karsdorp, Folgert; Kestemont, Mike; Riddell, Allen

Data discussed in the manuscript "How to Read a Million Books: An Introduction to Data Analysis for the Humanities".

Each folder in this dataset contains data used or discussed in one chapter. The data itself resides in a directory ("data") inside each folder. Each data directory contains a README file which describes the files found in the directory.
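The layout described above (one folder per chapter, each holding a "data" directory with a README) can be explored with a short script once the archive has been unpacked. The following is a minimal sketch, not part of the dataset itself: the function name `find_data_readmes` is hypothetical, and it assumes that README file names begin with "README" (the exact file names and extensions are not stated in this record).

```python
from pathlib import Path


def find_data_readmes(root):
    """Map each chapter folder name to the README files in its 'data' directory."""
    readmes = {}
    for chapter in sorted(Path(root).iterdir()):
        data_dir = chapter / "data"
        if not data_dir.is_dir():
            continue  # skip entries that do not follow the described layout
        # Assumption: README files are named README, README.md, README.txt, etc.
        readmes[chapter.name] = sorted(
            p.name
            for p in data_dir.iterdir()
            if p.is_file() and p.name.upper().startswith("README")
        )
    return readmes
```

Calling `find_data_readmes` on the directory where the archive was extracted returns a dictionary keyed by chapter folder name, which gives a quick overview of which README to read for each chapter.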

Most of the data are texts published before 1900. These texts are in the public domain.

Files (356.4 MB)
Name: million-books-data.zip
Size: 356.4 MB
MD5: 28ee633c076b54b4a255f1ddc18ad2c6
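The MD5 checksum listed for the archive can be used to confirm that a download completed without corruption. The sketch below is one way to do this in Python with the standard library; the helper name `md5_of_file` is hypothetical, and the example assumes the archive was saved under its listed name in the current working directory.

```python
import hashlib

# Checksum as listed in this record for million-books-data.zip.
EXPECTED_MD5 = "28ee633c076b54b4a255f1ddc18ad2c6"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hexadecimal MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read in 1 MiB chunks so a large archive is never held in memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()
```

After downloading, `md5_of_file("million-books-data.zip") == EXPECTED_MD5` should evaluate to `True`; a mismatch suggests an incomplete or altered download, or that a different version of the record was fetched.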
Statistics (all versions / this version)
Views: 1,733 / 76
Downloads: 1,085 / 6
Data volume: 443.4 GB / 2.1 GB
Unique views: 1,585 / 69
Unique downloads: 686 / 6
