Dataset Open Access
Karsdorp, Folgert; Kestemont, Mike; Riddell, Allen
Data discussed in the manuscript "How to Read a Million Books: An Introduction to Data Analysis for the Humanities".
Each folder in this dataset contains data used or discussed in one chapter. The data itself resides in a directory ("data") inside each folder. Each data directory contains a README file which describes the files found in the directory.
Most of the data are texts published before 1900. These texts are in the public domain.
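As a rough sketch of how the layout described above could be explored without unpacking the archive, the following lists the README files that sit directly inside each "data" directory. The helper name `find_data_readmes` is hypothetical, and the assumption that each README's file name starts with "README" is not confirmed by this record.

```python
import zipfile
from pathlib import PurePosixPath


def find_data_readmes(archive_path):
    """Return names of README files located directly inside a 'data' directory.

    Assumes (not confirmed by the record) that README file names start
    with 'README', in any letter case.
    """
    with zipfile.ZipFile(archive_path) as zf:
        return sorted(
            name
            for name in zf.namelist()
            if not name.endswith("/")  # skip directory entries
            and PurePosixPath(name).parent.name == "data"
            and PurePosixPath(name).name.upper().startswith("README")
        )


if __name__ == "__main__":
    # Assumes the archive has been downloaded to the working directory.
    for readme in find_data_readmes("million-books-data.zip"):
        print(readme)
```

Reading the member list this way avoids extracting the full 356.4 MB archive just to see which chapters ship which files.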
Name | Size | MD5 |
---|---|---|
million-books-data.zip | 356.4 MB | 28ee633c076b54b4a255f1ddc18ad2c6 |
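The MD5 checksum listed for the archive can be used to check that a download completed intact. Below is a minimal sketch using Python's standard `hashlib`. The function name `md5_of_file` and the local file path are assumptions for illustration.

```python
import hashlib

# Checksum as listed for million-books-data.zip in the file table.
EXPECTED_MD5 = "28ee633c076b54b4a255f1ddc18ad2c6"


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


if __name__ == "__main__":
    # Assumes the archive was saved under its original name
    # in the current working directory.
    actual = md5_of_file("million-books-data.zip")
    print("checksum OK" if actual == EXPECTED_MD5 else f"mismatch: {actual}")
```

Reading in chunks keeps memory use flat regardless of archive size. MD5 here only guards against corrupted or truncated downloads, not deliberate tampering.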
Statistic | All versions | This version |
---|---|---|
Views | 1,733 | 76 |
Downloads | 1,085 | 6 |
Data volume | 443.4 GB | 2.1 GB |
Unique views | 1,585 | 69 |
Unique downloads | 686 | 6 |