There is a newer version of the record available.

Published September 14, 2017 | Version v1
Dataset Open

Supplemental Materials for "How to Read a Million Books: An Introduction to Data Analysis for the Humanities"

  • 1. Meertens Institute
  • 2. University of Antwerp
  • 3. Indiana University

Description

Data discussed in the manuscript "How to Read a Million Books: An Introduction to Data Analysis for the Humanities".

Each folder in this dataset contains data used or discussed in one chapter. The data itself resides in a directory ("data") inside each folder. Each data directory contains a README file which describes the files found in the directory.

Most of the data are texts published before 1900. These texts are in the public domain.

Files

million-books-data.zip

Files (356.4 MB)

Name Size Download all
md5:28ee633c076b54b4a255f1ddc18ad2c6
356.4 MB Preview Download