Published November 7, 2025 | Version v2
Lesson | Open

Introduction to corpus building and text analysis with Voyant Tools

Authors/Creators

  • 1. Maastricht University

Description

The slides published here were created for teaching basic text analysis / distant reading with Voyant Tools to MA students in the humanities and social sciences. The focus is on analysing social media data. When the course "Machines of Knowledge" at Maastricht University was first taught, we used Twitter for data collection. After Elon Musk's takeover of the platform and changes to the API, we began using Apple podcast reviews and YouTube comments instead. The technologies we use for data collection have also evolved over time. Initially, we used browser-based scraping tools such as Netlytic, given that most of our students have no technical background. We now use Python code that students can easily adjust to collect their own data. These changes to the course are reflected in the slides from the different academic years.

Files

TextAnalysis1_CorpusBuilding.pdf

Files (90.7 MB)

Checksum Size
md5:0fa6351bcff64c4e556296ae9fabf9da 44.8 MB
md5:ef6cc5f94c9f0a111bde61310b44da15 14.2 MB
md5:24adad9d0dd1b9063ce94594b70be6da 31.7 MB

Additional details

Dates

Created: 2022-10-01 (original slide decks)
Updated: 2025-11-06 (2025 slide decks)

Software

Repository URL: https://monikabarget.github.io/distant-reading/
Programming language: Python
Development Status: Active