OpenRefine for the Humanities
Description
This lesson introduces OpenRefine as a tool for exploring, cleaning, transforming, and enriching tabular data commonly used in humanities and cultural heritage research. Using a subset of the Metropolitan Museum of Art Open Access dataset, learners gain hands-on experience with importing data, identifying and correcting data quality issues, creating facets and filters, using GREL expressions, transforming and restructuring data, clustering similar values, reconciling records with external authority sources, and exporting reproducible data-cleaning workflows.
Designed for beginners with little or no prior technical experience, the lesson follows a practical, example-driven approach that emphasizes exploratory data analysis, transparent data cleaning, and reproducible research practices. It is specifically aimed at scholars in the humanities and cultural studies as well as professionals working in Galleries, Libraries, Archives, and Museums (GLAM).
A rendered version of the lesson can be found here.
Files
open-refine-humanities-2026.06.24.zip
Files
(1.9 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:7733fb650f96451176f6fad01e2a5f5f
|
1.9 MB | Preview Download |
Additional details
Related works
- Is supplement to
- Software: https://github.com/HERMES-DKZ/open-refine-humanities/tree/v2026.06.24 (URL)
Software
- Repository URL
- https://github.com/HERMES-DKZ/open-refine-humanities