The Trump 2016 Primary Corpus (TPC2016)
Description
The Trump 2016 Primary Corpus (TPC2016) is a curated and annotated dataset containing full transcripts of Donald J. Trump’s campaign speeches delivered during the 2016 Republican primary. The corpus comprises 222 rally transcripts, accompanied by detailed metadata, an annotation guide, citation formats, and a licensing agreement. This collection is intended for use in political discourse analysis, corpus linguistics, media studies, and computational text analysis.
All materials are encoded in UTF-8 plain text and follow a standardized naming convention. The dataset includes a CSV metadata file with detailed contextual information for each rally, a license file, and documentation for reuse.
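As a rough illustration of working with this layout, the sketch below reads the transcripts and the metadata straight from the archive. It assumes that transcripts are stored as `.txt` members and the metadata as a `.csv` member; the actual file names and CSV columns are not stated here, so `load_corpus` and its return shape are hypothetical. If the license or documentation is also stored as plain text, the `.txt` filter would pick those files up as well.

```python
import csv
import io
import zipfile


def load_corpus(zip_path):
    """Return (transcripts, metadata_rows) read from the archive.

    Assumption: transcripts are the .txt members and the metadata is a
    .csv member; the real file names are not given in the record.
    """
    transcripts = {}
    metadata_rows = []
    with zipfile.ZipFile(zip_path) as archive:
        for name in archive.namelist():
            lower = name.lower()
            if lower.endswith(".txt"):
                # "utf-8-sig" reads plain UTF-8 and drops a BOM if present.
                transcripts[name] = archive.read(name).decode("utf-8-sig")
            elif lower.endswith(".csv"):
                text = archive.read(name).decode("utf-8-sig")
                # Each row becomes a dict keyed by the CSV header line.
                metadata_rows.extend(csv.DictReader(io.StringIO(text)))
    return transcripts, metadata_rows
```

Calling `load_corpus("TPC2016_Corpus.zip")` would then give a dictionary of transcript texts keyed by archive path and a list of metadata rows, which can be joined once the naming convention and column names have been checked against the documentation shipped with the corpus.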
TPC2016 is offered under a CC BY 4.0 license and aims to support reproducibility and openness in the study of populism, rhetoric, and political communication.
Registered with the Registro Territorial de la Propiedad Intelectual de la Comunidad de Madrid on 14/05/2025 (Ref: 49/165181.9/25).
Files
| Name | Size | Checksum |
|---|---|---|
| TPC2016_Corpus.zip | 4.3 MB | md5:b8c3238de118dd1cd59a8945a4cc8561 |
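The MD5 checksum listed for the archive can be used to confirm that a download is intact. The following is a minimal sketch using only the Python standard library; the function names `md5_of_file` and `matches_published_checksum` are illustrative, and the local path passed in is whatever location the archive was saved to.

```python
import hashlib

# Checksum as listed for TPC2016_Corpus.zip in this record.
EXPECTED_MD5 = "b8c3238de118dd1cd59a8945a4cc8561"


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_checksum(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 equals the expected digest."""
    return md5_of_file(path) == expected.lower()
```

For example, `matches_published_checksum("TPC2016_Corpus.zip")` should return `True` for an unmodified copy. MD5 is adequate for detecting accidental corruption during transfer, though it is not a safeguard against deliberate tampering.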