Published May 13, 2025 | Version v1
Dataset Open

The Trump 2016 Primary Corpus (TPC2016)

  • 1. ROR icon Universidad Complutense de Madrid
  • 2. ROR icon Liberty University

Description

The Trump 2016 Primary Corpus (TPC2016) is a curated and annotated dataset containing full transcripts of Donald J. Trump’s campaign speeches delivered during the 2016 Republican primary. The corpus includes 222 rally transcripts, each accompanied by detailed metadata, an annotation guide, citation formats, and a licensing agreement. This collection is intended for use in political discourse analysis, corpus linguistics, media studies, and computational text analysis.

All materials are encoded in UTF-8 plain text and follow a standardized naming convention. The dataset includes a CSV metadata file with detailed contextual information for each rally, a license file, and documentation for reuse.

TPC2016 is offered under a CC BY 4.0 license and aims to support reproducibility and openness in the study of populism, rhetoric, and political communication.

Registrado ante el Registro Territorial de la Propiedad Intelectual de la Comunidad de Madrid con fecha 14/05/2025 (Ref: 49/165181.9/25).

Files

TPC2016_Corpus.zip

Files (4.3 MB)

Name Size Download all
md5:b8c3238de118dd1cd59a8945a4cc8561
4.3 MB Preview Download