Dataset Open Access

The Threatening English Language (TEL) Corpus

Gales, Tammy; Nini, Andrea; Symonds, Ellen

TEL is the Threatening English Language corpus. It is a collection of 309 written texts compiled from the publicly available portion of CTARC (the Communicated Threat Assessment Research Corpus, compiled by Tammy Gales), MFT (the Malicious Forensic Texts corpus, compiled by Andrea Nini), and the written portion of CoJO (the Corpus of Judicial Opinions, compiled by Julia Muschalik). Additional texts are from ForensicLing.com (the forensic linguistic data site hosted by Tammy Gales and Dakota Wing). Basic metadata is supplied for each text where known from the original case research. We wish to thank our graduate student fellows who helped compile the texts and metadata: Nicole Harris, Annina van Riper, Zara Rabinko, and Zachary Boudreaux.

Total texts: 309
Total estimated authors: 203
Total word count: 54,167

METADATA KEY

TG = Tammy Gales (public portion of CTARC)
AN = Andrea Nini (MFT)
JM = Julia Muschalik (written portion of CoJO)
FL = ForensicLing.com (Tammy Gales and Dakota Wing)

Name###_## = file name, case number, text number within case
The name in a file name may be that of the threat recipient or of the author; the remaining metadata describes the author, where known.
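
As an illustration of the Name###_## convention, here is a minimal Python sketch that splits a file name into its name, case number, and text number. The helper `parse_tel_name`, the `TelFileName` type, and the example file name `Smith012_03.txt` are hypothetical and not part of the corpus. The sketch also assumes that each `#` stands for a digit and accepts any number of digits, since the exact widths are not stated here.

```python
import re
from pathlib import Path
from typing import NamedTuple, Optional


class TelFileName(NamedTuple):
    """Parts of a TEL file name (hypothetical helper type)."""
    name: str          # threat recipient or author, per the metadata key
    case_number: int   # the ### part
    text_number: int   # the ## part: text number within the case


# Assumption: a name, then a run of digits, an underscore, and another run of digits.
_PATTERN = re.compile(r"^(?P<name>.+?)(?P<case>\d+)_(?P<text>\d+)$")


def parse_tel_name(filename: str) -> Optional[TelFileName]:
    """Return the parsed parts, or None if the name does not fit the pattern."""
    stem = Path(filename).stem  # drop a trailing extension such as .txt
    match = _PATTERN.match(stem)
    if match is None:
        return None
    return TelFileName(
        name=match.group("name"),
        case_number=int(match.group("case")),
        text_number=int(match.group("text")),
    )


if __name__ == "__main__":
    # "Smith012_03.txt" is an invented example, not a file from the corpus.
    print(parse_tel_name("Smith012_03.txt"))
```

Returning None for non-matching names, rather than raising an error, makes it easy to list any files that deviate from the convention.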

Files (548.1 kB)

TEL Corpus -- Spreadsheet version.csv (331.9 kB)
md5:a47b1356cc5b06c65319e9e878855678

TEL Corpus -- Plain TXT Files.zip (216.2 kB)
md5:1989043536f0f8b2ad8f2c0b786b1d96
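
The MD5 checksums listed above can be used to confirm that a download is intact. The following is a minimal Python sketch using the standard `hashlib` module. The helper names `md5_of` and `verify` are hypothetical, and the sketch assumes both files were saved under their listed names in a single directory.

```python
import hashlib
from pathlib import Path

# Checksums copied from the file listing above.
EXPECTED_MD5 = {
    "TEL Corpus -- Spreadsheet version.csv": "a47b1356cc5b06c65319e9e878855678",
    "TEL Corpus -- Plain TXT Files.zip": "1989043536f0f8b2ad8f2c0b786b1d96",
}


def md5_of(path, chunk_size=65536):
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(directory="."):
    """Map each listed file name to True if it exists and its MD5 matches."""
    results = {}
    for name, expected in EXPECTED_MD5.items():
        path = Path(directory) / name
        results[name] = path.is_file() and md5_of(path) == expected
    return results


if __name__ == "__main__":
    # Assumption: the downloads sit in the current working directory.
    for name, ok in verify(".").items():
        print(("OK       " if ok else "MISMATCH ") + name)
```

A missing file is reported the same way as a mismatch, so a False result means either that the file is absent or that its contents differ from the published version.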