Dataset Open Access
Keyword counts from US Presidential State of the Union Addresses and Presidential Budget Messages. This was done using the Python scripts provided under https://github.com/JeremySilver/KeywordCountsPresidentialMessages. The raw text data is from The American Presidency Project (UCSB), with some Presidential Budget Messages being extracted from US Federal Budget documents available through FRASER (a digital library of U.S. economic, financial, and banking history) or, for the more recent documents the website of the White House.
The data headings are:
Below is the list of keywords that match when the search is applied to a dictionary file containing over 99,000 US English words.
The dictionary file used is a standard file among Linux systems, and the version used was provided with version 7.1-1 of the Ubuntu 'wamerican' package. Two extra phrases, which do not appear in the dictionary file, are added to the list: 'civil rights' (under the 'racism' keyword) and 'natural resources' (under the 'natural resources' theme).
Name | Size | |
---|---|---|
results_PBM.txt
md5:fd44fceda37d3f1bef39543ed87dc11d |
40.7 kB | Download |
results_SoU.txt
md5:19dd5e5fb940579f362b13e386213de2 |
35.9 kB | Download |
All versions | This version | |
---|---|---|
Views | 194 | 194 |
Downloads | 37 | 37 |
Data volume | 1.4 MB | 1.4 MB |
Unique views | 147 | 147 |
Unique downloads | 28 | 28 |