This README describes the contents of the accompanying spreadsheet, search in papers for charts (5 words).xlsx
The spreadsheet has several tabs, each named after a newspaper.  Each tab holds five columns of data, one for each of the five search terms, and each column of data has a column of years alongside it.  The values are per million words in category.  They should be interpreted as follows.
For instance, in the tab Belfast Newsletter, for the search term influenza and the year 1831, the value given is 0.6.  This means that the search term influenza was found at a frequency of 0.6 words per million among all the words in the Belfast Newsletter in 1831.
To give another example, the search term cough in the Pall Mall Gazette for the year 1890 was found at a frequency of 61.36 words per million.
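The per-million figure is a simple normalisation of a raw count.  A minimal sketch in Python follows; the spreadsheet stores only the normalised values, so the hit count and word total used here are made up for illustration, and the function name per_million is hypothetical.

```python
def per_million(hits: int, total_words: int) -> float:
    """Return the number of occurrences of a term per million words."""
    if total_words <= 0:
        raise ValueError("total_words must be positive")
    return hits / total_words * 1_000_000


# Hypothetical counts: 12 hits among 20 million words in one year.
frequency = per_million(12, 20_000_000)
print(f"{frequency:.2f} words per million")
```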
Some of the cells are coloured.  These are the values that exceed the significance threshold.  The calculation of each threshold is shown below each table.
For instance, in the tab Belfast Newsletter, it can be seen that the average occurrence of influenza was 2.11 words/million over the years 1831 to 1900 inclusive.  Click in this cell to see the spreadsheet formula used (it is an ordinary AVERAGE(XY)).
Similarly, the standard deviation of the same values is calculated in the cell below (it is STDEV(XY)), and the 95% tail is estimated at 1.96 standard deviations.  Because we use about 10 newspapers, each search term is tested about 10 times for each year, so the Bonferroni correction is applied.
The cell below this one calculates the threshold of significant occurrence (average + 1.96 standard deviations).  So, for the search term influenza, the threshold for the Belfast Newsletter is 8.0.
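The threshold calculation can be reproduced outside the spreadsheet.  The sketch below assumes the yearly values for one search term have been copied into a Python list; the numbers are invented, not taken from the spreadsheet, and the function name is hypothetical.  statistics.stdev is the sample standard deviation, which is what the spreadsheet STDEV function computes.

```python
from statistics import mean, stdev


def significance_threshold(values, z=1.96):
    """Average plus z sample standard deviations of the yearly values."""
    return mean(values) + z * stdev(values)


# Invented per-million values for one search term in one newspaper.
values = [0.5, 1.0, 1.5, 2.0, 10.0]
threshold = significance_threshold(values)
print(f"average = {mean(values):.2f}, threshold = {threshold:.2f}")
```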
Conditional formatting is then applied to colour the values above the threshold.  So, in the Belfast Newsletter, for the year 1890, the significant search terms include influenza and epidemic.
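The effect of the conditional formatting can be imitated by listing the years whose value lies above the threshold.  This is only a sketch: the yearly values are invented, and significant_years is a hypothetical helper, not something defined in the spreadsheet.

```python
from statistics import mean, stdev


def significant_years(series, z=1.96):
    """Return the years whose value exceeds average + z standard deviations.

    series maps each year to its per-million value for one search term.
    """
    values = list(series.values())
    threshold = mean(values) + z * stdev(values)
    return [year for year, value in series.items() if value > threshold]


# Invented values for one search term in one newspaper.
series = {
    1886: 1.0, 1887: 1.2, 1888: 0.8, 1889: 1.1,
    1890: 9.0, 1891: 1.3, 1892: 0.9, 1893: 1.0,
}
flagged = significant_years(series)
print(flagged)
```

With these invented numbers, only 1890 lies above the threshold, so it is the only year that would be coloured.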
Most of the tabs also have graphs plotting the data.  These can be useful for checking significant years, although they are too messy to be used in the publication.
The tab labelled SUMMARY is the basis of Table 1 in the publication.  For instance, for the year 1865 and the search term epidemic, it lists Liverpool, Glasgow, Belfast, Times.  Check back with the relevant tabs to see the values for each of these newspapers for 1865 and epidemic.  You will see they are all coloured red.
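The way a summary of this kind can be assembled from the individual tabs is sketched below.  It assumes the tabs have been loaded into a nested dictionary (newspaper, then search term, then year); the newspaper names and values are invented, build_summary is a hypothetical helper, and the SUMMARY tab itself may have been compiled differently.

```python
from statistics import mean, stdev


def build_summary(data, z=1.96):
    """Map (year, term) to the newspapers whose value exceeds the threshold.

    data[newspaper][term] maps each year to a per-million value.
    The threshold is computed separately for every newspaper and term.
    """
    summary = {}
    for newspaper, terms in data.items():
        for term, series in terms.items():
            values = list(series.values())
            threshold = mean(values) + z * stdev(values)
            for year, value in series.items():
                if value > threshold:
                    summary.setdefault((year, term), []).append(newspaper)
    return summary


years = range(1861, 1869)
# Invented data: two papers peak in 1865, the third stays flat.
data = {
    "Paper A": {"epidemic": dict(zip(years, [1.0, 1.0, 1.0, 1.0, 9.0, 1.0, 1.0, 1.0]))},
    "Paper B": {"epidemic": dict(zip(years, [1.0, 1.0, 1.0, 1.0, 8.0, 1.0, 1.0, 1.0]))},
    "Paper C": {"epidemic": dict(zip(years, [1.0, 1.2, 0.8, 1.1, 1.0, 0.9, 1.3, 1.0]))},
}
summary = build_summary(data)
for (year, term), newspapers in sorted(summary.items()):
    print(year, term, ", ".join(newspapers))
```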
