Published August 8, 2022 | Version v1
Dataset Open

Frequencies per million words for 5 epidemiologically relevant search terms in a dozen British 19th century newspapers

Authors/Creators

  • 1. Lancaster University

Description

COVID-19 is the first known coronavirus pandemic.  Nevertheless, the seasonal circulation of the four milder coronaviruses of humans – OC43, NL63, 229E and HKU1 – raises the possibility that these viruses are the descendants of more ancient coronavirus pandemics.  This proposal arises by analogy to the observed descent of seasonal influenza subtypes H2N2 (now extinct), H3N2 and H1H1 from the pandemic strains of 1957, 1968 and 2009, respectively.  Recent historical revisionist speculation has focussed on the influenza pandemic of 1889-1892, based on molecular phylogenetic reconstructions that show the emergence of human coronavirus OC43 around that time, probably by zoonosis from cattle.  If the "Russian influenza", as The Times named it in early 1890, was not influenza but caused by a coronavirus, the origins of the other three milder human coronaviruses may also have left a residue of clinical evidence in the 19th century medical literature and popular press.  In this paper, we search digitised 19th century British newspapers for evidence of previously unsuspected coronavirus pandemics.  We conclude that there is little or no corpus linguistic signal in the UK national press for large-scale outbreaks of unidentified respiratory disease for the period 1785 to 1890.

Notes

To view data, open in Microsoft Excel.  To reproduce the data from scratch, a login is needed to CQPweb (https://cqpweb.lancs.ac.uk).  This is free of charge but requires authorization, which can be applied for at the URL given.

Files

README.txt

Files (196.4 kB)

Name Size Download all
md5:e9a077b5e37f6663325e12d75df87787
2.4 kB Preview Download
md5:63a9f91e14c0123615877692d939e87a
194.0 kB Download

Additional details