Published January 4, 2021 | Version v1.1
Software Open

WHOSpeechAnalysis/data: WHO director general's speeches

Authors/Creators

  • 1. Harrisburg University of Science and Technology

Description

This release contains the initial software needed to download the WHO director general's speeches.

The attached gzip'ed data set contains data from 2003/07/21 through 2020/12/28 and includes the below. Additional data can be retrieved by following the steps outlined in the README.

  • All the raw HTML pages can be found in ~/raw
  • The list of the speeches is in speeches.txt
  • The parsed speeches are in corpus.jsonl.
  • The final tokenized speeches are in corpus.tokenized.jsonl.

Files

WHOSpeechAnalysis/data-v1.1.zip

Files (12.6 kB)

Name Size Download all
md5:ee52c2cac9715cd9304c8b897835631a
12.6 kB Preview Download

Additional details

Related works