Published January 4, 2021 | Version v1.1 | Software | Open
WHOSpeechAnalysis/data: WHO director general's speeches
Description
This release contains the initial software needed to download the WHO director general's speeches.
The attached gzip'ed data set covers 2003/07/21 through 2020/12/28 and includes the files listed below. Additional data can be retrieved by following the steps outlined in the README.
- All the raw HTML pages can be found in ~/raw.
- The list of the speeches is in speeches.txt.
- The parsed speeches are in corpus.jsonl.
- The final tokenized speeches are in corpus.tokenized.jsonl.
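
The two `.jsonl` files above are presumably in JSON Lines format (one JSON value per line). This record does not document the structure of each line, so the following is only a minimal sketch for inspecting them. The helper name `read_jsonl` is hypothetical and is not part of the released software, and the sketch assumes that `corpus.jsonl` has been extracted into the current working directory.

```python
import json
from pathlib import Path
from typing import Any, Iterator


def read_jsonl(path) -> Iterator[Any]:
    """Yield one parsed JSON value per non-empty line of a JSON Lines file.

    Hypothetical helper for illustration; not part of the released software.
    """
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:  # skip blank lines
                yield json.loads(line)


if __name__ == "__main__":
    # Assumption: corpus.jsonl sits in the current working directory.
    corpus_path = Path("corpus.jsonl")
    if corpus_path.exists():
        for index, record in enumerate(read_jsonl(corpus_path)):
            # The record layout is not documented here, so just show
            # the type of the first few records and their contents.
            print(type(record).__name__, record)
            if index >= 2:
                break
```

Printing the first few records is a quick way to learn which fields each speech carries before writing any further analysis code. The same helper should work for corpus.tokenized.jsonl if it follows the same one-value-per-line layout.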
Files
| Name | Size | Checksum |
|---|---|---|
| WHOSpeechAnalysis/data-v1.1.zip | 12.6 kB | md5:ee52c2cac9715cd9304c8b897835631a |
Additional details
Related works
- Is supplement to: https://github.com/WHOSpeechAnalysis/data/tree/v1.1 (URL)