Planned intervention: On Wednesday April 3rd 05:30 UTC Zenodo will be unavailable for up to 2-10 minutes to perform a storage cluster upgrade.
Published July 1, 2020 | Version v1
Dataset Open

Japanese COVID-19 Tweets from 2020-01-17 to 2020-04-30 (40,720,545 tweets and 105,317,606 retweets)

  • 1. The University of Tokyo
  • 2. Hottolink, Inc.
  • 3. Toyohashi University of Technology

Description

Abstract (our paper)

The spread of COVID-19, the so-called new coronavirus, is currently having an enormous social and economic impact on the entire world. Under such a circumstance, the spread of information about the new coronavirus on SNS is having a significant impact on economic losses and social decision-making. In this study, we investigated how the new type of coronavirus has become a social topic in Japan, and how it has been discussed. In order to determine what kind of impact it had on people, we collected and analyzed Japanese tweets containing words related to the new corona on Twitter. First, we analyzed the bias of users who tweeted. As a result, it is clear that the bias of users who tweeted about the new coronavirus almost disappeared after February 28, 2020, when the new coronavirus landed in Japan and a state of emergency was declared in Hokkaido, and the new corona became a popular topic. Second, we analyzed the emotional words included in tweets to analyze how people feel about the new coronavirus. The results show that the occurrence of a particular social event can change the emotions expressed on social media.

Data

Tweets_YYYY-MM-DD.tsv.gz:
The first column is the tweet id, the second column is the date and time (JST) when the tweet was posted, the third column is the flag as to whether the tweet was used for emotion analysis or not, and the fourth column is the tweet id of the retweet source.
This data was collected by giving the query "新型肺炎 OR 武漢 OR コロナ OR ウイルス OR ウィルス" to the Twitter Search API. Therefore, most of the tweets are Japanese tweets.
We conducted emotion analysis on tweets, excluding retweets and tweets containing links. The fourth column is empty if the tweet is not a retweet.

KL-Divergence.tsv.gz:
The first column is the date (JST), and the second column is the value of KL-Divergence that calculated the bias of the users who posted tweets related to COVID-19.
The value of KL-Divergence was calculated with all users appearing in Tweets_YYYY-MM-DD.tsv.gz. Based on the sampling stream data, we determined that if the value is below 0.6, there is no bias.

Emotions_by_ML-Ask.tsv.gz:
The first column is the date (JST), the second and subsequent columns are the number of tweets for each emotion, and the last column is the number of tweets analyzed for the day.
For this analysis, we only used tweets with a value of 1 in the third column of Tweets_YYYY-MM-DD.tsv.gz. We used pymlask (Python implementation of ML-Ask) to estimate the emotion of the tweet.

Publication

This data set was created for our study. If you make use of this data set, please cite:
Fujio Toriumi, Takeshi Sakaki, Mitsuo Yoshida. Social Emotions Under the Spread of COVID-19 Using Social Media. Transactions of the Japanese Society for Artificial Intelligence (in Japanese). vol.35, no.4, pp.F-K45_1-7, 2020.
鳥海不二夫, 榊剛史, 吉田光男. ソーシャルメディアを用いた新型コロナ禍における感情変化の分析. 人工知能学会論文誌. vol.35, no.4, pp.F-K45_1-7, 2020.
https://doi.org/10.1527/tjsai.F-K45

Files

Files (1.9 GB)

Name Size Download all
md5:274a96aaee7d76d24073c985d5b77cad
7.5 kB Download
md5:c8ec2d3d8c546935ac51fa7dbe64846e
3.1 kB Download
md5:0aeb6c302c47632974760a6306038db4
183.4 kB Download
md5:b7a1b54513d744c28fdcd2aa691864f5
162.6 kB Download
md5:a8b6c2535052606ed59a4bcd3c096e7f
162.1 kB Download
md5:8bb43b4d4cbbcac3d094cfb312ef7fea
314.0 kB Download
md5:3d09dcad2365e4ea5cdca66d3dace86f
664.2 kB Download
md5:d394fdd413c52e50218a893d33ab1461
1.3 MB Download
md5:18cfe582d3b58df5d07f88bb652e5eb1
2.3 MB Download
md5:36d620b501c87890a9142386756c61ff
4.7 MB Download
md5:411191effbb53394bba7449cbb836ece
4.8 MB Download
md5:dbbb99d219855d49b1affaddf467dda8
5.0 MB Download
md5:180b8c37096b7a3b652b7a6334068c51
5.3 MB Download
md5:765fe0e785e74acf35adea1a536b2c72
6.9 MB Download
md5:dce219de049e454b0d993d3d25ba8b6f
7.3 MB Download
md5:6b14436c6823e42c05ac1fa1eaaa40f2
9.0 MB Download
md5:f5afb438309e2d7b222a6f8e4f767aab
9.9 MB Download
md5:18df920d2decbaebf692aaff28ddfdc1
5.6 MB Download
md5:f76502d59707b62915b5ea97178dbf24
4.5 MB Download
md5:5ed19604eae2a1f892fd683ac9acf365
4.1 MB Download
md5:932dc4b8e6088ce70cc30a2c9d47ee36
3.5 MB Download
md5:24c77de49fb6ac0e877cb048cf3cd0b8
4.4 MB Download
md5:0bbb1beb055c66b89921319b5d597ab6
4.4 MB Download
md5:17e174afbc3faa22ea4a7571ead71b55
4.3 MB Download
md5:67969048721aacb5fe2897c5fa1c024f
3.9 MB Download
md5:69aeeee465c1656345a0c13534c2cc83
3.9 MB Download
md5:37e5e4ae660cc0f680fbdab8397c3683
3.4 MB Download
md5:119aae742f7b7605114602cd46f7704c
2.9 MB Download
md5:0ec69827bdef12d8ec6eb074cc7577ec
3.5 MB Download
md5:3cac6a30aa22617166aab87129a6a1d4
5.2 MB Download
md5:60ac2d26fa090cbddbf59cc4eae76cc2
7.6 MB Download
md5:6a083c3eec28c2a7ceaf28981a81bc13
9.0 MB Download
md5:5fe27a651e5908f10da5a8ee31300343
9.1 MB Download
md5:71c92b3006a394d7de6805b6343fe6a9
12.0 MB Download
md5:585a8eaeff93bbd3a3883c4554569bf5
10.5 MB Download
md5:a1c56d60c9ce10d0e4c9693288fd1e56
10.6 MB Download
md5:52ac7df33665856fef55856ca0fb78bc
10.1 MB Download
md5:03c5afff781f393a2bf5d17f111aa153
12.5 MB Download
md5:b0f13dbde6563ca44f8f5275acb65907
11.8 MB Download
md5:b178a4dc49cc11273431cc82b701205c
13.3 MB Download
md5:169c8f163ef5ea106f85b846569f6804
15.5 MB Download
md5:7d22e72bf2f8ce241af5517a8949b7e7
18.5 MB Download
md5:111372c2e939238f8ac67043cea288a4
27.1 MB Download
md5:04ab1ab9ab438a0291d8489d4838bfbb
32.6 MB Download
md5:8ba198b3e79b64f4105158157d5da0a1
33.2 MB Download
md5:d3b43ce6c434e14be9aea26021767659
28.2 MB Download
md5:7d99c53c0535c1baee64cb0bb8f03661
21.5 MB Download
md5:7f2c049d4b121f106a28b438f3fcb1d0
22.2 MB Download
md5:c4653cd2c8511f48e8521efa4cb8ba58
20.4 MB Download
md5:d50c341ded922cd12626736ac71ece51
20.7 MB Download
md5:f42478c3a993568e69be7679970f19cb
22.2 MB Download
md5:06dbef26286155c4c23ec75aba49d35c
20.9 MB Download
md5:a70f9fb9f84b3833b0199d6ca571abe9
18.6 MB Download
md5:7a1c0d477171d219bf256f560a45e0e4
17.2 MB Download
md5:bf78adfc4308e946dfbb5b13030bd001
20.8 MB Download
md5:69ffc5211db1a511646cc7528c13978d
22.0 MB Download
md5:64f647c22d1a7a4c66c7632d22b66b96
21.3 MB Download
md5:59db8bb7cec98d024405016928396f6c
23.3 MB Download
md5:4e6fc51221516d908bc110e6110b22c8
19.9 MB Download
md5:066199ce2fc35e5e36ed5a1d6c573f09
18.8 MB Download
md5:fdde6ba085f81f2aad5aa4dc12f2610a
18.0 MB Download
md5:fae7dcdd3ba521d364da0d5d3efecc43
17.0 MB Download
md5:e2db34021479d61e96a2afe29c807d41
19.9 MB Download
md5:77f089d8957f0891b9d5587571d5f4ec
18.7 MB Download
md5:bf005052ad6a3788fd9321b7973a8544
15.5 MB Download
md5:ed004f98bbae99d5fb10632277376e06
14.0 MB Download
md5:dc37b8b8e745ab4b9fe7de8b9efa0d51
12.5 MB Download
md5:44a74665f678b0b70183db56ee33c1ec
14.9 MB Download
md5:c47c100a1cb9807374c8fa7610f29bf3
18.1 MB Download
md5:06e5282d35bad05246bc607b543c4112
17.4 MB Download
md5:2f4af443f2ad39cb0e8e209ece804d8e
22.9 MB Download
md5:98f71e8d1a6c4be9226e60664ee8980a
27.5 MB Download
md5:1a8e8c3261f19544998e2bb6c20eba52
29.4 MB Download
md5:6c5a919805e0212b376b3f48f3c0a1b0
27.4 MB Download
md5:baa961d74fc7b14cb38a0910dc8d8d02
26.4 MB Download
md5:7fcf430d20608864689f7303bc864fec
43.7 MB Download
md5:907de6f6bb8c065814f4b24e149901c2
35.8 MB Download
md5:ceb43f879a9f35d7176ffe9f31d679c8
33.9 MB Download
md5:561ac4995f0bcfe1749ec08a62173bc4
32.5 MB Download
md5:d6f0d5063005480b65781c82a1a31f3a
34.0 MB Download
md5:d7bd2fbcda22987e5c3679101926116e
32.1 MB Download
md5:906a9b037809b284894d515d99766a40
28.2 MB Download
md5:fa04db091318d056c5db3f902d0ed155
32.3 MB Download
md5:acf1ae8c711f5f37fcddadc56de1048f
35.5 MB Download
md5:8e782afc525175434a4e04cecad0405f
36.2 MB Download
md5:93b5b295afe73c4514f4b34c74ab2ff6
34.4 MB Download
md5:c33de56c7616b93f6fd76d0c3afe00ef
32.4 MB Download
md5:852bb8575b94a323c7a8efcc016a122b
28.0 MB Download
md5:8cd1a8e85891061101abf3c91122fbe1
25.8 MB Download
md5:c5dd0c767737d83000e1fc5cb667e07a
25.7 MB Download
md5:76305e964d7c52c608725d8ef8421da8
26.7 MB Download
md5:fcb9b035755a3149ffaffa629d0e7556
28.2 MB Download
md5:f506dd8da60a9272b7d4f8ba33dca17c
26.1 MB Download
md5:d82e786fef013390e7624ee62baaf1bf
26.6 MB Download
md5:dd9f2358b9d267df645770ea355496da
24.1 MB Download
md5:359dd90046c773ec64029326b52cff86
25.7 MB Download
md5:97ae4665af6a91ab65520e15fa545b0c
28.8 MB Download
md5:91b04954e34634a5be8bede859779b3d
27.6 MB Download
md5:4de79d4805fd9eff8311b0c439ec5277
27.5 MB Download
md5:df23d40363ba28982c95708931ba2d15
29.1 MB Download
md5:f0ba27f2019060d01e7ae1ae314e7814
27.8 MB Download
md5:04b5e1bf319860d87bafeae2eeda998b
23.3 MB Download
md5:dfaf5d09dee9bb6a6bbe532136985080
22.9 MB Download
md5:4cf78a16859a4d6cffca37731e8cbf71
22.6 MB Download
md5:e20e265d4c098b7336e206431cab53e7
22.8 MB Download
md5:6021197f08e5826e08414897edff4e8b
19.7 MB Download
md5:67ebce3ab71bb1564b29aeffe06757fa
20.6 MB Download

Additional details

References

  • 鳥海不二夫, 榊剛史, 吉田光男. ソーシャルメディアを用いた新型コロナ禍における感情変化の分析. 人工知能学会論文誌. vol.35, no.4, pp.F-K45_1-7, 2020.