Published June 9, 2025 | Version v2
Dataset
Open
SentimentTopicNLP
Authors/Creators
Description
1) The data include the raw data (data.xlsx), the stop word dictionary (stop_words.txt), the custom dictionary (my_dict.txt), and other reference dictionaries used for natural language processing (dict_baidu_utf8.txt, dict_pangu.txt, dict_sougou_utf8.txt, dict_tencent_utf8.txt).
2) The code has four main components: Data Collection (get_cookie.py, weiboSpider_v1.0.3.py, Crawl_user_information.py), Data Preprocessing (preproce.py), Sentiment Analysis (cnn_BiLSTM_att.py), and Topic Analysis (0.wordvec.py, 1.top_num.py, 2.LDA.py, 3.topic_evolution.py).
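
As an illustration of how a word list such as stop_words.txt might be used in a preprocessing step, here is a minimal sketch. It assumes the file is UTF-8 text with one entry per line, which this record does not confirm. The function names `load_word_list` and `remove_stop_words` are hypothetical and are not taken from preproce.py.

```python
from pathlib import Path


def load_word_list(path, encoding="utf-8"):
    """Read a word list into a set.

    Assumption: one entry per line, UTF-8 encoded. Blank lines are skipped.
    """
    text = Path(path).read_text(encoding=encoding)
    return {line.strip() for line in text.splitlines() if line.strip()}


def remove_stop_words(tokens, stop_words):
    """Return the tokens that are not stop words, preserving their order."""
    return [tok for tok in tokens if tok not in stop_words]
```

With an already segmented post, a call could look like `remove_stop_words(tokens, load_word_list("stop_words.txt"))`. Segmentation itself, and any use of the custom or reference dictionaries, is left out because the record does not say which tokenizer the authors used.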
Files
(78.4 MB)
| MD5 checksum | Size |
|---|---|
| md5:a23f877d4cc8c2a59800249f9342bb94 | 1.0 kB |
| md5:7c5920d2be907dd900b02b171a81956c | 2.9 kB |
| md5:ad5fd1dc129b09c6d484f525dba8f73a | 5.3 kB |
| md5:8e828c45ed80fd593703dccf67c987d4 | 7.6 kB |
| md5:8f3cf597cf80e6cb83a1f9339425f181 | 12.9 kB |
| md5:64c8e72e6541533c5424d49d48b44dda | 4.2 kB |
| md5:6080a7bcc9dc1a75d6139e7f358c620f | 68.7 MB |
| md5:a864184dd75e8c354bc2440e26addb45 | 33.8 kB |
| md5:d955643ba051ce4ea909e7f163367fe3 | 2.0 MB |
| md5:5a7a8c3d513d577ce61e2eb3a03625ff | 4.2 MB |
| md5:5905c2417707ad273a8ffe654f5e4f8b | 452.5 kB |
| md5:201d22fa8bf328813ca5a0eb6224a6f4 | 827 Bytes |
| md5:95635e883b5af7c2f0827b68eac1b797 | 337 Bytes |
| md5:abb7cc432ba7f81be1a4672c51e33874 | 1.9 kB |
| md5:00140e7861524774483c2a703861be85 | 3.0 MB |
| md5:d6409b9a61982d4f07edc1ed6ac07ff7 | 22.4 kB |
| md5:f7335b7aa4abc4f9590e9cbbcd59397d | 17.4 kB |
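
The MD5 values listed above can be used to check that a downloaded file arrived intact. The following is a minimal sketch using only the Python standard library. The helper names `md5_of_file` and `matches_checksum` are hypothetical, and because this listing does not show which file name belongs to which checksum, pairing a file with its expected value is left to the reader.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in chunks
    so that large files do not have to fit in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected):
    """Compare a file against an expected value such as 'md5:<hex digest>'.

    The 'md5:' prefix is optional and the comparison ignores case.
    """
    expected = expected.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[len("md5:"):]
    return md5_of_file(path) == expected
```

A call such as `matches_checksum("data.xlsx", "md5:<value from the table>")` returns `True` only when the local copy is byte-for-byte identical to the published file.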