There is a newer version of the record available.

Published June 9, 2025 | Version v2
Dataset Open

SentimentTopicNLP

Authors/Creators

Description

1)The data includes raw data (data.xlsx), the stop word dictionary (stop_words.txt), the custom dictionary (my_dict.txt), as well as other reference dictionaries used in natural language processing (dict_baidu_utf8.txt, dict_pangu.txt, dict_sougou_utf8.txt, dict_tencent_utf8.txt).
2)The main components of the code include: Data Collection(get_cookie.py、weiboSpider_v1.0.3.py、Crawl_user_information.py)、Data Preprocessing(preproce.py)、Sentiment Analysis(cnn_BiLSTM_att.py)、Topic Analysis(0.wordvec.py、1.top_num.py、2.LDA.py、3.topic_evolution.py)

Files

dict_baidu_utf8.txt

Files (78.4 MB)

Name Size Download all
md5:a23f877d4cc8c2a59800249f9342bb94
1.0 kB Download
md5:7c5920d2be907dd900b02b171a81956c
2.9 kB Download
md5:ad5fd1dc129b09c6d484f525dba8f73a
5.3 kB Download
md5:8e828c45ed80fd593703dccf67c987d4
7.6 kB Download
md5:8f3cf597cf80e6cb83a1f9339425f181
12.9 kB Download
md5:64c8e72e6541533c5424d49d48b44dda
4.2 kB Download
md5:6080a7bcc9dc1a75d6139e7f358c620f
68.7 MB Download
md5:a864184dd75e8c354bc2440e26addb45
33.8 kB Preview Download
md5:d955643ba051ce4ea909e7f163367fe3
2.0 MB Preview Download
md5:5a7a8c3d513d577ce61e2eb3a03625ff
4.2 MB Preview Download
md5:5905c2417707ad273a8ffe654f5e4f8b
452.5 kB Preview Download
md5:201d22fa8bf328813ca5a0eb6224a6f4
827 Bytes Download
md5:95635e883b5af7c2f0827b68eac1b797
337 Bytes Preview Download
md5:abb7cc432ba7f81be1a4672c51e33874
1.9 kB Download
md5:00140e7861524774483c2a703861be85
3.0 MB Preview Download
md5:d6409b9a61982d4f07edc1ed6ac07ff7
22.4 kB Preview Download
md5:f7335b7aa4abc4f9590e9cbbcd59397d
17.4 kB Download