Dataset Open Access

Hacker News lda2vec preprocessed text

Moody, Christopher

Raw data: https://zenodo.org/record/45901

Preprocessed dataset into tokenized forms with noun chunks

Files (492.7 MB)
Name Size
preprocessed.tar.gz
md5:96e7d9d8963242132380148eca42f30b
492.7 MB Download
1,407
261
views
downloads
All versions This version
Views 1,4071,407
Downloads 261261
Data volume 128.6 GB128.6 GB
Unique views 1,3271,327
Unique downloads 228228

Share

Cite as