Published January 11, 2020 | Version v1
Dataset
Open
Wbbyyr: FastText language models for Mandarin Chinese, trained on 14m Sina Weibo posts for each year in 2012-2018 (Fold 1 of 10)
Description
Wbbyyr: FastText language models for Mandarin Chinese, trained on 14,440,000 Sina Weibo posts for each year in 2012-2018.
The 14,440,000 posts from each year are split into 10 folds. Because of Zenodo's size limit, this dataset contains only the first fold from each year.
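As a rough illustration of the fold arithmetic, the sketch below splits a stand-in collection into 10 folds. The description does not say how posts were assigned to folds or whether the folds are exactly equal in size, so the seeded random shuffle and the helper name `split_into_folds` are assumptions made only for this example.

```python
import random


def split_into_folds(items, n_folds=10, seed=0):
    """Shuffle items and deal them into n_folds near-equal folds.

    Assumption: random assignment. The dataset description does not
    state how posts were actually distributed across folds.
    """
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)
    # Fold i takes every n_folds-th item starting at offset i.
    return [shuffled[i::n_folds] for i in range(n_folds)]


POSTS_PER_YEAR = 14_440_000
N_FOLDS = 10
# If the folds are equal in size, each holds one tenth of a year's posts.
posts_per_fold = POSTS_PER_YEAR // N_FOLDS

# Small demonstration with stand-in post IDs rather than real Weibo data.
folds = split_into_folds(range(100), n_folds=N_FOLDS)
```

Under the equal-size assumption, each fold would hold 1,444,000 posts per year.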
Each model was trained for 20 iterations, and each word vector has 300 dimensions.
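For readers who want to train a comparable model, here is a minimal sketch using the official `fasttext` Python package. Only the two values `dim=300` and `epoch=20` come from the description. The choice of package, the default skip-gram objective, the assumption that the input file is already word-segmented, and the function name `train_year_model` are all assumptions, since the description does not say which implementation or other settings were used.

```python
# Hyperparameters stated in the description; everything else is left at
# the library defaults, which may differ from what the authors used.
TRAIN_PARAMS = {"dim": 300, "epoch": 20}


def train_year_model(corpus_path, model_path):
    """Train an unsupervised fastText model on one text file and save it.

    Assumption: corpus_path points to a UTF-8 text file with one
    pre-segmented post per line (tokens separated by spaces).
    """
    # Imported lazily so this module can be loaded without fasttext installed.
    import fasttext

    model = fasttext.train_unsupervised(input=corpus_path, **TRAIN_PARAMS)
    model.save_model(model_path)
    return model
```

A saved model of this kind can then be loaded with `fasttext.load_model(model_path)` and queried with `get_word_vector`.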
Files
(40.5 GB)
| Checksum | Size |
|---|---|
| md5:a1434102627e5af021091c69bf61c3c3 | 53.3 MB |
| md5:7e073d28e9742da7f37e960a25be9584 | 638.9 MB |
| md5:71e14e860758778700aa6e9d299b6a87 | 1.9 GB |
| md5:0594d4a36ef64405ce77d6b769e38c4a | 638.9 MB |
| md5:592733f6032b1004646e064d0bddcfff | 638.9 MB |
| md5:7d84871108c00b59177ff3505fe83676 | 1.9 GB |
| md5:a0ec7253579bda312c5aadeaff290a61 | 638.9 MB |
| md5:babaa7a1d0fa116d6702667d61f8179f | 51.0 MB |
| md5:e8a5a7c67aee95900c7863830e170397 | 603.8 MB |
| md5:78e68f218318ea725ee50a32b7ceb83a | 1.8 GB |
| md5:28019e7e77b002f68156e185b6228b84 | 603.8 MB |
| md5:1f33d32d68eb2b7075e904497d8f844e | 603.8 MB |
| md5:46048816534bf78447980f6aabe3e5df | 1.8 GB |
| md5:07be16f644664bfbf30ee62731839259 | 603.8 MB |
| md5:7f89a6e80ada839687dbfd06990d8366 | 40.2 MB |
| md5:1cdab702f15b45699aca677c1bb1ab3b | 465.3 MB |
| md5:166d701cb5141a39a68d2e13205285b8 | 1.5 GB |
| md5:f79e385299c18c8101801764476e7856 | 465.3 MB |
| md5:7f4271aa0777c5bb2a51dabdc1a9f516 | 465.3 MB |
| md5:b0a0d32af4c60e1493d3e38dd3d496b9 | 1.5 GB |
| md5:984ca9542a27fc4687460ff70b84812a | 465.3 MB |
| md5:a6256c85a4d5b3f8a63beedcd8432024 | 37.2 MB |
| md5:2f472bc82271323e477c2065de9527eb | 424.4 MB |
| md5:c7ca9e0a3a1c20e4016ab2f0bc597de0 | 1.5 GB |
| md5:5083bce0ef7a307d0212dc0606ff84c7 | 424.4 MB |
| md5:546add0698190023a5421fc468f10bae | 424.4 MB |
| md5:1d2c96cda25f8a01ffca77550b8ee8db | 1.5 GB |
| md5:4c773988f1660f4745291188527e3add | 424.4 MB |
| md5:4b7cdc5d0368a4109838307b4a718da1 | 53.4 MB |
| md5:214b4ad70bcaaba2f681d8c59b9cc7d2 | 631.7 MB |
| md5:6a6a22d1e674e076932299b823317639 | 1.9 GB |
| md5:bf6f09c22fc4a11cad0f2b5a8f193306 | 631.7 MB |
| md5:3ad0d05e571ac37a35601feffed0913c | 631.7 MB |
| md5:d8f07fe43e4c1fbe4eda8b7f8c902fcf | 1.9 GB |
| md5:3dd71320560b277b21f57fefa3109ad5 | 631.7 MB |
| md5:e07ef24f7748de7086c530bce2667989 | 51.7 MB |
| md5:4fa1ddf48089601ee857bc34e1367133 | 610.3 MB |
| md5:091321260b3b74bf17671dbce655cf5c | 1.9 GB |
| md5:0b2e2737e0d5a42211c08f95a98e252b | 610.3 MB |
| md5:7f73f6c4f0a5f592054424239bd35b4c | 610.3 MB |
| md5:5a877ff87b3f34862bb89a2754dbd116 | 1.9 GB |
| md5:6d24dbd06ca3a230bca36201c3a98c2d | 610.3 MB |
| md5:9c0bdd6bde44db92440efdf798621d64 | 48.4 MB |
| md5:efb5b246bbe6247ec162a20ccf555457 | 566.3 MB |
| md5:35c0b7b458433f41bbe081c4457c37c5 | 1.8 GB |
| md5:3093a6ea8db9b23ab575b28f613f7b67 | 566.3 MB |
| md5:2a1023f36575e332ed08e7a57b788385 | 566.3 MB |
| md5:fb8af543d3da99beac7db7ca754eb8d5 | 1.8 GB |
| md5:93ea4618094bc18b82edbbe7fd749de3 | 566.3 MB |
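Since several of these files are close to 2 GB, it may be worth checking each download against its listed MD5 checksum. The following is a minimal sketch using only the Python standard library. The helper names `md5_of_file` and `matches_checksum` are made up for this example, and the local file path is whatever name the file was saved under.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read in chunks so multi-gigabyte files do not have to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected):
    """Compare a file's MD5 with a listed value such as 'md5:a143...'."""
    # Accept the value with or without the 'md5:' prefix used in the file list.
    expected_hex = expected.strip().lower().split(":", 1)[-1]
    return md5_of_file(path) == expected_hex
```

For example, `matches_checksum(local_path, "md5:a1434102627e5af021091c69bf61c3c3")` should return `True` when `local_path` holds an intact copy of the first file in the list.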