Published June 3, 2019 | Version v1
Dataset Open

Word2Vec Models Dutch Newspapers

  • 1. DHLab - KNAW Humanities Cluster

Description

Word embedding models trained on six national Dutch newspapers.

We used the Gensim implementation of Word2Vec to train four embedding models per newspaper, one for each decade between 1950 and 1990. The models were trained using the continuous bag-of-words (CBOW) architecture with hierarchical softmax, a dimensionality of 300, a minimum word count of 5, a context window of 5, and a downsampling threshold of 10^-5.
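The training settings described above can be sketched as a Gensim configuration. This is a minimal sketch, not the authors' training script: the keyword names follow Gensim 4.x (in Gensim 3.x, `vector_size` was called `size`), and `W2V_PARAMS` and `train_decade_model` are hypothetical names introduced here for illustration. The description does not say whether negative sampling was disabled, so that parameter is left at the library default.

```python
# Hyperparameters taken from the description above.
# Keyword names assume Gensim 4.x; this is an illustrative sketch only.
W2V_PARAMS = {
    "sg": 0,             # 0 selects CBOW (1 would select skip-gram)
    "hs": 1,             # use hierarchical softmax
    "vector_size": 300,  # dimensionality of the embeddings
    "min_count": 5,      # ignore words seen fewer than 5 times
    "window": 5,         # context window size
    "sample": 1e-5,      # downsampling threshold for frequent words
}


def train_decade_model(sentences, **overrides):
    """Train one model on an iterable of tokenized sentences.

    `sentences` is assumed to hold the text of one newspaper for one decade,
    as a list of token lists. Gensim is imported lazily so the parameter
    dictionary can be inspected without the library installed.
    """
    from gensim.models import Word2Vec

    params = {**W2V_PARAMS, **overrides}
    return Word2Vec(sentences=sentences, **params)
```

Calling `train_decade_model(sentences, workers=4)` would train a single model; repeating this per newspaper and per decade would yield the 24 models described here.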

These models accompany the article "Using Word Embeddings to Examine Gender Bias in Dutch Newspapers, 1950-1990".

Files (1.9 GB)

Checksum                              Size
md5:4cde1ea3f071a2e3a316170ae23a398b  972.8 kB
md5:bc6943bd6a141730d77514aa455e41d1  84.9 MB
md5:ffd08858883f7a21c7ade717a56085a2  1.0 MB
md5:7761b06f17aa66c1c94c7027f6b5c8e8  88.6 MB
md5:7891055386570f29fdccb4753d31fd39  1.2 MB
md5:68c56a87bc3b872cadf8462acf454c7e  99.7 MB
md5:59361c93825fe381b04be3b36044232c  1.4 MB
md5:49f9128856b074353ad8d33768f13705  116.2 MB
md5:d75048bdf3ab15d27e52b54e177ced96  852.9 kB
md5:8d71109768d2f74b78de9538172bd513  76.0 MB
md5:ed6badb8599f6baff3e6fc0752da030e  831.6 kB
md5:58ad72d5ca0ae5f99091b10edca4d2cf  74.4 MB
md5:a696507de873a9a35804b4587e5480af  807.7 kB
md5:c613ab2cb5513af09ce98fff197c24b6  72.4 MB
md5:7acffc144e612ff7914d8688f5e1a6cc  921.9 kB
md5:7f57e1fe2c9335c16807e8170cb3a7e7  81.3 MB
md5:f8250b5952f6ba97403beb1724f3bc25  722.1 kB
md5:70d606acefa4ac5e2a9646950b9d2ddb  66.0 MB
md5:f0e710ac472cfc3f98d4527227f49769  706.3 kB
md5:d985e1d34b8388a948978f707eed2888  64.3 MB
md5:b9ec4bad03c39e82fd28fb037debc65f  763.7 kB
md5:0f3800107348a316945ecc94e7f18e7c  68.7 MB
md5:8012e71bd728c2ad42eb84ce56ac085d  821.7 kB
md5:fb0ad798b1848c25d85508b4aa9abe6f  72.9 MB
md5:9d3cecd0d9e39739b67d2d270f2b64dc  767.9 kB
md5:79fbe171321f6db4fd3013c7855c80aa  68.6 MB
md5:34d305856aa5f29b06ee03c42d685ce0  782.2 kB
md5:1c709ca5739b9dbf602d0e56e653cfb6  69.8 MB
md5:ffbdfdbbf745ecab1bf3ebe221cba81f  826.5 kB
md5:ba0672e03cbe1b1fbba052e59b583caf  72.8 MB
md5:d28f43566fdc4f4812c4b2ccd3fe4cda  917.3 kB
md5:e5454fda61468b416723a85451eab5c5  80.2 MB
md5:32c9f1cb784df773ec00645ff4b504fc  843.4 kB
md5:e43aa997e98a8e4dce79f0e0f586b087  75.3 MB
md5:f7408d2660698b2db353e5ab9b8ea2c5  868.5 kB
md5:154fe3f9188146357a801ee512bace09  77.2 MB
md5:d037b98e92dbf786f4dd9fffa422425b  935.0 kB
md5:b9aa3c1dd7b52edb5367c3deb84a4fe1  81.6 MB
md5:b832638911dc4f4900146a05c54e273d  1.1 MB
md5:e0212b7e95805f05fcd182467d2ced3f  94.7 MB
md5:71e2a56d1eafd122b4c4b9038964ff59  816.2 kB
md5:81a06a75dac21ebb4693232dda829d34  73.0 MB
md5:0259a2617df35bbd0550aa94896a7452  792.7 kB
md5:7642bec477dbd775ad154a0ab22bf165  70.9 MB
md5:a73ff2c83e6b69f50d594073273de8f1  791.5 kB
md5:8ff93c8eca565f89495c0237c769949a  70.9 MB
md5:f6e92a99ab95ba28bd4a4d1d2f20d08d  912.8 kB
md5:e40347e438d9926ce94afdfb796af2fe  80.3 MB
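
Each file in the listing above is identified by an MD5 checksum, which can be used to confirm that a download completed without corruption. The following is a minimal sketch using only the Python standard library; `md5_of_file` and `checksum_matches` are hypothetical helper names introduced here, and the local file path is whatever name the file was saved under.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        # Read in chunks so large model files do not need to fit in memory.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def checksum_matches(path, expected):
    """Compare a file against a listed checksum such as 'md5:4cde1e...'."""
    expected_hex = expected.strip().lower()
    if expected_hex.startswith("md5:"):
        expected_hex = expected_hex[len("md5:"):]
    return md5_of_file(path) == expected_hex
```

For example, `checksum_matches(local_path, "md5:4cde1ea3f071a2e3a316170ae23a398b")` would return `True` only if the file saved at `local_path` is byte-identical to the first entry in the listing.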