Published March 27, 2018
| Version v1
Dataset
Open
ConceptNet Vector Ensemble 16.04 input data
Description
This is the data required to build the paper "An Ensemble Method to Build High-Quality Word Embeddings", by Robyn Speer and Joshua Chin.
The input data itself comes from:
-
ConceptNet 5.4, which contains data from Wiktionary, WordNet, and many contributors to Open Mind Common Sense projects, edited by Robyn Speer
-
GloVe, by Jeffrey Pennington, Richard Socher, and Christopher Manning
-
word2vec, by Tomas Mikolov and Google Research
-
PPDB, by Juri Ganitkevitch, Benjamin Van Durme, and Chris Callison-Burch
Files
conceptnet5.csv
Files
(12.7 GB)
Name | Size | Download all |
---|---|---|
md5:1c91d2ba803cec741408358bd9e4fdd9
|
400.6 MB | Preview Download |
md5:01fcdb413b93691a7a26180525a12d6e
|
5.0 GB | Preview Download |
md5:eec7d467bccfa914726b51aac484d43a
|
5.6 GB | Preview Download |
md5:558cb3a7aab03697577efa711f98512b
|
9.0 MB | Preview Download |
md5:1c892c4707a8a1a508b01a01735c0339
|
1.6 GB | Download |