Published March 27, 2018 | Version v1
Dataset Open

ConceptNet Vector Ensemble 16.04 input data

  1. Luminoso
  2. Union College

Description

This dataset contains the input data required to build the paper "An Ensemble Method to Build High-Quality Word Embeddings" by Robyn Speer and Joshua Chin.

The input data itself comes from:

  • ConceptNet 5.4, which contains data from Wiktionary, WordNet, and many contributors to Open Mind Common Sense projects, edited by Robyn Speer

  • GloVe, by Jeffrey Pennington, Richard Socher, and Christopher Manning

  • word2vec, by Tomas Mikolov and Google Research

  • PPDB, by Juri Ganitkevitch, Benjamin Van Durme, and Chris Callison-Burch

Files

conceptnet5.csv

Files (12.7 GB)

Checksum                                Size
md5:1c91d2ba803cec741408358bd9e4fdd9    400.6 MB
md5:01fcdb413b93691a7a26180525a12d6e    5.0 GB
md5:eec7d467bccfa914726b51aac484d43a    5.6 GB
md5:558cb3a7aab03697577efa711f98512b    9.0 MB
md5:1c892c4707a8a1a508b01a01735c0339    1.6 GB