Published March 30, 2017 | Version v1
Conference paper Open

Efficient, Compositional, Order-Sensitive n-gram Embeddings

  • 1. Johns Hopkins University

Description

This is the companion data for the paper, `"Efficient, Compositional, Order-Sensitive n-gram Embeddings, Adam Poliak, Pushpendre Rastogi, M. Patrick Martin, Benjamin Van Durme, EACL(2017).` For more details see https://www.cs.jhu.edu/~apoliak1/papers/ECO--EACL-2017.pdf

 

@inproceedings{Poliak:2017EACL,
Title = {Efficient, Compositional, Order-sensitive n-gram Embeddings},
Author = {Poliak, Adam and Rastogi, Pushpendre and Martin, M. Patrick and Van Durme, Benjamin},
booktitle = {Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics},
Year = {2017},
Publisher = {Association for Computational Linguistics},
location = {Valencia, Spain}
}

This data contains individual skip-embeddings created and the English Wikipedia data used to generate the embeddings.

dim100_c10.tar.gz is missing the skip-embeddings 3 positions to the right of a given word. They can be downloaded from http://www.cs.jhu.edu/~apoliak1/data/eco/cocoon.mincount~5.dim~100.window~3.dim_divide~10.embeds.gz

Files

Files (47.0 GB)

Name Size Download all
md5:6d1216233b68d0718f2d88af6a6ac7c9
1.6 GB Download
md5:06c9aeb4feba23c7c5e1144344dc991c
1.7 GB Download
md5:339af0f0fa715ba4d1b5f512b66ada90
8.3 GB Download
md5:eb63a708039ed6eb7c7dbd2e3ad841b7
8.1 GB Download
md5:908b6af529c172c41f6355cefd7dc1eb
11.5 GB Download
md5:cbe38658f2a6034a2ceca069ee2d528a
11.3 GB Download
md5:8b350cdc481e5e95f979f175a7ceb589
4.5 GB Download