Published May 30, 2018 | Version v1
Dataset Open

The 12 Million Most Frequent English Grammatical Relations and their Frequencies

  • 1. University of Helsinki

Description

A dataset of English grammatical relations extracted from the ukWaC corpus, parsed with spaCy. Each line in the dataset represents one grammatical relation (a word, i.e. the dependent; its head, i.e. the governor; the parts of speech of both words; and the type of relation), together with the number of times that relation occurred in the corpus.
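As a rough illustration of how such lines might be read, the sketch below parses one relation per line and returns the most frequent ones. The description does not state the delimiter or the column order, so the tab separator, the field order, and the names `Relation`, `parse_relation`, and `top_relations` are all assumptions for illustration; check them against the actual file before use.

```python
from collections import namedtuple

# ASSUMPTION: six fields per line, in this order. The dataset description
# lists these pieces of information but not their order or the delimiter.
Relation = namedtuple("Relation", "dependent dep_pos head head_pos rel freq")


def parse_relation(line, sep="\t"):
    """Parse one line into a Relation; the last field is the frequency."""
    fields = line.rstrip("\n").split(sep)
    if len(fields) != len(Relation._fields):
        raise ValueError(
            f"expected {len(Relation._fields)} fields, got {len(fields)}"
        )
    *labels, freq = fields
    return Relation(*labels, int(freq))


def top_relations(lines, n=10, sep="\t"):
    """Return the n most frequent relations among the given lines."""
    records = (parse_relation(line, sep) for line in lines if line.strip())
    return sorted(records, key=lambda r: r.freq, reverse=True)[:n]


# Invented sample lines for demonstration only, not taken from the dataset.
sample = [
    "dog\tNOUN\tbark\tVERB\tnsubj\t42\n",
    "the\tDET\tdog\tNOUN\tdet\t100\n",
]
for relation in top_relations(sample, n=2):
    print(relation)
```

For the full file, passing an open file handle (for example `open("dep_freq_result_sorted_gt_10.txt", encoding="utf-8")`) in place of `sample` would stream it line by line, although sorting still holds every parsed record in memory.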

Files

dep_freq_result_sorted_gt_10.txt

Files (462.9 MB)

Checksum Size
md5:96df410f3cf4a71e87c9f73cccc35cbc 369.6 MB
md5:2d5b9e96a151e61526858158e9d16cf0 93.3 MB