Published April 22, 2021 | Version 3.0.1
Dataset Open

AnCora 3.0.1 Spanish

  • 1. Universitat de Barcelona

Description

AnCora 3.0.1 Spanish consist of 500,000 words annotated at different levels:

  • Lemma and Part of Speech
  • Syntactic constituents and functions
  • Argument structure and thematic roles
  • Semantic classes of the verb
  • Denotative type of deverbal nouns
  • Nouns related to WordNet synsets
  • Named Entities
  • Coreference relations
  • Implicit arguments of deverbal nominalizations

AnCora corpus is mainly based on journalist texts. For more information, click AnCora-corpus.

The annotators of AnCora are:

Esther Arias, Joan Aparicio, Oriol Borrega, Isabel Briz, Núria Bufí, Montserrat Civit, María Jesús Díaz, Silvia Garcia, Raquel Hernández, Marina Lloberes, Raquel Marcos, Difda Monterde, Borja Navarro, Montserrat Nofre, Aina Peris, Lourdes Puiggròs, Marta Recasens, Bàrbara Soriano, Rita Zaragoza.

 

 

Files

AnCora 3.0.1 Spanish.zip

Files (12.4 MB)

Name Size Download all
md5:f3aabe8403f6e0c879ff3b8a1c72e121
12.4 MB Preview Download