Published April 5, 2022 | Version 1.0
Dataset Open

NewsCom-NEG Corpus

  • 1. Universitat de Barcelona

Description

The NewsCom corpus consists of 2955 comments posted in response to 18 news articles obtained from online Spanish newspapers from August 2017 to May 2019. The articles cover nine topics, with two articles per topic: immigration, politics, technology, terrorism, economy, society, religion, refugees, and real estate. The corpus contains 2965 negative structures, each with its corresponding negation marker, scope, and focus. It can be used to train and evaluate systems that automatically detect the scope and focus of negation, and to support linguistic analysis of negation grounded in real data.

Files

NewsCom-NEG.zip (394 Bytes)
md5:fb5211549d4ac02ba1fb156a02a48b96

Additional details

References

  • Taulé, M., Nofre, M., González, M., & Martí, M. (2021). Focus of negation: Its identification in Spanish. Natural Language Engineering, 27(2), 131-152. doi:10.1017/S1351324920000388