Published April 12, 2023 | Version 1
Dataset Open

Analytical Center of University Cultural Productions in the Context of the Conflict (caPAZ) - Temporality

  • 1. Universidad del Rosario
  • 2. Universidad Católica de Pereira

Description

This dataset comprises a collection of journalistic articles written by young university students in Colombia, which is a product of the project: Analytical Center of University Cultural Productions in the Context of the Conflict (caPAZ), funded by the Ministry of Science, Technology and Innovation (Minciencias) and the National Center for Historical Memory (CNM) of Colombia (under the code: 1349-872-76354, agreement 872 of 2020) This corpus includes news written by the 24 colleges media of the Colombian Network of College Journalism from 2012 to 2016. The dataset includes digital news, for a total of 589 news items related to the armed conflict, the memory of the victims and the peace process in Colombia. These news items were collected through a web-scraping technique, using 3 lemmatized keywords (conflicto armado, memoria de las víctimas y proceso de paz), with the aim of identifying these regular expressions in the logical operators that run through the HTML structure of each Web page

Files

2012_DigitalTemporal.txt

Files (3.0 MB)

Name Size Download all
md5:c14259d635bc9b88b0db161922239537
49.2 kB Preview Download
md5:c4171d8ce07801a0bc474130ce3bf8a4
115.7 kB Preview Download
md5:dc0fff14641ced3bc5e9cb6f5a6eb2f8
349.2 kB Preview Download
md5:e2ccb272fb2d98a68ab2f36c08f091c7
624.0 kB Preview Download
md5:37c1b9bcca5fda887a4d19119dd889ae
1.8 MB Preview Download