Published March 25, 2022 | Version v1
Dataset Open

Analytical Center of University Cultural Productions in the Context of the Conflict (caPAZ)

  • 1. Universidad del Rosario

Contributors

Hosting institution:

  • 1. Universidad Católica de Pereira

Description

This dataset comprises a collection of journalistic articles written by young university students in Colombia. It is a product of the project Analytical Center of University Cultural Productions in the Context of the Conflict (caPAZ), funded by the Ministry of Science, Technology and Innovation (Minciencias) and the National Center for Historical Memory (CNM) of Colombia (code 1349-872-76354, agreement 872 of 2020). The corpus consists of digital news published from 2001 to 2021 by the 8 college media outlets of the Colombian Network of College Journalism, for a total of 2373 news items related to the armed conflict, the memory of the victims and the peace process in Colombia. The news items were collected by web scraping with 3 lemmatized keywords (conflicto armado, memoria de las víctimas, proceso de paz), which were matched as regular expressions while traversing the HTML structure of each web page.
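The keyword-matching step described above could look roughly like the following Python sketch. It is not the project's actual scraper: the function names (`normalize`, `matched_keywords`), the accent- and case-insensitive normalization, and the decision to skip script and style content are all assumptions made for illustration, and no lemmatization is performed here.

```python
import re
import unicodedata
from html.parser import HTMLParser

# The three keyword phrases named in the dataset description.
KEYWORDS = ["conflicto armado", "memoria de las víctimas", "proceso de paz"]


class _TextExtractor(HTMLParser):
    """Collects text nodes, skipping the content of <script> and <style>."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip:
            self._skip -= 1

    def handle_data(self, data):
        if not self._skip:
            self.parts.append(data)


def normalize(text):
    """Lowercase, strip accents, collapse whitespace (an assumed normalization)."""
    decomposed = unicodedata.normalize("NFD", text.lower())
    no_accents = "".join(c for c in decomposed if unicodedata.category(c) != "Mn")
    return re.sub(r"\s+", " ", no_accents).strip()


# One compiled regular expression per keyword, built from the normalized phrase.
PATTERNS = {
    kw: re.compile(r"\b" + re.escape(normalize(kw)) + r"\b") for kw in KEYWORDS
}


def matched_keywords(html):
    """Return the keywords whose pattern occurs in the page's text."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    text = normalize(" ".join(parser.parts))
    return [kw for kw, pattern in PATTERNS.items() if pattern.search(text)]
```

A page would then be kept for the corpus when `matched_keywords(page_html)` returns a non-empty list; fetching the pages and following links are left out of this sketch.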

Files

rosario_DigitalMedio.txt

Files (10.0 MB)

Checksum (MD5)                          Size
md5:ed1db57c506e6cf96e8fba544dcd5f7b    1.1 MB
md5:944c22673c52736816870be2ecfea52c    482.2 kB
md5:f4ad053bfa4a8de2cb295ba96ef1598e    4.2 MB
md5:3e37a55e6e2eece9aa1f9675654506e2    1.3 MB
md5:8ea2fabbfe5967b8580e7de09d4783f6    1.5 MB
md5:eb72a23afa1c00401d87e5eb3ece670f    1.1 MB
md5:b30505f18bcbc88aa30d7a1378691394    466.2 kB
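
A downloaded file can be checked against one of the md5 values listed above. The following Python sketch uses only the standard library; the helper names (`md5_of_file`, `matches_listed_checksum`) are hypothetical, and because the listing does not show which checksum belongs to which file name, the caller has to supply that pairing.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_checksum(path, listed):
    """Compare a file against a listed value such as 'md5:<32 hex digits>'."""
    expected = listed.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[len("md5:"):]
    return md5_of_file(path) == expected
```

For example, `matches_listed_checksum(local_path, listed_value)` returns `True` only when the local copy is byte-for-byte identical to the file that produced the listed checksum.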