Analytical Center of University Cultural Productions in the Context of the Conflict (caPAZ)
Description
This dataset is a collection of journalistic articles written by university students in Colombia. It is a product of the project Analytical Center of University Cultural Productions in the Context of the Conflict (caPAZ), funded by the Ministry of Science, Technology and Innovation (Minciencias) and the National Center for Historical Memory (CNM) of Colombia (code 1349-872-76354, agreement 872 of 2020). The corpus covers news published from 2001 to 2021 by the 8 university media outlets of the Colombian Network of College Journalism. It consists of digital news, for a total of 2373 items related to the armed conflict, the memory of the victims, and the peace process in Colombia. The items were collected by web scraping with 3 lemmatized keywords (conflicto armado, memoria de las víctimas, proceso de paz), which were matched as regular expressions against the HTML structure of each web page.
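The keyword filter described above can be illustrated with a minimal sketch. This is not the project's scraper: the function names (`strip_accents`, `html_to_text`, `matched_keywords`) are hypothetical, and accent and case folding stands in for the lemmatization step, whose details the description does not give. Only the three keywords come from the description.

```python
import re
import unicodedata
from html.parser import HTMLParser

# The three keywords named in the dataset description.
KEYWORDS = ["conflicto armado", "memoria de las víctimas", "proceso de paz"]


def strip_accents(text):
    """Remove combining accent marks (assumed stand-in for lemmatization)."""
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if unicodedata.category(ch) != "Mn")


class TextExtractor(HTMLParser):
    """Collect the text nodes of an HTML page, ignoring the tags."""

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        self.chunks.append(data)


def html_to_text(html):
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)


def build_pattern(keyword):
    """Compile a keyword into a regex that tolerates variable whitespace."""
    words = strip_accents(keyword).lower().split()
    return re.compile(r"\b" + r"\s+".join(re.escape(w) for w in words) + r"\b")


PATTERNS = {kw: build_pattern(kw) for kw in KEYWORDS}


def matched_keywords(html):
    """Return the keywords found in the page text, in KEYWORDS order."""
    text = strip_accents(html_to_text(html)).lower()
    return [kw for kw, pattern in PATTERNS.items() if pattern.search(text)]
```

Under these assumptions, a page would be kept for the corpus when `matched_keywords(page_html)` returns a non-empty list. A real lemmatizer would also catch inflected forms that simple accent folding misses.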
Files
rosario_DigitalMedio.txt
Total size: 10.0 MB
MD5 checksum | Size |
---|---|
md5:ed1db57c506e6cf96e8fba544dcd5f7b | 1.1 MB |
md5:944c22673c52736816870be2ecfea52c | 482.2 kB |
md5:f4ad053bfa4a8de2cb295ba96ef1598e | 4.2 MB |
md5:3e37a55e6e2eece9aa1f9675654506e2 | 1.3 MB |
md5:8ea2fabbfe5967b8580e7de09d4783f6 | 1.5 MB |
md5:eb72a23afa1c00401d87e5eb3ece670f | 1.1 MB |
md5:b30505f18bcbc88aa30d7a1378691394 | 466.2 kB |
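A downloaded file can be checked against the MD5 values listed above. The following is a minimal sketch using Python's standard `hashlib` module; the helper names `md5_of_file` and `matches_listing` are hypothetical, and because the listing does not show which file name belongs to which checksum, that pairing has to be taken from the repository page itself.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, listed):
    """Compare a file with a listed value such as 'md5:ed1d...'."""
    # Drop the 'md5:' prefix if present, then compare case-insensitively.
    expected = listed.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected
```

A mismatch usually means the download was truncated or the file was paired with the wrong checksum.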