Published April 26, 2023 | Version v2
Journal article Open

Modelling underreported Spatio-temporal Crime Events

  • 1. Universidad de los Andes
  • 2. University of Carnegie Melon
  • 3. Universidad Nacional de Colombia

Description

The code needed to replicate our work is available in our GitHub Repository

Description of the files

  • distance_1000.csv: is a data frame with 5000 rows and 3 columns. Each row is a time step of the algorithms, and reports the euclidean distance between the vector with the real crime rate in each cell and the estimation made by the algorithm. The exercise was performed in the case of 1,000 arms and at most 100 super arms. This file is created in the times.py script of our repository.
  • distance_10000.csv: is a data frame with 5000 rows and 3 columns. Each row is a time step of the algorithms, and reports the euclidean distance between the vector with the real crime rate in each cell and the estimation made by the algorithm. The exercise was performed in the case of 10,000 arms and at most 1,000 super arms. This file is created in the times.py script of our repository.
  • distance_50000.csv: is a data frame with 5000 rows and 3 columns. Each row is a time step of the algorithms, and reports the euclidean distance between the vector with the real crime rate in each cell and the estimation made by the algorithm. The exercise was performed in the case of 50,000 arms and at most 5,000 super arms. This file is created in the times.py script of our repository.
  • grilla_bogota.csv: is a data frame with 1638 rows and 5 columns in which each row described one grid of Bogotá. The difference between this file and grilla_bogota2.csv is that this file is used to plot Figure 9 which includes the rural area of the city. Something that is removed in our analysis due to the low density of crime in this zone. This file is created in the 3_create_grid.ipynb script of our repository.
  • grilla_bogota2.csv: is a data frame with 1008 rows and 10 columns in which each row described one grid of Bogotá. This file is more complete than grilla_bogota.csv because it includes the name of the Localidad in which the centroid of the cell belongs and its Rep. Rate. However, this file does not contain the rural area of the city. This file is created in the 3_create_grid.ipynb script of our repository.
  • localidades.zip: this zipped folder contains the shapefiles to draw the map of Bogotá with its respective administrative limits. The information contained herein is of a public nature and can also be found on the government's open data page.
  • matriz_eventos_real.csv: is a matrix of 498 rows and 368 columns in which each row represents one cell of Bogota's grid and each column represents the number of real crimes for each date. Recall that we assume that the total of crimes is the combination of NUSE and SIEDCO crimes after the removal of duplicates. This file is created in the 3_create_grid.ipynb script of our repository.
  • matriz_eventos_subreporte.csv: is a matrix of 498 rows and 368 columns in which each row represents one cell of Bogota's grid and each column represents the number of subreported crimes for each date. Recall that we assume that the number of sub-reported crimes is the number of crimes reported in NUSE. This file is created in the 3_create_grid.ipynb script of our repository.
  • subreporte_ccb.csv: is a data frame of 498 rows and 4 columns that describe the Rep. Rate and lambda for each cell of Bogota's grid. This file is created in the 3_create_grid.ipynb script of our repository.
  • upla.zip: this zipped folder contains other extra shapefiles to draw the map of Bogotá with its respective administrative limits. The information contained herein is of a public nature and can also be found on the government's open data page.
  • victimización.xlsx: is an Excel file with 20 rows and 4 columns that contains the Vict. Rate and the Rep. Rate for each Localidad of Bogotá. This information comes from survey-based victimization and victim crime reporting rates presented by Bogotá’s Chamber of commerce (2014).

Files

distance_1000.csv

Files (4.6 MB)

Name Size Download all
md5:f4507ca5c4313cdf7100dc7131fc30b8
294.0 kB Preview Download
md5:d10f64916b24b850b3313fc8cb50b1d4
192.1 kB Preview Download
md5:44bed7e8e294053f88b2e0852d132a33
202.6 kB Preview Download
md5:5c40c4ef031909465f8333bcd968a8ef
472.1 kB Preview Download
md5:2b5f8197a1d597ece9cb00979843a1ff
343.1 kB Preview Download
md5:62dc33b028e46a21d617cac2bbd25c71
635.3 kB Preview Download
md5:66636503dd4cc8af2b485ad55f469e95
744.4 kB Preview Download
md5:fde08f36fd6c1e51463fd4cba53053a3
744.1 kB Preview Download
md5:1eca631e15cad096b03f6bed6b003c52
11.6 kB Preview Download
md5:33febf813b109de468e2d4adfe49fa56
935.0 kB Preview Download
md5:ada21310b84043f76772912644850e20
11.4 kB Download