There is a newer version of the record available.

Published October 4, 2024 | Version v1

Integrated Datasets for analyses on potentially hazardous locations for women in Valencia, Dublin, San Francisco, and Toluca

Description

This dataset provides a compilation of the data used to analyze and identify potentially dangerous
places for women. Multiple data collection techniques, including official data downloads, web
scraping, and participatory mapping, were combined for integration, applying specific processing.
The datasets refer to four cities: Valencia (Spain), Dublin (Ireland), San Francisco (United States),
and Toluca (Mexico).
Depending on the availability and context of each city, the datasets are classified into three
categories: DATA, TWT, and MAP. The DATA prefix refers to files containing the results of the
analysis of socioeconomic variables downloaded from official sources; for the mapping, the
standard territorial unit was a 25x25 m grid for Valencia and 50x50 m for Dublin and San Francisco.
The files with the prefix TWT are composed of datasets containing tweets collected through web
scraping and analyzed using natural language processing (NLP) algorithms and neural networks;
the purpose is to identify and classify tweets related to gender violence, feelings of fear, or
perceptions of insecurity. For MAP files, participants gathered them through participatory
mapping processes, using specific calls to public space users and a supporting web application
designed for this purpose.

Files

data dictionary.pdf

Files (189.8 MB)

Name Size Download all
md5:e2a535f922229dd44274f8ef0629cf7a
131.3 kB Preview Download
md5:e3bb5b997d9ca7752a6f63fa78a62b02
66.8 MB Preview Download
md5:85f08eef708c2eb01faf9967bfa1384e
58.0 MB Preview Download
md5:adfb8201eb776997124161ce3058391a
60.6 MB Preview Download
md5:378fe2baca5b2fc9cad89f512e07b670
34.4 kB Preview Download
md5:5251bdc291018247ee8d04a434149b3b
389.7 kB Preview Download
md5:370d234e58d19ad7ea2210be93c6ebd2
3.8 MB Preview Download
md5:52dc09c12c4f1a19efe8ccd794e53e71
56.5 kB Preview Download

Additional details

Funding

Conselleria de Innovación, Universidades, Ciencia y Sociedad Digital
AICO2022 CIAICO/2021/292

Software

Repository URL
https://github.com/Carma64c/CriteriaTaronja
Programming language
Python
Development Status
Active