There is a newer version of the record available.

Published July 8, 2024 | Version v1
Dataset Open

VIOGEN_DATASET

Description

The dataset is composed of three distinct files which aggregate processed data derived from open datasets of three cities: Dublin, San Francisco, and Valencia. The data has been mapped to a grid of 25m² for Valencia and 50m² for Dublin and San Francisco. The respective files are named DATA_ES_VLC.csv, DATA_IE_DUB.csv, and DATA_US_SFO.csv. Additionally, there is a dataset for tweets named DATA_TWT.csv, which contains tweets collected through web scraping and analysed using natural language processing (NLP) algorithms and neural networks. The aim is to identify and classify tweets that discuss gender-based violence in the city of Valencia. Another file, MAP_ES_VLC.csv, includes points collected during various mapathons conducted by the Polytechnic University of Valencia campus for a science project aimed at identifying potentially insecure locations.

Files

DATA_ES_VLC.csv

Files (195.0 MB)

Name Size Download all
md5:ca071f1be826a0eb172940eb5f1a738c
69.2 MB Preview Download
md5:3878c0c647cac3fee34fbf73dde9f8ac
60.9 MB Preview Download
md5:98686d60d487098b63ff7ed23223c168
61.0 MB Preview Download
md5:6d25584e064ecf328e9604793d557286
25.9 kB Download
md5:5e2923723e884f3f14fe1cc961e82146
34.2 kB Preview Download
md5:def955402dfb4114614fc6eec8837371
3.9 MB Preview Download

Additional details

Funding

Conselleria de Innovación, Universidades, Ciencia y Sociedad Digital
AICO2022 CIAICO/2021/292

Software

Repository URL
https://github.com/Carma64c/CriteriaTaronja
Programming language
Python
Development Status
Active