VIOGEN_DATASET
Authors/Creators
Description
The dataset comprises three files that aggregate processed data derived from the open datasets of three cities: Dublin, San Francisco, and Valencia. The data are mapped to a grid of 25 m² for Valencia and 50 m² for Dublin and San Francisco. The files are named DATA_ES_VLC.csv (Valencia), DATA_IE_DUB.csv (Dublin), and DATA_US_SFO.csv (San Francisco). A further file, DATA_TWT.csv, contains tweets collected through web scraping and analysed with natural language processing (NLP) algorithms and neural networks in order to identify and classify tweets that discuss gender-based violence in the city of Valencia. Another file, MAP_ES_VLC.csv, contains points collected during various mapathons conducted by the Polytechnic University of Valencia campus for a science project aimed at identifying potentially unsafe locations.
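As an illustration of what mapping data to a grid can look like, here is a minimal sketch that assigns points to square cells and counts them per cell. It is not taken from the dataset's own processing code: the function names `grid_cell` and `aggregate_counts`, the use of projected coordinates in metres, and the `cell_size` value are all assumptions made for this example.

```python
import math


def grid_cell(x, y, cell_size):
    """Return the (column, row) index of the square cell containing (x, y).

    Assumes x and y are projected coordinates in metres and cell_size is
    the side length of a cell in metres (hypothetical parameters).
    """
    return math.floor(x / cell_size), math.floor(y / cell_size)


def aggregate_counts(points, cell_size):
    """Count how many points fall into each grid cell."""
    counts = {}
    for x, y in points:
        cell = grid_cell(x, y, cell_size)
        counts[cell] = counts.get(cell, 0) + 1
    return counts


if __name__ == "__main__":
    # Made-up sample points, for illustration only.
    sample = [(1.0, 1.0), (2.0, 2.0), (30.0, 1.0)]
    print(aggregate_counts(sample, cell_size=25.0))
```

Flooring the divided coordinates keeps the cell indices consistent for negative coordinates as well, which plain integer truncation would not.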
Files
(195.0 MB)
| MD5 checksum | Size |
|---|---|
| md5:ca071f1be826a0eb172940eb5f1a738c | 69.2 MB |
| md5:3878c0c647cac3fee34fbf73dde9f8ac | 60.9 MB |
| md5:98686d60d487098b63ff7ed23223c168 | 61.0 MB |
| md5:6d25584e064ecf328e9604793d557286 | 25.9 kB |
| md5:5e2923723e884f3f14fe1cc961e82146 | 34.2 kB |
| md5:def955402dfb4114614fc6eec8837371 | 3.9 MB |
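The MD5 values listed for the files can be used to check that a download arrived intact. The following is a minimal sketch using Python's standard `hashlib` module. The helper names `md5_of_file` and `matches_listed_checksum` are made up for this example, and which checksum belongs to which file has to be confirmed on the record page, since the listing above does not pair them with file names.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large CSV files need not fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_checksum(path, listed):
    """Compare a file against a listed value such as 'md5:<hex digest>'."""
    expected = listed.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected
```

For example, `matches_listed_checksum("DATA_TWT.csv", "md5:<value from the record page>")` would return `True` only if the local copy is byte-for-byte identical to the published file.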
Additional details
Funding
- Conselleria de Innovación, Universidades, Ciencia y Sociedad Digital
- AICO2022 CIAICO/2021/292
Software
- Repository URL: https://github.com/Carma64c/CriteriaTaronja
- Programming language: Python
- Development Status: Active