GeoChanges QA
Creators
- 1. Dept. of Informatics and Telecommunications, National and Kapodistrian University of Athens
- 2. L3S Research Center, Leibniz Universität Hannover, Germany
- 3. Dept. of Informatics and Telecommunications, National and Kapodistrian University of Athens, Greece
Description
This work introduces GeoChangesQA, a novel spatiotemporal QA dataset for historical geospatial knowledge. To create this dataset, we first developed the Historical County Boundaries Ontology (HCB-O). We then used historical geospatial data covering regions of the United States from 1630 to 2000 to populate the novel Historical County Boundaries Knowledge Graph (HCB-KG). Next, we developed a semi-automated procedure that generates questions, GeoSPARQL queries, and corresponding answers over HCB-KG using subgraph and query template extraction techniques. With this method, we automatically generated 8,900 question-query-answer triples.
The code used to create this dataset is available here: github repo
The knowledge graph (RDF dumps) that the queries refer to is available here: DOI 10.5281/zenodo.11508198
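The template-based generation described above can be illustrated with a minimal sketch. It assumes each subgraph type comes with a paired question template and query template that share placeholders. The template wording, the `ex:` vocabulary, the example URI, and the `instantiate` helper are all hypothetical and are not taken from HCB-O or from the released templates; only the `geo:sfTouches` property comes from the GeoSPARQL vocabulary.

```python
from string import Template

# Hypothetical paired templates. The placeholders and the ex: vocabulary
# are illustrative only; they are not the templates shipped with the dataset.
QUESTION_TEMPLATE = Template("Which counties bordered $label in $year?")
QUERY_TEMPLATE = Template(
    "PREFIX geo: <http://www.opengis.net/ont/geosparql#>\n"
    "PREFIX ex: <http://example.org/hcb#>\n"
    "SELECT ?neighbor WHERE {\n"
    "  ?neighbor geo:sfTouches <$uri> .\n"
    "  ?neighbor ex:validInYear $year .\n"
    "}"
)


def instantiate(label, uri, year):
    """Fill both templates with the same bindings and return (question, query)."""
    bindings = {"label": label, "uri": uri, "year": year}
    question = QUESTION_TEMPLATE.substitute(bindings)
    query = QUERY_TEMPLATE.substitute(bindings)
    return question, query


if __name__ == "__main__":
    # Made-up entity, used only to show the mechanics.
    q, s = instantiate("Example County", "http://example.org/hcb/county/1", 1850)
    print(q)
    print(s)
```

Filling both templates from one set of bindings keeps each question aligned with its query, which is the property a question-query-answer triple relies on. The answer would then come from running the query against the knowledge graph, a step not shown here.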
Series information (English)
Columns Description:
- Subgraph ID: Unique ID of the subgraph type
- SparqlQuery: The SPARQL query
- Questions: The question after replacing Uris_match with template variables
- Sample Answer: One sample answer to the SPARQL query
Question_Templates:
- Subgraph ID: Unique ID of the subgraph type
- Graph: The graph schema for each Subgraph ID
- Questions Templates: A list of all question templates
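As a rough sketch of how the columns above might be consumed, the following groups the question-query-answer rows by subgraph type. It assumes the CSV header uses exactly the column names listed above (`Subgraph ID`, `SparqlQuery`, `Questions`, `Sample Answer`), which has not been checked against the file. The `group_by_subgraph` helper and the sample rows are made up for illustration.

```python
import csv
import io
from collections import defaultdict


def group_by_subgraph(csv_file):
    """Group (question, query, sample answer) tuples by their Subgraph ID.

    csv_file is any open text file or file-like object whose header row
    is assumed to contain the column names listed in the description.
    """
    groups = defaultdict(list)
    for row in csv.DictReader(csv_file):
        groups[row["Subgraph ID"]].append(
            (row["Questions"], row["SparqlQuery"], row["Sample Answer"])
        )
    return dict(groups)


if __name__ == "__main__":
    # Made-up rows standing in for the real dataset file.
    sample = io.StringIO(
        "Subgraph ID,SparqlQuery,Questions,Sample Answer\r\n"
        "1,SELECT ?x WHERE { ?x ?p ?o },Example question one?,answer-a\r\n"
        "1,SELECT ?y WHERE { ?y ?p ?o },Example question two?,answer-b\r\n"
    )
    for subgraph_id, triples in group_by_subgraph(sample).items():
        print(subgraph_id, len(triples))
```

To run this on the released file, one would pass a handle opened with `open("GeoChanges_QA_Dataset_v5.csv", newline="", encoding="utf-8")` in place of the in-memory sample; the UTF-8 encoding is also an assumption.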
Files
Name | Size | Checksum |
---|---|---|
GeoChanges_QA_Dataset_v5.csv | 7.9 MB | md5:a492520d37a9031285f694653ddde0c6 |
(file name not preserved) | 18.5 kB | md5:caff4c89c44ff751fcb8628428d0e8f0 |