Published February 1, 2025 | Version v1
Dataset Open

OPTIMAP: A Dataset for Open Public Transport Infrastructure and Mobility Accessibility Profiles

Description

Introduction

This dataset provides a comprehensive assessment of public transport connectivity across Germany by analyzing both walking distances to the nearest public transport stops as well as the quality of public transport connections for daily usage scenarios with housing-level-granularity on a country-wide scale. The data was generated through a novel approach that integrates multiple open data sources, simulation models, and visual analytics techniques, enabling researchers, policymakers, and urban planners to identify gaps and opportunities for transit network improvements. \newline

Why does it matter?

Efficient and accessible public transportation is a critical component of sustainable urban development. However, many transit networks struggle to adequately serve diverse populations due to infrastructural, financial, and urban planning limitations. Traditional transit planning often relies on aggregated statistics, expert opinions, or limited surveys, making it difficult to assess transport accessibility at an individual household level. This dataset provides a data-driven and reproducible methodology for unbiased country-wide comparisons.

Find more information at https://mobility.dbvis.de.

Key Facts, Download, Citation

Title OPTIMAP: A Dataset for Open Public Transport Infrastructure and Mobility Accessibility Profiles
Acronym OPTIMAP
Download https://mobility.dbvis.de/data-results/OPTIMAP_v2025-02-01.parquet (478MB, parquet)
License Datenlizenz Deutschland - Namensnennung - Version 2.0 (dl-de-by/2.0)

 

Please cite the dataset as:

Maximilian T. Fischer, Daniel Fürst, Yannick Metz, Manuel Schmidt, Julius Rauscher, and Daniel A. Keim. OPTIMAP: A Dataset for Open Public Transport Infrastructure and Mobility Accessibility Profiles. Zenodo, 2025. doi: 10.5281/zenodo.14772646.

or, when using Bibtex

@dataset{MobilityProfiles.DatasetGermany.2025,
  author       = {Fischer, Maximilian T. and
                  Fürst, Daniel and
                  Metz, Yannick and
                  Schmidt, Manuel and
                  Rauscher, Julius and
                  Keim, Daniel A.},
  title        = {OPTIMAP: A Dataset for Open Public Transport Infrastructure and Mobility Accessibility Profiles},
  year         = 2025,
  publisher    = {Zenodo},
  doi          = {10.5281/zenodo.14772646}
}

Dataset Description

The dataset in the PARQUET format includes detailed accessibility measures for public transport at a fine-grained, housing-level resolution. It consists of four columns:

  • lat, lng (float32): GPS coordinates (EPSG:4326) of each house in Germany, expensively compiled from the house coordinates (HK-DE) data provided by the 16 federal states under the EU INSPIRE regulations.
  • MinDistanceWalking (int32): An approximate walking distance (in meters) to the nearest public transport stop from each registered building in Germany.
  • scores_OVERALL (float32): A simulated, demographic- and scenario-weighted measure of public transport quality for daily usage, considering travel times, frequency, and coverage across various daily scenarios (e.g., commuting, shopping, medical visits). The results are represented in an artificial time unit to allow comparative analysis across locations.

Methodology

The dataset was generated using a combination of open geospatial data and advanced transport simulation techniques.

  • Data Sources: Public transit information from the German national access point (DELFI NeTEx), housing geolocation data from various state authorities, and routing information from OpenStreetMap.
  • Walking Distance Calculation: The shortest path to the nearest transit stop was computed using the Dijkstra algorithm on a graph network of publicly available pathways sourced from OSM, considering the ten aerial-nearest public transport stops.
  • Public Transport Quality Estimation: The dataset incorporates a scenario-based simulation model, analyzing weight-averaged travel times and connection frequency to typical daily POIs such as the individually nearest train stations, kindergartens, schools, institutions of higher education, fitness centers, cinemas, places of worship, supermarkets, shopping malls, restaurants, doctors, parks, and cultural institutions. It includes walking distances to the start and from the destination public transport stops as well as the averaged travel and waiting times on the shortest route calculated via a modified Dijkstra algorithm. The results are aggregated using a demographically- and scenario-weighted metric to ensure comparability. The value is in the unit of time, although it should not be interpreted directly as real minutes.
  • Visualization and Validation: A WebGL-based interactive tool and static precomputed maps were developed to allow users to interactively explore transport accessibility metrics dynamically, available at https://mobility.dbvis.de.

Potential Applications

The dataset enables multiple use cases across research, policy, and urban planning:

  • Public Accessibility Studies: Provides insights into transport equity by evaluating mobility gaps affecting different demographic groups, different regional areas, and comparing county and state efforts in improving public transport quality.
  • Urban Planning and Transport Policy: Supports data-driven decision-making for optimizing transit networks, adjusting service schedules, or identifying underserved areas.
  • Smart City Development: Assists in integrating mobility analytics into broader smart city initiatives for efficient resource allocation and sustainability planning.
  • Academic Research: Facilitates studies in transportation engineering, urban geography, and mobility behavior analysis.

Conclusion

By offering high-resolution public transport accessibility data at housing-level granularity, this dataset contributes to a more transparent and objective understanding of urban mobility challenges. The integration of simulation models, demographic considerations, and scalable analytics provides a novel approach to evaluating and improving public transit systems. Researchers, city officials, and policymakers are encouraged to leverage this dataset to enhance transport infrastructure planning and accessibility.

This dataset contains both the approximate walking distances in meters and a weighted overall quality score in an artificial time unit for each individual house in Germany. More advanced versions are currently not publicly available. This base dataset is publicly available and adheres to open data licensing principles, enabling its reuse for scientific and policy-oriented studies.

Source Data Licenses

While not part of this dataset, the scientific simulation used to create the results leverages public transit information via the National Access Point (NAP) DELFI as NeTEx, provided via GTFS feeds of Germany (CC BY 4.0).

Also, routing information used during the processing was based on Open Street Map contributors (CC BY 4.0).

Primarily, this dataset contains original and slightly processed housing locations (lat, lng) that were made available as part of the EU INSPIRE regulations, based on Directive (EU) 2019/1024 (of the European Parliament and of the Council of 20 June 2019 on open data and the re-use of public sector information (recast)).

In Germany, the respective data is provided individually by the 16 federal states, with the following required attributions and license indications:

  • BB: EU INSPIRE / © GeoBasis-DE/LGB, dl-de-by/2.0 (data modified)
  • BE: EU INSPIRE / © Geoportal Berlin / Hauskoordinaten, dl-de-by/2.0 (data modified)
  • BW: EU INSPIRE / © LGL, www.lgl-bw.de, dl-de-by/2.0 (data modified)
  • BY: EU INSPIRE / © Bayerische Vermessungsverwaltung, CC BY 4.0 (data modified)
  • HB: EU INSPIRE / © Landesamt GeoInformation Bremen, CC BY 4.0 (data modified)
  • HE: EU INSPIRE / © HVBG, dl-de-by-zero/2.0 (data modified)
  • HH: EU INSPIRE / © FHH (LGV), dl-de-by/2.0 (data modified)
  • MV: EU INSPIRE / © LAiV M-V, CC BY 4.0 (data modified)
  • NI: EU INSPIRE / © LGLN 2024, CC BY 4.0 (data modified)
  • NW: EU INSPIRE / © Geobasis NRW, dl-de-by-zero/2.0 (data modified)
  • RP: EU INSPIRE / © GeoBasis-DE / LVermGeoRP 2024, dl-de-by/2.0 (data modified)
  • SH: EU INSPIRE / © GeoBasis-DE/LVermGeo SH, CC BY 4.0 (data modified)
  • SL: EU INSPIRE / © GeoBasis DE/LVGL-SL (2024), dl-de-by/2.0 (data modified)
  • SN: EU INSPIRE / © GeoSN, dl-de-by/2.0 (data modified)
  • ST: EU INSPIRE / © GeoBasis-DE / LVermGeo LSA, dl-de-by/2.0 (data modified)
  • TH: EU INSPIRE / © GDI-Th, dl-de-by/2.0 (data modified)

Original Research

The methodology and techniques are described in an original research article published in 2024. When referring to our approach, please cite the following publication:

Yannick Metz, Dennis Ackermann, Daniel A. Keim, and Maximilian T. Fischer. Interactive Public Transport Infrastructure Analysis through Mobility Profiles: Making the Mobility Transition Transparent. In: 2024 IEEE Visualization in Data Science (VDS). VDS. IEEE, 2024, p. 9. doi: 10.1109/VDS63897.2024.00006

or, when using bibtex:

@inproceedings{MobilityProfiles.VDS.2024,
    author = {Metz, Yannick and Ackermann, Dennis and Keim, Daniel A. and Fischer, Maximilian T.},
    title = {Interactive Public Transport Infrastructure Analysis through Mobility Profiles: Making the Mobility Transition Transparent}, 
    booktitle = {2024 IEEE Visualization in Data Science (VDS)},
    doi = {10.1109/VDS63897.2024.00006},
    pages = {9},
    publisher = {IEEE},
    series = {VDS},
    year = {2024}
}

Files

OPTIMAP_data_card.pdf

Files (174.0 kB)

Name Size Download all
md5:2dd316fbb9edae3d899e1910974ce272
174.0 kB Preview Download

Additional details

Related works

Is supplement to
Publication: 10.1109/VDS63897.2024.00006 (DOI)