Published February 11, 2024 | Version 1.0
Dataset Open

Integrated Agent-based Modelling and Simulation of Transportation Demand and Mobility Patterns in Sweden

  • 1. Department of Space, Earth and Environment, Division of Physical Resource Theory, Chalmers University of Technology, Gothenburg, Sweden

Description

About

The Synthetic Sweden Mobility (SySMo) model provides a simplified yet statistically realistic microscopic representation of the real population of Sweden. The agents in this synthetic population contain socioeconomic attributes, household characteristics, and corresponding activity plans for an average weekday. This agent-based modelling approach derives the transportation demand from the agents’ planned activities using various transport modes (e.g., car, public transport, bike, and walking).

This open data repository contains four datasets: 

(1) Synthetic Agents, 

(2) Activity Plans of the Agents,  

(3) Travel Trajectories of the Agents, and  

(4) Road Network (EPSG: 3006)

(OpenStreetMap data were retrieved on August 28, 2023, from https://download.geofabrik.de/europe.html, and GTFS data were retrieved on September 6, 2023 from https://samtrafiken.se/)

The database can serve as input to assess the potential impacts of new transportation technologies, infrastructure changes, and policy interventions on the mobility patterns of the Swedish population.

Methodology

This dataset contains statistically simulated 10.2 million agents representing the population of Sweden, their socio-economic characteristics and the activity plan for an average weekday. For preparing data for the MATSim simulation, we randomly divided all the agents into 10 batches. Each batch's agents are then simulated in MATSim using the multi-modal network combining road networks and public transit data in Sweden using the package pt2matsim (https://github.com/matsim-org/pt2matsim).  

The agents' daily activity plans along with the road network serve as the primary inputs in the MATSim environment which ensures iterative replanning while aiming for a convergence on optimal activity plans for all the agents. Subsequently, the individual mobility trajectories of the agents from the MATSim simulation are retrieved.

The activity plans of the individual agents extracted from the MATSim simulation output data are then further processed. All agents with negative utility score and negative activity time corresponding to at least one activity are filtered out as the ‘infeasible’ agents. The dataset ‘Synthetic Agents’ contains all synthetic agents regardless of their feasibility’ (0=excluded & 1=included in plans and trajectories). In the other datasets, only agents with feasible activity plans are included.

The simulation setup adheres to the MATSim 13.0 benchmark scenario, with slight adjustments. The strategy for replanning integrates BestScore (60%), TimeAllocationMutator (30%), and ReRoute (10%)— the percentages denote the proportion of agents utilizing these strategies. In each iteration of the simulation, the agents adopt these strategies to adjust their activity plans. The "BestScore" strategy retains the plan with the highest score from the previous iteration, selecting the most successful strategy an agent has employed up until that point. The "TimeAllocationMutator" modifies the end times of activities by introducing random shifts within a specified range, allowing for the exploration of different schedules. The "ReRoute" strategy enables agents to alter their current routes, potentially optimizing travel based on updated information or preferences. These strategies are detailed further in W. Axhausen et al. (2016) work, which provides comprehensive insights into their implementation and impact within the context of transport simulation modeling. 

Data Description

(1) Synthetic Agents

This dataset contains all agents in Sweden and their socioeconomic characteristics.  

The attribute ‘feasibility’ has two categories: feasible agents (73%), and infeasible agents (27%). Infeasible agents are agents with negative utility score and negative activity time corresponding to at least one activity. 

File name: 1_syn_pop_all.parquet

Column

Description

Data type

Unit

PId

Agent ID

Integer

-

Deso Zone code of Demographic statistical areas (DeSO)1 String -
kommun
Municipality code Integer -
marital 
Marital Status (single/ couple/ child) String -
sex 
Gender (0 = Male, 1 = Female) Integer -
age
Age Integer -
HId
A unique identifier for households Integer -
HHtype  
Type of households (single/ couple/ other) String -
HHsize  
Number of people living in the households Integer -
num_babies
Number of children less than six years old in the household Integer -
employment Employment Status (0 = Not Employed, 1 = Employed) Integer -
studenthood Studenthood Status (0 = Not Student, 1 = Student) Integer -
income_class Income Class (0 = No Income, 1 = Low Income, 2 = Lower-middle Income, 3 = Upper-middle Income, 4 = High Income) Integer -
num_cars Number of cars owned by an individual  Integer -
HHcars Number of cars in the household Integer -
feasibility
Status of the individual (1=feasible, 0=infeasible) Integer -

1 https://www.scb.se/vara-tjanster/oppna-data/oppna-geodata/deso--demografiska-statistikomraden/

(2) Activity Plans of the Agents

The dataset contains the car agents’ (agents that use cars on the simulated day) activity plans for a simulated average weekday.   

File name: 2_plans_i.parquet, i = 0, 1, 2, ..., 8, 9. (10 files in total)

Column

Description

Data type

Unit

act_purpose

Activity purpose (work/ home/ school/ other)

String

-

PId

Agent ID

Integer

-

act_end 

End time of activity (0:00:00 – 23:59:59)

String

hour:minute:seco

nd

act_id

Activity index of each agent

Integer

-

mode

Transport mode to reach the activity location

String

-

POINT_X        

Coordinate X of activity location (SWEREF99TM)

Float

metre

POINT_Y

Coordinate Y of activity location (SWEREF99TM)

Float

metre

dep_time    

Departure time (0:00:00 – 23:59:59)

String

hour:minute:seco

nd

score

Utility score of the simulation day as obtained from MATSim

Float

-

trav_time   

Travel time to reach the activity location

String

hour:minute:seco

nd

trav_time_min    

Travel time in decimal minute

Float

minute

act_time 

Activity duration in decimal minute

Float

minute

distance

Travel distance between the origin and the destination

Float

km

speed

Travel speed to reach the activity location

Float

km/h

(3) Travel Trajectories of the Agents

This dataset contains the driving trajectories of all the agents on the road network, and the public transit vehicles used by these agents, including buses, ferries, trams etc. The files are produced by MATSim simulations and organised into 10 *.parquet’ files (representing different batches of simulation) corresponding to each plan file.

File name: 3_events_i.parquet, i = 0, 1, 2, ..., 8, 9. (10 files in total)

 

Column 

Description 

Data type 

Unit 

time 

Time in second in a simulation day (0-86399) 

Integer 

second 

type 

Event type defined by MATSim simulation* 

String 

person 

Agent ID 

Integer 

link 

Nearest road link consistent with the road network 

String 

vehicle 

Vehicle ID identical to person 

Integer 

from_node      

Start node of the link 

Integer 

to_node   

End node of the link 

Integer 

* One typical episode of MATSim simulation events: Activity ends (actend) -> Agent’s vehicle enters traffic (vehicle enters traffic) -> Agent’s vehicle moves from previous road segment to its next connected one (left link) -> Agent’s vehicle leaves traffic for activity (vehicle leaves traffic) -> Activity starts (actstart) 

(4) Road Network

This dataset contains the road network.

File name: 4_network.shp

Column 

Description 

Data type 

Unit 

length 

The length of road link 

Float 

metre 

freespeed 

Free speed 

Float 

km/h 

capacity 

Number of vehicles 

Integer 

permlanes 

Number of lanes 

Integer 

oneway 

Whether the segment is one-way (0=no, 1=yes) 

Integer 

modes 

Transport mode 

String 

from_node 

Start node of the link 

Integer 

to_node 

End node of the link 

Integer 

geometry 

LINESTRING (SWEREF99TM) 

geometry 

metre 

 

Additional Notes

This research is funded by the RISE Research Institutes of Sweden, the Swedish Research Council for Sustainable Development (Formas, project number 2018-01768), and Transport Area of Advance, Chalmers.

Contributions

YL designed the simulation, analyzed the simulation data, and, along with CT, executed the simulation. CT, SD, FS, and SY conceptualized the model (SySMo), with CT and SD further developing the model to produce agents and their activity plans. KG wrote the data document. All authors reviewed, edited, and approved the final document.

Files

Files (18.1 GB)

Name Size Download all
md5:ecd72d88d0b650f584d1712d4c80b3dd
94.8 MB Download
md5:69115a72728ac87ec9088cae3ed0baaf
116.4 MB Download
md5:45677c87ae4749a8e3690f730e7af8df
116.2 MB Download
md5:f9ef640c99e9564e552e6f4034e3b4d2
116.2 MB Download
md5:c360094ddd17aedc92b2211de49f59c7
116.4 MB Download
md5:2b822e46267cad5a960cf7ac45c856a1
115.8 MB Download
md5:6cc786e7b16c94b4f870427798c42bfe
115.7 MB Download
md5:71b5d93092dabe4db5625721a130132e
116.3 MB Download
md5:9070cfdc34a443631977a7d4a43ceec5
116.2 MB Download
md5:46bafe5f5acccb17b3908ea9925a8fe5
116.2 MB Download
md5:6b2b2fdb1d2dc6d5201fceb802af3571
116.2 MB Download
md5:8d58201b2b9aeef44be2fa3708acc4fd
1.5 GB Download
md5:5a2e2f6cac291a9c7c0884c8199a24ed
1.5 GB Download
md5:0e5d108d30a50063a58ad28ad1d79506
1.5 GB Download
md5:04fc73013f4992d4f5a42dbffdf79ba3
1.5 GB Download
md5:57cb31728d39b8dc4d29f8d7d249a9cc
1.5 GB Download
md5:f8fbe37115aac1fbc5adabe1e08f774a
1.5 GB Download
md5:9ded18e6fdf6c01430b8cef10f6dee1c
1.5 GB Download
md5:834ae52e47970705c962651c6c9071e8
1.5 GB Download
md5:edc1d8895565a991612baf0611e83f71
1.5 GB Download
md5:6f7e0d1d1e5090741a6c5566eda2cc0c
1.5 GB Download
md5:0e4bb4efc30ec129e5003a18ea2103cf
10 Bytes Download
md5:753a6c497669148ecd2018b919a44a95
1.2 GB Download
md5:d4ec5348867a8613da0c2ae604d09546
391 Bytes Download
md5:f815ac0048a88ade9ae076fbfc43d341
217.4 MB Download
md5:61131c6231cf6edd49d4a97d38aaa79f
19.8 MB Download

Additional details

Related works

Software

Repository URL
https://github.com/TheYuanLiao/sysmo-data-pt
Programming language
Jupyter Notebook, Python, Java
Development Status
Active

References

  • Tozluoğlu, Ç., Dhamal, S., Yeh, S., Sprei, F., Liao, Y., Marathe, M., Barrett, C. L., & Dubhashi, D. (2023). A synthetic population of Sweden: datasets of agents, households, and activity-travel patterns. Data in Brief, 48, 109209. https://doi.org/10.1016/J.DIB.2023.109209.
  • Liao, Y., Tozluoglu, C., Sprei, F., Yeh, S., & Dhamal, S. (2023). Open synthetic data on travel and charging demand of battery electric cars: An agent-based simulation on three charging behavior archetypes. https://doi.org/10.5281/ZENODO.7549847
  • Beser, M., & Algers, S. (2002). SAMPERS—The new Swedish national travel demand forecasting tool. In National transport models: Recent developments and prospects (pp. 101-118). Berlin, Heidelberg: Springer Berlin Heidelberg
  • W Axhausen, K., Horni, A., & Nagel, K. (2016). The multi-agent transport simulation MATSim (p. 618). Ubiquity Press.