Urban Traffic Speed Dataset of Guangzhou, China
Description
This is an urban traffic speed dataset which consists of 214 anonymous road segments (mainly consist of urban expressways and arterials) within two months (i.e., 61 days from August 1, 2016 to September 30, 2016) at 10-minute interval, and the speed observations were collected from Guangzhou, China. In practice, it can be used to evaluate several missing data recovery, short-term traffic prediction and traffic pattern discovery methods.
According to the spatial and temporal attributes, we can easily derive a third-order tensor as \(\mathcal{X}\in\mathbb{R}^{214\times 61\times 144}\) and its dimensions include road segment, day and time window (see the file tensor.mat). The total number of speed observations (or non-zero entries of the tensor \(\mathcal{X}\)) is \(1,855,589\). If the dataset is complete, then we have \(214\times 61\times 144=1,879,776\) observations, therefore, the original missing rate of this dataset is \(1.29\%\).
Note that the file traffic_speed_data.csv is the original traffic speed data with four columns including road segment attribute, day attribute, time window attribute and traffic speed value. The file day_information_table.csv is a table referring to the specific date, and the file time_information_table.csv is a table expressing time window with start time and end time information.
Files
Additional details
References
- Xinyu Chen, Zhaocheng He, Jiawei Wang, 2018. Spatial-temporal traffic speed patterns discovery and incomplete data recovery via SVD-combined tensor decomposition. Transportation Research Part C: Emerging Technologies, 86, 59-77.