Benchamark for GRAP-MOT: Unsupervised Graph-based Position Weighted Person Multi-camera Multi-object Tracking in a Highly Congested Space
Authors/Creators
Description
This dataset is a large-scale collection of precisely annotated images, that can be used both for training and validating person multi-camera tracking algorithms. This benchmark focuses on the problem of multi-camera tracking of people in restricted areas like public transport or indoor locations. Our approach focuses on re-identification based on heads, thus only the heads of people were labelled.
During making the recordings, we made sure that no people were entering or leaving the area during recording a single take. The dataset consists of videos taken from 3 different angles. Each video is 256 frames long.
The videos were grouped by the number of people (from 2 to 15), that are in the recorded area. We have 10 recordings for each group giving in total 140 recordings.
Each image was later manually labelled in the form of bounding boxes. Bounding boxes of the same person in the same take between 3 cameras have the same id on all images.
Our frame is equipped with 3 cameras. One (Fish Eye) in the middle and two (regular ones) in the corners. The FE camera takes pictures in 1280X960 resolution, while the side ones are in 1920X1080 pixels. All images are stored as .jpg files.
Each image has a corresponding text file stored in labels.tar.gz file. Label files have a following structure.
Exemplary file:
1 0.455469 0.360938 0.039062 0.065625
0 0.247656 0.476042 0.031250 0.050000
2 0.671875 0.514062 0.051562 0.076042
1st column: number of if. It is unique fir each passenger on a take.
2nd column: X: relative horizontal distance from the left upper corner of an image to the middle of detection
3rd column: Y: relative vertical distance from the left upper corner of an image to the middle of detection
4th column: W relative width of the detection
5th column: H relative height of the detection
Unfortunately due to technical limitations could not publish the whole dataset at once. The full dataset can be found under this link "".
In this repository, we provide
-
Labelled recordings of people in the restricted area ranging from 2 to 15 on a single take.
- For each number of people, 10 takes were taken
-
Each take was taken from 3 overlapping cameras at the same time
Citing our work
and this Zenodo Dataset
@dataset{GRAP_MOT_zenodo,
author = {Marek Socha, Michał Marczyk, Aleksander Kempski, Michał Cogiel, Paweł Foszner, Michał Staniszewski},
title = {Benchamark for GRAP-MOT: Unsupervised Graph-based Position Weighted Person Multi-camera Multi-object Tracking in a Highly Congested Space},
month = DEC,
year = 2024,
publisher = {Zenodo},
version = {1.0.0},
doi = {10.5281/zenodo.14526116},
url = {https://zenodo.org/records/14526116}
}
Files
Files
(46.4 GB)
| Name | Size | Download all |
|---|---|---|
|
md5:316e29a129979ff05065420c698581a9
|
4.2 GB | Download |
|
md5:72c970b06fbdc5632fad9feaf43a7e33
|
4.3 GB | Download |
|
md5:9933faf7573e579a579e32c6c7ff928f
|
4.5 GB | Download |
|
md5:281bc5d9e0d234d7f90bed08d77c4b96
|
3.9 GB | Download |
|
md5:df5e1405f8d355cf316a745bb049c3e0
|
4.5 GB | Download |
|
md5:2c3846205cc0420992e8221663494994
|
4.2 GB | Download |
|
md5:0dac1645e1e05f2cdc84ba5ea181939f
|
13.2 MB | Download |
|
md5:ad827fee00d1092c5efda656d1ef32fe
|
4.1 GB | Download |
|
md5:8c724b334df451b2766ef8bd59360f5e
|
4.0 GB | Download |
|
md5:fa6879132b3b27137cbc9a115a16e128
|
4.2 GB | Download |
|
md5:594bb8ae1b5ff562a076a4826d3279b7
|
4.4 GB | Download |
|
md5:1f80c3958e521a1c2da8673be8d6a60e
|
4.2 GB | Download |