Published May 11, 2020 | Version v1, 11.05.2020
Dataset Open

Additional TAU datasets for Wi-Fi fingerprinting-based positioning

Authors/Creators

  • 1. TAU

Description

1. Contents

This document describes two datasets collected at Tampere University facilities with samples taken from a Wi-Fi network interface for experiments with indoor positioning based on Wi-Fi fingerprinting.

To reference this dataset, please use

E.S. Lohan et al. “Additional TAU datasets for Wi-Fi fingerprinting-based positioning” 10.5281/zenodo.3819917

Additional reference using these datasets

Torres-Sospedra, J.; Quezada-Gaibor, D.; Mendoza-Silva, G. M.; Nurmi, J.; Koucheryavy, Y. and Huerta, J. New Cluster Selection and Fine-grained Search for k-Means Clustering and Wi-Fi Fingerprinting. Proceedings of the Tenth International Conference on Localization and GNSS (ICL-GNSS), 2020.

Dataset format

Two independent datasets are provided in separate folders, “Database_Building01” and “Database_Building02”. Each dataset includes two sets of samples:

  • radio map – a set of Wi-Fi samples collected at a grid of points (reference points);
  • evaluation – a set of Wi-Fi samples randomly collected in the evaluation area.

Each set comes as two files, one holding the RSS vectors and one holding the coordinates. Radio map file names start with “rm_” and evaluation file names start with “eval_”. For instance, the radio map files are:

  • rm_crd.csv: holds the coordinates (x, y) and the floor identifier (z) where the samples were collected;
  • rm_rss.csv: holds the measured RSSI values from each of the Access Points (AP) detected in each sample.

All files use the same format: CSV (comma-separated values) plain text encoded in UTF-8.
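As a minimal sketch of reading such files, the snippet below loads comma-separated numeric text into NumPy arrays. It assumes the files have no header row and contain only numeric values, which this description does not state explicitly. The helper name `load_csv_matrix` is hypothetical, and the in-memory data are synthetic stand-ins, not values from the datasets.

```python
import io

import numpy as np


def load_csv_matrix(source):
    """Load comma-separated numeric text into a 2D float array.

    Assumption: no header row and numeric fields only.
    `source` may be a file path or a file-like object.
    """
    return np.loadtxt(source, delimiter=",", ndmin=2)


# Synthetic stand-ins for rm_crd.csv (x, y, floor) and rm_rss.csv (one column per AP).
demo_crd = io.StringIO("0.0,0.0,1\n3.0,4.0,1\n")
demo_rss = io.StringIO("-60,100,-75\n-58,-80,100\n")

crd = load_csv_matrix(demo_crd)
rss = load_csv_matrix(demo_rss)
print(crd.shape, rss.shape)
```

With the archive extracted, the same helper could be called with a path such as "Database_Building01/rm_crd.csv", assuming the files sit directly inside the dataset folder.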

Coordinates: Each sample is associated with a pair of coordinates in a 2D Euclidean reference system. The origin of the reference system was chosen arbitrarily for convenience, and the units are meters, so distances between points can be calculated easily. The floor identifier is included to enable 3D positioning.
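Because the coordinates are Euclidean and in meters, the horizontal distance between two samples is the ordinary 2D norm. The sketch below illustrates this with made-up points. The function name `planar_distance` and the (x, y, floor) tuple layout are assumptions for illustration, and the floor identifier is compared separately because this description gives no floor height.

```python
import math


def planar_distance(p, q):
    """Horizontal distance in meters between two (x, y, floor) points."""
    return math.hypot(q[0] - p[0], q[1] - p[1])


# Two made-up points on the same floor.
a = (0.0, 0.0, 1)
b = (3.0, 4.0, 1)

print(planar_distance(a, b))  # 5.0
print(a[2] == b[2])           # True: same floor identifier
```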

RSSI values: The RSSI values are provided as read from the Wi-Fi network interface through the Android API. In each sample, a value of +100 was assigned to every AP not detected during that measurement. No information is provided about the MAC addresses of the APs. However, the column order is the same for all samples, so all values in a given column belong to the same AP.
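Measured RSSI values are typically negative (dBm), so the +100 marker is numerically larger than any real reading and would look like a very strong signal to a distance metric. One common preprocessing step, shown here as a sketch rather than as the dataset authors' procedure, maps the marker to a value just below the weakest detected reading. The function name, the default of 1 dB below the minimum, and the sample data are all assumptions.

```python
import numpy as np

NOT_DETECTED = 100.0  # marker stated in the dataset description


def replace_not_detected(rss, fill_value=None):
    """Return a copy of `rss` with the +100 marker replaced, plus a detection mask."""
    rss = np.asarray(rss, dtype=float)
    detected = rss != NOT_DETECTED
    if fill_value is None:
        if not detected.any():
            raise ValueError("no detected APs; pass fill_value explicitly")
        # Assumption: 1 dB below the weakest detected value.
        fill_value = rss[detected].min() - 1.0
    return np.where(detected, rss, fill_value), detected


# Synthetic example: two samples, three APs.
demo = [[-60, 100, -75], [-58, -80, 100]]
cleaned, mask = replace_not_detected(demo)
print(cleaned)
print(mask.sum(axis=1))  # number of detected APs per sample
```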

The two datasets are independent, and none of the provided files includes a sample identifier. The two files of a set are associated by line number: the coordinates and RSSI values on the same line of each file refer to the same sample.
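Since rows are paired by position, a radio map can be used by indexing both arrays with the same row numbers. The sketch below applies a plain k-nearest-neighbours estimate, a common fingerprinting baseline and not necessarily the method used in the referenced paper. It assumes the arrays are already loaded, that non-detected entries have been mapped to a weak RSSI value, and that all names and data shown are hypothetical.

```python
import numpy as np


def knn_estimate(rm_rss, rm_crd, sample_rss, k=3):
    """Estimate (x, y) and floor for one RSSI sample from a row-aligned radio map."""
    rm_rss = np.asarray(rm_rss, dtype=float)
    rm_crd = np.asarray(rm_crd, dtype=float)
    if rm_rss.shape[0] != rm_crd.shape[0]:
        raise ValueError("RSSI and coordinate files must have the same number of rows")
    # Euclidean distance in signal space to every reference fingerprint.
    dists = np.linalg.norm(rm_rss - np.asarray(sample_rss, dtype=float), axis=1)
    nearest = np.argsort(dists)[:k]
    # Same row indices select the matching coordinates.
    xy = rm_crd[nearest, :2].mean(axis=0)
    floors, counts = np.unique(rm_crd[nearest, 2], return_counts=True)
    return xy, floors[counts.argmax()]


# Synthetic radio map: three reference samples, two APs.
demo_rm_rss = [[-50, -90], [-90, -50], [-52, -88]]
demo_rm_crd = [[0.0, 0.0, 1], [10.0, 0.0, 1], [2.0, 0.0, 1]]

xy, floor = knn_estimate(demo_rm_rss, demo_rm_crd, [-51, -89], k=2)
print(xy, floor)
```

The row-count check guards against mismatched files, which would otherwise silently pair fingerprints with the wrong coordinates.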

Files

Databases.zip

Files (1.4 MB)

md5:5384606cbe55a3423a56efbcb10a415e

Additional details

References

  • Torres-Sospedra, J.; Quezada-Gaibor, D.; Mendoza-Silva, G. M.; Nurmi, J.; Koucheryavy, Y. and Huerta, J. New Cluster Selection and Fine-grained Search for k-Means Clustering and Wi-Fi Fingerprinting. Proceedings of the Tenth International Conference on Localization and GNSS (ICL-GNSS), 2020.