802.11 Managemement frames from a public location
Description
About
The following datasets were captured at a busy Belgian train station between 9pm and 10pm, it contains all 802.11 management frames that were captured. both datasets were captured with approximately 20 minutes between then.
Both datasets are represented by a pcap and CSV file. The CSV file contains the frame type, timestamps, signal strength, SSID and MAC addresses for every frame. In the pcap file, all generic 802.11 elements were removed for anonymization purposes.
Anonymization
All frames were anonymized by removing identifying information or renaming identifiers. Concretely, the following transformations were applied to both datasets:
- All MAC addresses were renamed (e.g. 00:00:00:00:00:01)
- All SSID's were renamed (e.g. NETWORK_1)
- All generec 802.11 elements were removed from the pcap
In the pcap file, anonymization actions could lead to "corrupted" frames because length tags do not correspond with the actual data. However, the file and its frames are still readable in packet analyzing tools such as Wireshark or Scapy.
The script which was used to anonymize is available in the dataset.
Data
N/o | Dataset 1 | dataset 2 |
---|---|---|
Frames | 36306 | 60984 |
Beacon frames | 19693 | 27983 |
Request frames | 798 | 1580 |
Response frames | 15815 | 31421 |
Identified Wi-Fi Networks | 54 | 70 |
Identified MAC addresses | 2092 | 2705 |
Identified Wireless devices | 128 | 186 |
Capturetime | 480s | 422s |
Dataset contents
The two datasets are stored in the directories `1/` and `2/`. Each directory contains:
- `capture-X.pcap`: an anonymized version of the original capture
- `capture-X.csv`: content of each captured frame (timestamp, MAC address...) saved as a CSV file
`anonymization.py` is the script which was used to remove identifiers.
`README.md` contains the documentation about the datasets
License
Copyright 2022-2023 Benjamin Vermunicht, Beat Signer, Maxim Van de Wynckel, Vrije Universiteit Brussel
Permission is hereby granted, free of charge, to any person obtaining a copy of this dataset and associated documentation files (the “Dataset”), to deal in the Dataset without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Dataset, and to permit persons to whom the Dataset is furnished to do so, subject to the following conditions:
The above copyright notice and this permission notice shall be included in all copies or substantial portions that make use of the Dataset.
THE DATASET IS PROVIDED “AS IS”, WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE DATASET OR THE USE OR OTHER DEALINGS IN THE DATASET.
Files
datasets.zip
Files
(4.1 MB)
Name | Size | Download all |
---|---|---|
md5:835320ace908243f23cb03fc48ce44fc
|
4.1 MB | Preview Download |