Published August 7, 2024 | Version v1
Dataset Open

Anonymised Phone Call Dataset for Anomaly Detection

  • 1. ROR icon Universidade do Porto
  • 2. ROR icon INESC TEC

Description

The dataset provides anonymized information related to phone calls, including the following details:

1. Origin Numbers (A-Numbers)
2. Destination Numbers (B-Numbers)
3. Timestamp of the call
4. Call Result, indicating whether the call was blacklisted (coded as 001) or not (coded as 000)

The dataset is divided into two subsets with the following characteristics:

Dataset 1
- Collection Period: 24th July 2018 to 21st October 2018
- Duration: 89 days
- Total Records: 83,366,367 examples
- Unique A-Numbers: 9,006,011
- Unique B-Numbers: 2,387,932

Dataset 2
- Collection Period: 1st June 2019 to 30th June 2019
- Duration: 29 days
- Total Records: 32,879,670 examples
- Unique A-Numbers: 3,217,069
- Unique B-Numbers: 1,380,235

Files

dataset_1.zip

Files (1.2 GB)

Name Size Download all
md5:d640d981b574273d9836f09d9bb02b1b
902.7 MB Preview Download
md5:0cd3af996b5205c1b9db6dd99b77f22b
344.7 MB Preview Download

Additional details

Related works

Is part of
Journal article: 10.1007/S12243-020-00808-W (DOI)
Journal article: 10.1145/3429204.3429208 (DOI)
Conference paper: 10.1145/3341105.3373842 (DOI)
Other: https://ceur-ws.org/Vol-2579/BIgMine-2019_paper_2.pdf (URL)