There is a newer version of the record available.

Published March 29, 2020 | Version 0.1.0
Dataset Open

Oral cancer speech corpus for paper "Detecting and analysing spontaneous oral cancer speech in the wild"

  • 1. Netherlands Cancer Institute
  • 2. TU Delft

Description

DO NOT USE THIS VERSION. A newer version is available here: https://zenodo.org/record/6308119

This is the oral cancer speech corpus used in the paper "Detecting and analysing spontaneous oral cancer speech in the wild".

This dataset contains approximately 3 hours of oral cancer speech collected from YouTube, together with a file of additional metadata. In our paper, we use this dataset for an oral cancer speech detection task.

Funding

This project has received funding from the European Union’s Horizon 2020 research and innovation programme under Marie Sklodowska-Curie grant agreement No 766287. The Department of Head and Neck Oncology and Surgery of the Netherlands Cancer Institute receives a research grant from Atos Medical (Horby, Sweden), which contributes to the existing infrastructure for quality of life research.

Citation

If you use this dataset, please cite:

@misc{halpern2020detecting,
    title={Detecting and analysing spontaneous oral cancer speech in the wild},
    author={Bence Mark Halpern and Rob van Son and Michiel van den Brekel and Odette Scharenborg},
    year={2020},
    eprint={2007.14205},
    archivePrefix={arXiv},
    primaryClass={eess.AS}
}

Files (5.8 GB)

Checksum (MD5)    Size
md5:d1440840a95d2ac68ed0611fe9cff572    104.9 MB
md5:42782192b68281aab2c52e5e535184ef    104.9 MB
md5:0957576c2238fd081275e60727d362f8    104.9 MB
md5:57e966ad07924d75ddee7dc76b5131c4    104.9 MB
md5:9fdd03d75a7773f44811d94947f27c23    104.9 MB
md5:7455a8ddc9ae9b17d49f389490ffa08b    104.9 MB
md5:914be66bac48c66501173f15de4a8eb0    104.9 MB
md5:304af4848c399e26ae075fc893c70ff6    104.9 MB
md5:c29165b4dd56bd7d9a912ee73d2f5a2d    104.9 MB
md5:390ddf58facb6ef04c6043f6a0b7e466    104.9 MB
md5:9c506be2d83810ae5dcaa7c57bf60841    104.9 MB
md5:613409eefa52b71d01ef5cc18dad71c2    104.9 MB
md5:990045f6517c497d7a01d2aefef5fd62    104.9 MB
md5:ac2017de31acc81ad964629badecf4e1    104.9 MB
md5:c31dee7eda73722d8f84d0971dcb7313    104.9 MB
md5:14f7cec20a7999d93a56c8e9afec57c1    104.9 MB
md5:84404a408c91bba2a45fdaab9502a4cf    104.9 MB
md5:a28c04f1705fab68457bb2ee2cba2637    104.9 MB
md5:d88220c24e5efacfce2b47ae0ecbde02    104.9 MB
md5:1749089cc0a0c4b761e83eaef285dd9a    104.9 MB
md5:2b16590c61c157ab5c81867f4e848e8e    104.9 MB
md5:63e6b1850292fcedc508d18d46ab46bc    104.9 MB
md5:e07ed9979b71c6519f215de22a8c3bc7    104.9 MB
md5:331805609724a9cca1b2021487c07675    104.9 MB
md5:980d29b0161cd57c64d6fcc98198df9a    104.9 MB
md5:4c743f2a973c8697abfe539b76a4eb12    104.9 MB
md5:bb613f92f2f9d851f2d63c5daf46d66b    104.9 MB
md5:72811cec467d86f7b70ebda7a084347b    104.9 MB
md5:903d85ae318a461d7b25fcc340e4b7d2    104.9 MB
md5:4893f2bf60a77261544cebf0c36e8cbd    104.9 MB
md5:e17e6abdbbcedda6c0aa7569bbd64e8c    104.9 MB
md5:55cf64ab8c926337da23bb8da858d1dc    104.9 MB
md5:4318a110847cc2d4260add2156279c20    104.9 MB
md5:f58fda30d5e813c1675bb6ef4b25de88    104.9 MB
md5:54e446b1eeb0944232b98dee61eba63d    104.9 MB
md5:0ed1909badd09554958931bd9c72c781    104.9 MB
md5:96df5b293f4484883d06a0b22b86d740    104.9 MB
md5:7a730685d873dc8c4dd7527f24c25f10    104.9 MB
md5:03ed12ef854c273131f6b4973634b396    104.9 MB
md5:5282077ef17095967241aae23f23b43c    104.9 MB
md5:fbaabb48d8a7d61b88683e1cbbb91226    104.9 MB
md5:b22d07d17fdde95162f2dcba0dd6b9fa    104.9 MB
md5:1f480d31e59c461fabea6a130c152ce0    104.9 MB
md5:6b0cd14eb7cf6a494501580bfdaa8618    104.9 MB
md5:d8fc9edb8ca4be779dff9bbb9e3d4e3f    104.9 MB
md5:102e0383db7595aec329c98ae6a4218d    104.9 MB
md5:a6f29a8a415dfcfce5a266de68e63487    104.9 MB
md5:7da4732d37ee15276a2f3fafa8d5c50c    104.9 MB
md5:f24087f7f1c65986042772a1dd6614e2    104.9 MB
md5:f11f24b5fd64d6196cca9851215efd37    104.9 MB
md5:0b57c2d436613c1b4a21de319d1717ab    104.9 MB
md5:575e4b87448c62111c47304a85e0da41    104.9 MB
md5:a805479dba7b5f65971ddb99c3874ece    104.9 MB
md5:432ba4554207fcc68392fa840f8ac0f7    104.9 MB
md5:907365ee45f16a679ca2d49e9de6ede7    104.9 MB
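
The MD5 checksums listed above can be used to confirm that a downloaded file arrived intact. The following is a minimal sketch, not part of the original record: the function names md5_of_file and matches_listing are hypothetical helpers, and the local file path is an assumption, since the file names are not shown in this listing.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read in chunks so a ~105 MB file never has to fit in memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, listed_checksum):
    """Compare a file against a checksum written as 'md5:<hex>' or '<hex>'."""
    expected = listed_checksum.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[len("md5:"):]
    return md5_of_file(path) == expected
```

For example, matches_listing("downloaded_part", "md5:d1440840a95d2ac68ed0611fe9cff572") would return True only if the local file (here a placeholder name) has exactly that digest. A mismatch usually indicates an incomplete or corrupted download.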

Additional details

Funding

European Commission: TAPAS – Training Network on Automatic Processing of PAthological Speech (grant agreement No 766287)