There is a newer version of the record available.

Published January 21, 2025 | Version v1
Conference paper Open

Whispering Under the Eaves: Protecting User Privacy Against Commercial and LLM-powered Automatic Speech Recognition Systems

  • 1. ROR icon Beijing University of Posts and Telecommunications

Description

Here are the materials related to AudioShield, including:

  • source_code.zip: The source code of AudioShield.
  • datasets.zip: The training and testing datasets, processed in pickle format and ready for use. The 500 samples from Librispeech are used for training, and 2000 samples from VCTK are used for testing.
  • pretrained_models.zip: The pre-trained models required for training and testing, including the locally trained target ASR model, DeepSpeech, and the Autoencoder model VITS used by AudioShield.

Please unzip the files in datasets.zip into the source_code/datasets folder, and unzip the files in pretrained_models.zip into the source_code/pretrained folder.

For detailed installation and usage instructions, please refer to the README file in the source_code folder.

Files

datasets.zip

Files (2.7 GB)

Name Size Download all
md5:acc113a57d1562a901ff50fbfa9c3546
1.5 GB Preview Download
md5:1734d5aa0d1503d43f6eb193fdb2ff42
1.1 GB Preview Download
md5:06f0a059e0582cae08a123ae852a4d05
2.0 MB Preview Download