Whispering Under the Eaves: Protecting User Privacy Against Commercial and LLM-powered Automatic Speech Recognition Systems

Jin, Weifei

doi:10.5281/zenodo.14711220

Published January 21, 2025 | Version v1

Conference paper Open

Whispering Under the Eaves: Protecting User Privacy Against Commercial and LLM-powered Automatic Speech Recognition Systems

Jin, Weifei¹

1. Beijing University of Posts and Telecommunications

Here are the materials related to AudioShield, including:

source_code.zip: The source code of AudioShield.
datasets.zip: The training and testing datasets, processed in pickle format and ready for use. The 500 samples from Librispeech are used for training, and 2000 samples from VCTK are used for testing.
pretrained_models.zip: The pre-trained models required for training and testing, including the locally trained target ASR model, DeepSpeech, and the Autoencoder model VITS used by AudioShield.

Please unzip the files in datasets.zip into the source_code/datasets folder, and unzip the files in pretrained_models.zip into the source_code/pretrained folder.

For detailed installation and usage instructions, please refer to the README file in the source_code folder.

Files

datasets.zip

Files (2.7 GB)

Name	Size	Download all
datasets.zip md5:acc113a57d1562a901ff50fbfa9c3546	1.5 GB	Preview Download
pretrained_models.zip md5:1734d5aa0d1503d43f6eb193fdb2ff42	1.1 GB	Preview Download
source_code.zip md5:06f0a059e0582cae08a123ae852a4d05	2.0 MB	Preview Download

	All versions	This version
Views	148	123
Downloads	179	97
Data volume	199.0 GB	125.7 GB

Whispering Under the Eaves: Protecting User Privacy Against Commercial and LLM-powered Automatic Speech Recognition Systems

Creators

Description

Files

datasets.zip

Files (2.7 GB)