A survey on audio analysis: Text characterization and summarization

Maheshwar Reddy V; Deepika K; Adithya Surya Prakash K; Sanathan M

doi:10.5281/zenodo.14137275

Published March 30, 2024 | Version v1

Journal article Open

A survey on audio analysis: Text characterization and summarization

1. Associate Professor, Department of Computer Science (Artificial Intelligence and Machine Learning), ACE Engineering College, Hyderabad, Telangana, India.
2. IV B. Tech students Department of Computer Science (Artificial Intelligence and Machine Learning), ACE Engineering College, Hyderabad, Telangana, India.

The integration of cutting-edge natural language processing (NLP) technology for smooth audio-to-text conversion and summarization is examined in this survey. Utilizing Facebook’s BART model for succinct summaries and Google’s Speech-to-Text API for precise transcription. The report highlights the value of sophisticated summarization models and precise transcription. It talks about how the system can be used in a variety of fields, such as podcast and video transcript generation, automated meeting transcription and summarization, content indexing and search, and more. In addition to addressing issues like context preservation and bias reduction, the survey assesses relevant research on text generation, LSTM networks, and summarization techniques. Overall, by incorporating state-of-the-art technology, this study advances the processing of audio content and eventually makes it easier to extract valuable information.

Files

WJARR-2024-0789.pdf

Files (809.9 kB)

Name	Size	Download all
WJARR-2024-0789.pdf md5:6b918ce5ad023fd90a51f36b2d0ca324	809.9 kB	Preview Download

Additional details

DOI: 10.30574/wjarr.2024.21.3.0789

	All versions	This version
Views	18	18
Downloads	19	19
Data volume	17.0 MB	17.0 MB

A survey on audio analysis: Text characterization and summarization

Creators

Description

Files

WJARR-2024-0789.pdf

Files (809.9 kB)

Additional details

Identifiers