A survey on audio analysis: Text characterization and summarization
- 1. Associate Professor, Department of Computer Science (Artificial Intelligence and Machine Learning), ACE Engineering College, Hyderabad, Telangana, India.
- 2. IV B. Tech students Department of Computer Science (Artificial Intelligence and Machine Learning), ACE Engineering College, Hyderabad, Telangana, India.
Description
The integration of cutting-edge natural language processing (NLP) technology for smooth audio-to-text conversion and summarization is examined in this survey. Utilizing Facebook’s BART model for succinct summaries and Google’s Speech-to-Text API for precise transcription. The report highlights the value of sophisticated summarization models and precise transcription. It talks about how the system can be used in a variety of fields, such as podcast and video transcript generation, automated meeting transcription and summarization, content indexing and search, and more. In addition to addressing issues like context preservation and bias reduction, the survey assesses relevant research on text generation, LSTM networks, and summarization techniques. Overall, by incorporating state-of-the-art technology, this study advances the processing of audio content and eventually makes it easier to extract valuable information.
Files
WJARR-2024-0789.pdf
Files
(809.9 kB)
Name | Size | Download all |
---|---|---|
md5:6b918ce5ad023fd90a51f36b2d0ca324
|
809.9 kB | Preview Download |