Automatic summarization of YouTube video transcription text using term frequency-inverse document frequency
- 1. Department of Computer Science, Faculity of Computer Science and Information Technology, University of Kerbala, Karbala, Iraq
Description
Automatic summarization is a technique for quickly introducing key information by abbreviating large sections of material. Summarization may apply to text and video with a different method to display the abstract of the subject. Natural language processing is employed in automated text summarization in this research, which applies to YouTube videos by transcribing and applying the summary stages in this study. Based on the number of words and sentences in the text, the method term frequencyinverse document frequency (TF-IDF) was used to extract the important keywords for the summary. Some videos are long and boring or take more time to display the information that sometimes finds in a few minutes. Therefore, the essence of the proposed system is to find the way to summarize the long video and introduce the important information to the user as a text with few numbers of lines to benefit the students or the researchers that have no time to spend with long videos for extract the useful data. The results have been evaluated using Rouge method on the convolutional neural network (CNN)-dailymail-master data set
Files
33 26097 v26i3 Jun22.pdf
Files
(736.8 kB)
Name | Size | Download all |
---|---|---|
md5:efca687d8533d722f80c753d35e05edb
|
736.8 kB | Preview Download |