
Published June 1, 2022 | Version v1
Journal article Open

Automatic summarization of YouTube video transcription text using term frequency-inverse document frequency

  • 1. Department of Computer Science, Faculty of Computer Science and Information Technology, University of Kerbala, Karbala, Iraq

Description

Automatic summarization is a technique for conveying key information quickly by condensing large bodies of material. It can be applied to both text and video, with different methods used to present a digest of the subject. This research applies natural language processing to automatic text summarization of YouTube videos: each video is transcribed, and the summarization stages are then applied to the transcript. The term frequency-inverse document frequency (TF-IDF) method, based on the counts of words and sentences in the text, was used to extract the important keywords for the summary. Some videos are long and tedious, or take a long time to present information that could be conveyed in a few minutes. The proposed system therefore summarizes a long video and presents its important information to the user as a few lines of text, benefiting students and researchers who lack the time to watch long videos to extract the useful data. The results were evaluated using the ROUGE method on the CNN/Daily Mail (cnn-dailymail-master) data set.
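The approach described above can be illustrated with a minimal sketch. It is not the authors' implementation, and it rests on several assumptions: each sentence of the transcript is treated as one "document" for the IDF term, sentences are split naively on end punctuation, and no stop-word removal or stemming is applied. All function names (`split_sentences`, `tokenize`, `tfidf_sentence_scores`, `summarize`, `rouge1_recall`) are hypothetical, and the ROUGE-1 recall shown is a simplified unigram-overlap version rather than the official ROUGE toolkit.

```python
import math
import re
from collections import Counter


def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def tokenize(sentence):
    """Lowercase word tokens; no stop-word removal or stemming (assumption)."""
    return re.findall(r"[a-z0-9']+", sentence.lower())


def tfidf_sentence_scores(sentences):
    """Score each sentence by the average TF-IDF weight of its distinct terms.

    Assumption of this sketch: every sentence counts as one document.
    """
    tokenized = [tokenize(s) for s in sentences]
    n = len(tokenized)
    # Document frequency: number of sentences that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    scores = []
    for tokens in tokenized:
        if not tokens:
            scores.append(0.0)
            continue
        tf = Counter(tokens)
        total = sum(
            (count / len(tokens)) * math.log(n / df[term])
            for term, count in tf.items()
        )
        scores.append(total / len(tf))
    return scores


def summarize(text, max_sentences=3):
    """Keep the top-scoring sentences, emitted in their original order."""
    sentences = split_sentences(text)
    scores = tfidf_sentence_scores(sentences)
    ranked = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)
    keep = sorted(ranked[:max_sentences])
    return " ".join(sentences[i] for i in keep)


def rouge1_recall(candidate, reference):
    """Simplified ROUGE-1 recall: clipped unigram overlap / reference length."""
    ref = Counter(tokenize(reference))
    cand = Counter(tokenize(candidate))
    if not ref:
        return 0.0
    overlap = sum(min(count, cand[term]) for term, count in ref.items())
    return overlap / sum(ref.values())


if __name__ == "__main__":
    transcript = (
        "The cat sat. The cat sat. "
        "Quantum computers use qubits for computation."
    )
    print(summarize(transcript, max_sentences=1))
```

Terms that occur in many sentences receive a low IDF weight, so repeated filler sentences score lower than sentences carrying rarer vocabulary. A real pipeline would first need to obtain the video transcript, which this sketch leaves out.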

Files

33 26097 v26i3 Jun22.pdf

Files (736.8 kB)

md5:efca687d8533d722f80c753d35e05edb