Published November 6, 2024 | Version v1
Software documentation (open access)

How to create high-quality offline video transcriptions and subtitles using Whisper and Python - 6 November 2024

Authors/Creators

  • National Library of the Netherlands

Description

The article outlines a method for creating offline, high-quality video transcriptions and subtitles using OpenAI's Whisper model with Python, emphasizing privacy, accuracy, and accessibility without needing cloud-based speech-to-text services.

https://github.com/KBNLresearch/videotools

The author explores the Whisper model for automatic speech recognition (ASR) to address limitations in existing cloud-based services, such as low transcription quality, privacy concerns, file size restrictions, and costs. 

Key advantages of using Whisper include:

  1. Offline Capabilities and Privacy: Whisper's large model (around 3GB) can run locally on a laptop, enabling privacy-compliant transcription without internet dependency.
  2. Language and Accuracy: The model performs exceptionally well with multiple languages, especially Dutch and English, and effectively transcribes complex terms and named entities.
  3. Real-time Processing: The large model provides near real-time transcription speed (a 15-minute video processes in about 15-20 minutes). Smaller, faster models are also available with reduced accuracy.
  4. Subtitle Generation: Whisper can automatically generate accurate subtitles, enhancing accessibility for viewers with hearing impairments.
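The subtitle generation step can be sketched as follows. This is a minimal illustration, not the repository's own code: it assumes segments shaped like the `result["segments"]` list that Whisper's `transcribe()` returns, each a dict with `start` and `end` times in seconds and a `text` field, and renders them in SubRip (.srt) format.

```python
# Sketch: render Whisper-style segments as SubRip (.srt) subtitle text.
# Assumes each segment is a dict with "start", "end" (seconds) and "text",
# matching the shape of result["segments"] from Whisper's transcribe().

def srt_timestamp(seconds: float) -> str:
    """Format seconds as an SRT timestamp: HH:MM:SS,mmm."""
    millis = round(seconds * 1000)
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1_000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"

def segments_to_srt(segments) -> str:
    """Render a list of segments as the contents of an .srt file."""
    blocks = []
    for i, seg in enumerate(segments, start=1):
        blocks.append(
            f"{i}\n"
            f"{srt_timestamp(seg['start'])} --> {srt_timestamp(seg['end'])}\n"
            f"{seg['text'].strip()}\n"
        )
    return "\n".join(blocks)
```

The resulting string can be written to a `.srt` file next to the video; most players pick it up automatically.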

The article includes Python code examples and repository links to help users implement the Whisper-based transcription workflow. Tools like FFmpeg are needed to handle video and audio formats, and optional modules allow transcript refinement using ChatGPT, although using them sacrifices the offline privacy benefit.
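The overall workflow described above can be sketched in a few lines. This is an illustrative outline, not the repository's scripts: file names, the model size, and the language setting are assumptions, and the FFmpeg flags shown here simply strip the video track and resample the audio to 16 kHz mono.

```python
# Sketch: extract audio from a video with FFmpeg, then transcribe it with
# Whisper. File names and parameters are illustrative assumptions.
import subprocess

def build_ffmpeg_cmd(video_path: str, audio_path: str) -> list:
    """FFmpeg command that drops the video stream and resamples the
    audio to 16 kHz mono WAV."""
    return [
        "ffmpeg", "-y",      # overwrite output without prompting
        "-i", video_path,    # input video file
        "-vn",               # no video stream in the output
        "-ar", "16000",      # 16 kHz sample rate
        "-ac", "1",          # mono
        audio_path,
    ]

if __name__ == "__main__":
    subprocess.run(build_ffmpeg_cmd("talk.mp4", "talk.wav"), check=True)

    import whisper  # pip install openai-whisper
    model = whisper.load_model("large")                # ~3 GB, runs locally
    result = model.transcribe("talk.wav", language="nl")
    print(result["text"])                              # full transcript
```

Running entirely on local files like this is what preserves the privacy advantage the article emphasizes.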

This summary was produced with the assistance of ChatGPT-4o on 6 November 2024.

Files

How to create high-quality offline video transcriptions and subtitles using Whisper and Python.pdf

Additional details

Dates

Issued
2024-11-06