Skip to content

dev-aditya/whisper-youtube-transcriber

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

🎙️ YouTube to Whisper Transcription UI

A user-friendly web interface to download YouTube audio and transcribe it using OpenAI's Whisper.

✨ Features

  • YouTube & File Input: Transcribe from YouTube URLs or local audio files.
  • Organized Storage: Automatically saves audio and transcripts to a transcriptions/ folder, organized by video title.
  • Multiple Whisper Models: Choose from tiny, base, small, medium, or large for a balance of speed and accuracy.
  • Language Support: Auto-detect or manually select from dozens of languages.
  • Translation to English: Translate any supported language into English.
  • Multiple Export Formats: Get transcripts as TXT, SRT, VTT, and JSON.

🚀 Quick Start

Prerequisites

  • Conda or Miniconda
  • FFmpeg (required for audio processing)

Installation

1. Create Conda Environment

First, create the conda environment from the environment.yml file:

conda env create -f environment.yml
conda activate whisper

2. Install PyTorch

The PyTorch installation varies by platform. Choose the appropriate method for your system (CUDA/Intel/AMD)

Note: Visit PyTorch's official website for the latest installation commands and CUDA version compatibility.

3. Install FFmpeg

FFmpeg is required for audio processing. Install it based on your platform. Instructions on FFmpeg website ffmpeg.org

2. Run the Application

Once the environment is created, you can start the app.

Method 1: Batch File (Easiest) Simply double-click run_app.bat.

Method 2: Manual Start

conda activate whisper
python app.py

The application will open in your browser at http://127.0.0.1:7860.

📖 How to Use

  1. Paste a YouTube URL or upload an audio file.
  2. Select a Whisper Model (start with base for a good balance).
  3. Choose your Language (auto is usually fine).
  4. Select the Task (transcribe or translate).
  5. Choose any Export Formats you need.
  6. Click "🚀 Download & Transcribe" or "🚀 Transcribe".

Your files will be saved in the transcriptions folder.

🛠️ Technical Details

  • Environment: The conda environment is managed by environment.yml.
  • Downloader: Uses the yt-dlp Python package.
  • File Structure: All output is saved in D:\ML\whisper\transcriptions\[Video Title]\.

🔧 Troubleshooting

  • MAY THE AI LORD BE WITH YOU

This project uses OpenAI's Whisper and the yt-dlp project.

About

A simple web UI to download audio from YouTube and transcribe it using OpenAI's Whisper (all locally).

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published