Skip to content

AI-Powered Transcriber – A Python-based transcription tool using OpenAI’s Whisper to generate accurate captions and subtitles. Handles noisy audio, fast speech, and complex accents. Future plans include AI-powered accuracy review and contextual inference.

Notifications You must be signed in to change notification settings

btcmop/ai-transcriber

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 

Repository files navigation

🎤 AI Transcriber - Whisper-Powered Transcription Tool

📌 Overview

AI Transcriber is a Python-based tool that converts speech to text using OpenAI’s Whisper model. Designed for content creators, podcasters, and musicians, this tool ensures high-accuracy captions, even for fast speech, low-quality audio, or background noise.

🚀 Features

AI-Powered Speech-to-Text - Uses Whisper for precise transcription.
Handles Noisy Audio & Fast Speech - Great for rap, interviews, or podcasts.
Editable Captions - Outputs a text file for easy review and editing.
Simple Command-Line Interface (CLI) - Just run the script and get your transcript.
Future Plans - AI-assisted accuracy review, web UI, YouTube plugin.

🛠️ Installation

  1. Clone the repository:
    git clone https://github.com/your-username/ai-transcriber.git
    cd ai-transcriber
  2. Install dependencies:
    pip install openai-whisper torch soundfile
  3. Run the Script:
    python transcribe.py your_audio_file.mp3
    

🔧 Usage

  1. Place your audio file (mp3, wav, or m4a) in the project folder.:
  2. Run:
    python transcribe.py example.mp3
  3. The transcription will appear in the terminal and be saved as example.mp3.txt:

📌 Future Enhancements

🔹 AI-powered accuracy review to catch mistakes.

🔹 Web App & Browser Extension for YouTube captions.

🔹 Custom Model Fine-Tuning for better slang/rap/music transcription.

💡 Contributing

Want to improve this? Fork the repo, submit PRs, or suggest ideas in the Issues tab!

📜 License

MIT License - Free to use and modify.

About

AI-Powered Transcriber – A Python-based transcription tool using OpenAI’s Whisper to generate accurate captions and subtitles. Handles noisy audio, fast speech, and complex accents. Future plans include AI-powered accuracy review and contextual inference.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages