Le Scribe 📜

A fully local Google Meet transcription bot for macOS (Apple Silicon).

Joins Google Meet as a guest bot — no Google account required
Captures audio from both remote participants (via BlackHole) and your own mic
Transcribes in real-time with mlx-whisper — fully on-device, no cloud API
Speaker diarization — knows who said what (SPEAKER_00, SPEAKER_01, ...)
Live transcript visible in a web UI at http://localhost:5050
Saves full session transcript as .txt, .json, and .pdf

Requirements

Requirement	Notes
macOS (Apple Silicon)	M1/M2/M3/M4 — mlx-whisper is Apple Silicon only
Python 3.11+	`brew install [email protected]`
Homebrew	https://brew.sh
Google Chrome	Must be installed
BlackHole 2ch	Virtual audio loopback — `brew install blackhole-2ch`
HuggingFace account	Free — required for speaker diarization model

Installation

1. Install Python dependencies

cd meet-transcriber
bash setup.sh

2. Set up speaker diarization (HuggingFace)

Speaker diarization uses pyannote.audio, which requires accepting model terms:

Create a free account at https://huggingface.co
Accept terms at https://huggingface.co/pyannote/speaker-diarization-3.1
Accept terms at https://huggingface.co/pyannote/segmentation-3.0
Generate a token at https://huggingface.co/settings/tokens (read access)
Add your token to transcriber.py:
```
_HF_TOKEN = "your_token_here"
```

2. Install BlackHole

brew install blackhole-2ch

Reboot after installing so macOS registers the new audio device.

3. Fix Python SSL certificates (Python 3.14+ only)

open "/Applications/Python 3.14/Install Certificates.command"

4. Install ChromeDriver

The bot needs a ChromeDriver binary that matches your Chrome version.

Check your Chrome version at chrome://settings/help
Download the matching arm64 ChromeDriver from https://googlechromelabs.github.io/chrome-for-testing/

Place it at:

~/Library/Application Support/meet_transcriber/chromedriver

Make it executable and sign it:

chmod +x ~/Library/Application\ Support/meet_transcriber/chromedriver
xattr -cr ~/Library/Application\ Support/meet_transcriber/chromedriver
codesign --sign - --force ~/Library/Application\ Support/meet_transcriber/chromedriver

Open it once so macOS prompts you to allow it:
```
open ~/Library/Application\ Support/meet_transcriber/chromedriver
```
Then go to System Settings → Privacy & Security and click Allow Anyway.

5. Configure Audio MIDI Setup (one-time)

This routes meeting audio through BlackHole so the bot can capture it.

Open Spotlight → Audio MIDI Setup
Click + → Create Multi-Output Device
Check both:
- BlackHole 2ch ← set this as the master clock device
- Your speakers or headphones
Set the sample rate to 48000 Hz
Open System Settings → Sound → Output and select the Multi-Output Device

Important: The master clock device in the Multi-Output Device must be BlackHole 2ch (not your headphones), otherwise the sample rate will be forced to your headphone's rate and audio capture will fail.

Running

Web UI (recommended)

python server.py

Opens a browser at http://localhost:5050. Paste your Meet URL, choose language, and click Start.

Command line

python bot.py --meet https://meet.google.com/xxx-xxxx-xxx --lang auto

How it works

Chrome opens as a guest and navigates to the Meet URL
The bot enters the name "Transcription Bot" and asks to join
The meeting host admits the bot
Audio capture starts — two streams are mixed:
- BlackHole: captures audio from other participants (played by the bot's Chrome)
- Mic: captures your own voice (never echoed back by Meet)
VAD (Silero) detects when someone finishes speaking and triggers transcription

Pyannote diarizes the chunk (who spoke), Whisper transcribes it (what was said):

[00:00:08] (fr) [SPEAKER_00] Bonjour tout le monde, bienvenue à cette réunion.
[00:00:15] (en) [SPEAKER_01] Let's start with the agenda for today.

Stop the session — transcript is saved automatically

CLI options

--meet URL               Google Meet URL (required)
--lang CODE              Language code ('fr', 'en', etc.) or 'auto' (default: auto)
--bot-name NAME          Display name in the meeting (default: Transcription Bot)
--chunk-seconds N        Audio chunk size in seconds (default: 10)
--silence-threshold F    RMS cutoff below which chunks are skipped (default: 0.005)
--admission-timeout N    Seconds to wait to be admitted (default: 300)

Transcript output

Saved to ~/Documents/transcriptions/:

Text (YYYY-MM-DD_HH-MM_meet.txt):

Google Meet Transcript — 2026-04-07 17:53
============================================================

[00:00:10] (fr) Bonjour tout le monde, bienvenue à cette réunion.
[00:00:20] (en) Let's start with the agenda for today.

JSON (YYYY-MM-DD_HH-MM_meet.json):

[
  {"time": "00:00:10", "lang": "fr", "text": "Bonjour tout le monde..."},
  {"time": "00:00:20", "lang": "en", "text": "Let's start with the agenda..."}
]

PDF export is also available from the web UI.

Troubleshooting

All audio chunks skipped as silence (RMS=0.00000)

Make sure the Multi-Output Device is set as system output in Sound settings
Make sure BlackHole 2ch is the master clock in the Multi-Output Device (sets sample rate to 48kHz)
Play audio and verify you hear it through your speakers/headphones

Chrome opens and immediately closes

ChromeDriver version must match Chrome version exactly
Re-run the xattr + codesign steps from Installation
Open chromedriver once manually and allow it in Privacy & Security

Stuck at "Waiting to be admitted"

The host needs to admit the bot from the meeting participants panel
Increase timeout: --admission-timeout 600

Model download on first run

mlx-community/whisper-large-v3-turbo (~800 MB) downloads once to ~/.cache/huggingface/. Pre-download it before your first meeting:

python3 -c "import mlx_whisper, numpy as np; mlx_whisper.transcribe(np.zeros(16000, dtype='float32'), path_or_hf_repo='mlx-community/whisper-large-v3-turbo')"

Contributing

This project is actively evolving. We're looking to add new features and improve the overall experience — contributions, ideas, and feedback are very welcome.

Some directions we're exploring:

Speaker diarization (who said what)
Automatic meeting summary generation
Support for other meeting platforms (Zoom, Teams)
Zoom and calendar integrations
Improved noise filtering and audio quality

Feel free to open an issue, submit a pull request, or reach out directly if you want to collaborate.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
templates		templates
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
audio.py		audio.py
bot.py		bot.py
run.sh		run.sh
server.py		server.py
setup.sh		setup.sh
transcriber.py		transcriber.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Le Scribe 📜

Requirements

Installation

1. Install Python dependencies

2. Set up speaker diarization (HuggingFace)

2. Install BlackHole

3. Fix Python SSL certificates (Python 3.14+ only)

4. Install ChromeDriver

5. Configure Audio MIDI Setup (one-time)

Running

Web UI (recommended)

Command line

How it works

CLI options

Transcript output

Troubleshooting

All audio chunks skipped as silence (RMS=0.00000)

Chrome opens and immediately closes

Stuck at "Waiting to be admitted"

Model download on first run

Contributing

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Le Scribe 📜

Requirements

Installation

1. Install Python dependencies

2. Set up speaker diarization (HuggingFace)

2. Install BlackHole

3. Fix Python SSL certificates (Python 3.14+ only)

4. Install ChromeDriver

5. Configure Audio MIDI Setup (one-time)

Running

Web UI (recommended)

Command line

How it works

CLI options

Transcript output

Troubleshooting

All audio chunks skipped as silence (RMS=0.00000)

Chrome opens and immediately closes

Stuck at "Waiting to be admitted"

Model download on first run

Contributing

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages