This is an example of integrating with Ultravox using a websocket connection in Python. It uses the websockets
library from PyPI.
See the documentation for more information, such as other kinds of data messages that can be sent and received.
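As a rough sketch of the overall flow: you first create a call through the REST API, then connect to the returned join URL over a websocket. The endpoint, header, and field names below (`/api/calls`, `X-API-Key`, `serverWebSocket`, `joinUrl`) reflect common Ultravox usage but should be treated as assumptions; confirm the exact shapes against the API reference.

```python
import json
import os
import urllib.request

# Assumed endpoint; see the Ultravox API reference for the authoritative URL.
API_URL = "https://api.ultravox.ai/api/calls"

def build_call_request(system_prompt: str, voice: str = "Mark",
                       sample_rate: int = 48000) -> tuple[dict, dict]:
    """Build the headers and body for creating a websocket call.

    Field names here (serverWebSocket, inputSampleRate, ...) are assumptions;
    confirm them against the Ultravox API reference.
    """
    headers = {
        "X-API-Key": os.environ.get("ULTRAVOX_API_KEY", ""),
        "Content-Type": "application/json",
    }
    body = {
        "systemPrompt": system_prompt,
        "voice": voice,
        "medium": {
            "serverWebSocket": {
                "inputSampleRate": sample_rate,
                "outputSampleRate": sample_rate,
            }
        },
    }
    return headers, body

def create_call(system_prompt: str) -> str:
    """POST the call request and return the join URL for the websocket."""
    headers, body = build_call_request(system_prompt)
    req = urllib.request.Request(API_URL, data=json.dumps(body).encode(),
                                 headers=headers, method="POST")
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)["joinUrl"]
```

With the returned join URL in hand, you would open the connection with something like `websockets.connect(join_url)` and begin exchanging audio and data messages.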
- Real-time bidirectional audio streaming via websocket
- Configurable system prompts, voices, etc.
- Robust audio handling alongside full interruption support
- Example "Dr. Donut" drive-thru attendant implementation
- Built-in client-side tool example for use with Dr. Donut
- uv for python environment and package management
- Valid Ultravox API key
- Start by setting your API key environment variable.
export ULTRAVOX_API_KEY=<your_key>
- Now you can run the example.
uv run websocket_client.py
Note that on the first run, uv will set up your Python environment and download relevant dependencies.
- There are several command-line options to customize the call if you like.
# See all available options
uv run websocket_client.py --help
# Use a custom voice and system prompt
uv run websocket_client.py -V Mark --system-prompt "your prompt here"
# Run with debug logging
uv run websocket_client.py -v
This project uses uv with ruff for formatting and linting.
uv run ruff format
uv run ruff check --fix
If you'd like to use this as a starting point for your own code, here are some important points to understand:
- User audio is constantly streamed in its own task. Ultravox relies on continuous audio for timing and interruptions, so it's important that your implementation do something similar. You should aim to send 20ms of audio every 20ms, though different frame sizes also work.
- The example uses 48kHz audio for input and output. This is configurable, but be sure to pick and set an appropriate rate for your use case.
- The example uses a local speaker and microphone. This is useful for demonstration, but any real server-side implementation will presumably want to pipe audio to/from somewhere else, such as your client over webRTC. (To connect to Ultravox Realtime directly from a client, use one of the webRTC client SDKs.)
- The example handles PlaybackClearBuffer messages so that 30s of audio can be buffered on the client without impacting interruptions. If you choose to remove PlaybackClearBuffer handling, you should also reduce `clientBufferSizeMs` to ensure interruptions can still terminate generated audio promptly. The default is 60ms, which strikes a reasonable balance between perceived interrupt latency and possible audio underflow.
- The example adds an `apiVersion` query parameter in the URL. This parameter is optional, as there is currently only one version of the websocket API (version 1).
- This example ignores user transcripts and doesn't handle agent transcripts robustly. See the webRTC client SDK for an example of more complete handling.
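The continuous-streaming and buffer-clearing points above can be sketched roughly as follows. This is not the example's actual implementation: the `mic` object, the `playback_clear_buffer` message type name, and the 16-bit mono PCM frame format are assumptions for illustration; check the data message documentation for the real wire format.

```python
import asyncio
import json

SAMPLE_RATE = 48000
FRAME_MS = 20
BYTES_PER_SAMPLE = 2  # 16-bit mono PCM (an assumption; match your capture format)
# Bytes in one 20ms frame: 48000 samples/s * 0.02s * 2 bytes = 1920
FRAME_BYTES = SAMPLE_RATE * FRAME_MS // 1000 * BYTES_PER_SAMPLE

def frames(pcm: bytes, frame_bytes: int = FRAME_BYTES):
    """Split a PCM byte buffer into fixed-size frames, zero-padding the last
    frame so the sender can keep a steady cadence even at end of input."""
    for i in range(0, len(pcm), frame_bytes):
        chunk = pcm[i:i + frame_bytes]
        yield chunk + b"\x00" * (frame_bytes - len(chunk))

async def send_mic_audio(ws, mic):
    """Send one 20ms frame every 20ms, forever.

    `ws` is a connected websocket (e.g. from websockets.connect(join_url));
    `mic` is any object with a read(nbytes) method, a hypothetical stand-in
    for your audio source. Sending silence when the mic is idle keeps the
    stream continuous, which Ultravox relies on for timing and interruptions.
    """
    while True:
        frame = mic.read(FRAME_BYTES) or b"\x00" * FRAME_BYTES
        await ws.send(frame)  # binary frame = audio (assumed convention)
        await asyncio.sleep(FRAME_MS / 1000)

async def receive(ws, playback_buffer: list):
    """Buffer agent audio; drop it all when the server clears playback."""
    async for message in ws:
        if isinstance(message, bytes):
            playback_buffer.append(message)  # agent audio awaiting playout
        else:
            msg = json.loads(message)
            # Assumed message type name; check the data message docs.
            if msg.get("type") == "playback_clear_buffer":
                playback_buffer.clear()  # user interrupted: discard queued audio
```

Note that sleeping a fixed 20ms per iteration drifts over time; a production sender should pace frames against a monotonic clock instead.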
For more information see the Ultravox Documentation.