v0.0.46

aconchillo released this 20 Oct 01:28

ee5ae0d

Added

Added audio_passthrough parameter to STTService. If enabled, audio frames are pushed downstream so that other processors can use them.
Added input parameter options for PlayHTTTSService and PlayHTHttpTTSService.
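
As a rough illustration of what audio_passthrough means, here is a toy sketch that does not use Pipecat's classes. ToySTT, AudioFrame, TextFrame, the placeholder transcript and the default value of False are all hypothetical stand-ins chosen for this example; the real STTService may behave differently in its details.

```python
from dataclasses import dataclass
from typing import List, Union


@dataclass
class AudioFrame:
    # Hypothetical frame type carrying raw audio bytes.
    audio: bytes


@dataclass
class TextFrame:
    # Hypothetical frame type carrying a transcript.
    text: str


Frame = Union[AudioFrame, TextFrame]


class ToySTT:
    """Toy stand-in for an STT processor (not Pipecat's STTService)."""

    def __init__(self, audio_passthrough: bool = False):
        # The default of False is an assumption made for this sketch only.
        self.audio_passthrough = audio_passthrough

    def process(self, frame: Frame) -> List[Frame]:
        # Non-audio frames are forwarded unchanged.
        if not isinstance(frame, AudioFrame):
            return [frame]

        out: List[Frame] = []
        if self.audio_passthrough:
            # Forward the original audio so later processors can use it.
            out.append(frame)
        # Placeholder "transcription"; a real service would call an STT engine.
        out.append(TextFrame(text=f"<transcript of {len(frame.audio)} bytes>"))
        return out
```

With passthrough disabled, the toy processor emits only the transcript frame; with it enabled, the audio frame is emitted as well, ahead of the transcript.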

Changed

Changed DeepgramSTTService model to nova-2-general.
Moved SileroVAD audio processor to processors.audio.vad.
Module utils.audio is now audio.utils. A new resample_audio function has been added.
PlayHTTTSService now uses PlayHT websockets instead of HTTP requests.
The previous PlayHTTTSService HTTP implementation is now PlayHTHttpTTSService.
PlayHTTTSService and PlayHTHttpTTSService now use a voice_engine of PlayHT3.0-mini, which allows for multi-lingual support.
Renamed OpenAILLMServiceRealtimeBeta to OpenAIRealtimeBetaLLMService to match other services.

Deprecated

LLMUserResponseAggregator and LLMAssistantResponseAggregator are mostly deprecated; use OpenAILLMContext instead.
The vad package is now deprecated and audio.vad should be used instead. The vad package will be removed in a future release.

Fixed

Fixed an issue that would cause an error if no VAD analyzer was passed to LiveKitTransport params.
Fixed SileroVAD processor to support interruptions properly.

Other

Added examples/foundational/07-interruptible-vad.py. This is the same as 07-interruptible.py but uses the SileroVAD processor instead of passing the VADAnalyzer in the transport.
