Lumeo is a real-time speech processing application that utilizes Speechmatics Flow for audio input and output. It provides a user-friendly interface for voice interactions, allowing users to perform various tasks through voice commands.
- Real-time speech recognition and transcription
- Audio playback of responses
- Integration with Speechmatics Flow for voice processing
- Support for custom tools (e.g., stock price queries, internet searches)
- Chainlit UI for interactive voice conversations
- Python 3.7 or higher
- Required Python packages (install via
pip
):chainlit
pyaudio
speechmatics-flow
python-dotenv
-
Clone the repository:
git clone https://github.com/yourusername/lumeo.git cd lumeo
-
Create a virtual environment and activate it:
python -m venv venv source venv/bin/activate # On Windows use `venv\Scripts\activate`
-
Install the required packages:
pip install -r requirements.txt
-
Set up your environment variables:
- Create a
.env
file in the root directory and add your Speechmatics authentication token:SPEECHMATICS_AUTH_TOKEN=your_auth_token_here
- Create a
-
Start the application:
chainlit run lumeo.py
-
Open your web browser and navigate to
http://localhost:8000
. -
Press the
P
key to start talking. The application will listen for your voice input and process it in real-time. -
The transcriptions will appear in the chat UI, and audio responses will be played back through your selected audio output device.
Lumeo automatically detects and uses your connected headphones or earphones for audio playback. Ensure your audio devices are properly connected before starting the application.
Contributions are welcome! If you have suggestions or improvements, feel free to open an issue or submit a pull request.