EchoScribe is a powerful real-time speech-to-text transcription application that leverages Deepgram's WebSocket API for highly accurate and efficient voice recognition.
- Real-time Speech-to-Text Conversion 📢➡️📝
- AI-powered Speech Recognition by Deepgram 🤖
- Low-latency Streaming via WebSocket 🚀
- Supports Multiple Languages & Models 🌍
Ensure you have the following installed before proceeding:
- Node.js (Recommended:
v18.x.xorv20.x.x) - Deepgram API Key 🔑
- Stable Internet Connection 🌐
Clone the repository and navigate into the project folder:
git clone https://github.com/yourusername/EchoScribe.git
cd EchoScribeInstall dependencies:
npm installSet up your Deepgram API key:
# Mac/Linux
export DEEPGRAM_API_KEY="your_actual_api_key"
# Windows (CMD)
set DEEPGRAM_API_KEY=your_actual_api_keyStart the server:
node server.jsSimply speak into your microphone and see real-time transcriptions displayed in your terminal or web interface.
Modify WebSocket URL parameters to adjust language/model settings as per your requirements.
If you face WebSocket failures, try these steps:
- ✅ Check API Key: Ensure
DEEPGRAM_API_KEYis correctly set. - 🔗 Verify WebSocket URL: Use
wss://api.deepgram.com/v1/listen. - 🌐 Check Internet Connection: Run
ping api.deepgram.com. - 🔄 Downgrade Node.js if needed: Use an LTS version (
v20.x.xorv18.x.x).
Add logging in server.js for better issue tracking:
console.log("Connecting to Deepgram...");
ws.on("open", () => console.log("✅ WebSocket connected"));
ws.on("close", () => console.log("❌ WebSocket disconnected"));
ws.on("error", (err) => console.error("WebSocket Error:", err));This project is licensed under the MIT License.
- Pareekshith P
- Arun Kumar K
- Mohammed Naseem
- Pozhilan A