This is an experiment in combining speech-to-text with text-to-speech and language generation.
You are super welcome to help out. <3
After cloning the repo you need the React dev server, the Python server, and the text-to-speech Docker container all running. Follow the steps for each to get it working on your machine.
npm install
export NODE_OPTIONS=--openssl-legacy-provider
npm start

(The NODE_OPTIONS export works around an OpenSSL error on Node 17+, so set it before running npm start.)
cd python
bash run_local.sh
This spins up a local server running GPT-2 on FastAPI.
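From the React side you can then ask the server for a generated reply. A minimal client sketch; note that the endpoint path (`/generate`), the request payload, and the response field are assumptions, so check run_local.sh and the FastAPI routes for the real ones:

```javascript
// Hypothetical client for the local GPT-2 FastAPI server.
// The URL, route, and payload shape below are assumptions.
const GPT2_URL = 'http://localhost:8000/generate';

// Build the fetch options for a generation request (pure, easy to test).
function buildGenerateRequest(prompt, maxLength = 50) {
  return {
    method: 'POST',
    headers: { 'Content-Type': 'application/json' },
    body: JSON.stringify({ text: prompt, max_length: maxLength }),
  };
}

// Fetch a generated continuation for the given prompt.
async function generateReply(prompt) {
  const res = await fetch(GPT2_URL, buildGenerateRequest(prompt));
  if (!res.ok) throw new Error(`GPT-2 server error: ${res.status}`);
  const data = await res.json();
  return data.text; // assumed response field
}
```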
The text-to-speech server uses synesthesiam's Mozilla TTS Docker image: https://github.com/synesthesiam/docker-mozillatts
docker run -it -p 5002:5002 synesthesiam/mozillatts
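Once the container is up, it serves WAV audio at GET /api/tts?text=... on port 5002 (the same URL the playAudio helper uses). A small sketch for building that URL and sanity-checking the server from Node 18+ (which ships a built-in fetch):

```javascript
// Build the TTS request URL for the local Mozilla TTS container.
// Port 5002 and the /api/tts?text= route come from the docker command above.
function ttsUrl(text, base = 'http://localhost:5002') {
  return `${base}/api/tts?text=${encodeURIComponent(text)}`;
}

// Quick sanity check that the container is responding (requires Node 18+).
async function checkTts() {
  const res = await fetch(ttsUrl('hello world'));
  console.log(res.ok ? 'TTS server is up' : `TTS error: ${res.status}`);
}
```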
Speech recognition in the browser uses react-speech-recognition: https://www.npmjs.com/package/react-speech-recognition
import React from 'react'
import SpeechRecognition, { useSpeechRecognition } from 'react-speech-recognition'
import { playAudio } from './App.js'

const Dictaphone = () => {
  const { transcript, resetTranscript } = useSpeechRecognition()

  if (!SpeechRecognition.browserSupportsSpeechRecognition()) {
    return null
  }

  return (
    <div>
      <button onClick={SpeechRecognition.startListening}>Start</button>
      <button onClick={SpeechRecognition.stopListening}>Stop</button>
      <button onClick={resetTranscript}>Reset</button>
      <button onClick={() => playAudio(transcript)}>Transcribe</button>
      <p>{transcript}</p>
    </div>
  )
}

export default Dictaphone
Playing the TTS audio from inside React:
export async function playAudio(text) {
  // Request a WAV clip from the local Mozilla TTS server and play it.
  const audio = new Audio(`http://localhost:5002/api/tts?text=${encodeURIComponent(text)}`);
  audio.type = 'audio/wav';
  try {
    await audio.play();
    console.log('Playing...');
  } catch (err) {
    console.error('Failed to play: ' + err);
  }
}
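Chaining the three pieces together: feed the transcript to GPT-2, then speak the result. This is a sketch assuming a hypothetical generateReply(prompt) helper that calls the local GPT-2 server (not part of the repo as shown); playAudio is the helper above. GPT-2 typically echoes the prompt at the start of its output, so a small pure helper strips it before playback:

```javascript
// Strip the echoed prompt from the front of generated text so only
// the continuation is spoken (pure helper).
function stripPrompt(generated, prompt) {
  return generated.startsWith(prompt)
    ? generated.slice(prompt.length).trim()
    : generated.trim();
}

// transcript -> GPT-2 -> TTS playback.
// generateReply is a hypothetical client for the FastAPI server.
async function respondTo(transcript) {
  const reply = await generateReply(transcript);
  playAudio(stripPrompt(reply, transcript));
}
```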