Skip to content

Synthetic data augmentation technique via LLM for Automatic Speech Recognition fine tuning.

License

Notifications You must be signed in to change notification settings

my-north-ai/semantic_audio_filtering

Repository files navigation

Sematic Audio Filtering for ASR

Enhancing Automatic Speech Recognition: Effects of Semantic Audio Filtering on Models Performance Overview of the Methodology applied

The folders are distributed in the following order:

Filtering Framework-> All the steps for the creation of the filtering methods of synthetic audio

tts_data_augmentation -> Serves as the folder for all the files and the scripts to create synthetically generated audio

env.example -> example of how the .env file should be structured

The collection of Whisper Models, which have the best performance in our experiments, is available at :

Whisper-Large-v3: https://huggingface.co/my-north-ai/whisper-large-v3-pt

Whisper-Medium: https://huggingface.co/my-north-ai/whisper-medium-pt

Whisper-Small: https://huggingface.co/my-north-ai/whisper-small-pt

About

Synthetic data augmentation technique via LLM for Automatic Speech Recognition fine tuning.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages