Sematic Audio Filtering for ASR

Enhancing Automatic Speech Recognition: Effects of Semantic Audio Filtering on Models Performance

Filtering Framework-> All the steps for the creation of the filtering methods of synthetic audio

tts_data_augmentation -> Serves as the folder for all the files and the scripts to create synthetically generated audio

env.example -> example of how the .env file should be structured

The collection of Whisper Models, which have the best performance in our experiments, is available at :

Name		Name	Last commit message	Last commit date
Latest commit History 79 Commits
filtering_framework		filtering_framework
finetuning/args		finetuning/args
images		images
tts_data_augmentation		tts_data_augmentation
utils		utils
.env.example		.env.example
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
poetry.lock		poetry.lock
process_data_for_filtering.py		process_data_for_filtering.py
process_hf_dataset.py		process_hf_dataset.py
pyproject.toml		pyproject.toml
whisper_evaluation.py		whisper_evaluation.py
whisper_finetuning.py		whisper_finetuning.py