C3Imaging

All

14 repositories

joint-speech-text
Public
Code repository for the paper: Joint Speech-Text Embeddings for Multitask Speech Processing
Python
•0•0•0•0•Updated Oct 10, 2024Oct 10, 2024
speech-augmentation
Public
Python
•0•4•0•0•Updated May 21, 2024May 21, 2024
child_tts_fastpitch
Public
Fastpitch text-to-speech (TTS) model for generating high-quality synthetic child speech. This study uses the transfer learning training pipeline. The approach involved finetuning a multi-speaker TTS model to work with child speech. We use the publicly available MyST dataset (55 hours) for our finetuning experiments.
0•4•0•0•Updated Jan 25, 2024Jan 25, 2024
childASR_w2v2
Public
This repository provides trained checkpoints for finetuning across different child speech datasets for improving the performance on ASR for child speech. .
0•2•0•0•Updated Oct 13, 2023Oct 13, 2023
whisper_non_native_child_asr
Public
Python
•0•1•0•0•Updated Sep 20, 2023Sep 20, 2023
child_asr_conformer
Public
A comparative analysis between Conformer Transducer, Whisper and Wav2vec2 for improving the child speech recognition
0•2•0•0•Updated Aug 4, 2023Aug 4, 2023
whisper_child_asr
Public
0•5•1•0•Updated May 23, 2023May 23, 2023
SyntheticHeadPose
Public
Python
•
MIT License
•0•2•0•0•Updated Jan 29, 2023Jan 29, 2023
ChildTTS
Public
This repository contains synthetic TTS generated audio files for presenting our research work
0•1•0•0•Updated Mar 24, 2022Mar 24, 2022
Deep-Learning-Techniques
Public
GNU General Public License v3.0
•3•12•0•0•Updated Jan 23, 2022Jan 23, 2022
C3Imaging.github.io
Public
Github Page for C3Imaging
HTML
•0•0•0•0•Updated Jan 20, 2022Jan 20, 2022
Image-Depth-Analysis
Public
GNU General Public License v3.0
•0•0•0•0•Updated Jun 30, 2017Jun 30, 2017
Security-and-Block-Chain
Public
GNU General Public License v3.0
•0•0•0•0•Updated Jun 30, 2017Jun 30, 2017
Biometrics
Public
GNU General Public License v3.0
•0•0•0•0•Updated Jun 30, 2017Jun 30, 2017