Skip to content
Change the repository type filter

All

    Repositories list

    • Code repository for the paper: Joint Speech-Text Embeddings for Multitask Speech Processing
      Python
      0000Updated Oct 10, 2024Oct 10, 2024
    • Python
      0400Updated May 21, 2024May 21, 2024
    • Fastpitch text-to-speech (TTS) model for generating high-quality synthetic child speech. This study uses the transfer learning training pipeline. The approach involved finetuning a multi-speaker TTS model to work with child speech. We use the publicly available MyST dataset (55 hours) for our finetuning experiments.
      0400Updated Jan 25, 2024Jan 25, 2024
    • This repository provides trained checkpoints for finetuning across different child speech datasets for improving the performance on ASR for child speech. .
      0200Updated Oct 13, 2023Oct 13, 2023
    • Python
      0100Updated Sep 20, 2023Sep 20, 2023
    • A comparative analysis between Conformer Transducer, Whisper and Wav2vec2 for improving the child speech recognition
      0200Updated Aug 4, 2023Aug 4, 2023
    • 0510Updated May 23, 2023May 23, 2023
    • Python
      MIT License
      0200Updated Jan 29, 2023Jan 29, 2023
    • ChildTTS

      Public
      This repository contains synthetic TTS generated audio files for presenting our research work
      0100Updated Mar 24, 2022Mar 24, 2022
    • GNU General Public License v3.0
      31200Updated Jan 23, 2022Jan 23, 2022
    • Github Page for C3Imaging
      HTML
      0000Updated Jan 20, 2022Jan 20, 2022
    • GNU General Public License v3.0
      0000Updated Jun 30, 2017Jun 30, 2017
    • GNU General Public License v3.0
      0000Updated Jun 30, 2017Jun 30, 2017
    • GNU General Public License v3.0
      0000Updated Jun 30, 2017Jun 30, 2017