Upgrading Kokoro: natural TTS for short bursts
•
16
@to-be
There are more details at https://hf.co/hexgrad/Kokoro-82M/discussions/21 and my Discord DMs are open if you have more questions, but essentially I am looking for segmented text-audio pairs: likely .txt
and .wav
pairs, with each .txt
being ~500 characters or less (needs to fit inside 512 token context hard limit) and the .wav
matching the text.