https://arxiv.org/abs/2305.06908

CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model (Zhen Ye, Wei Xue, Xu Tan, Jie Chen, Qifeng Liu, Yike Guo)

A case of applying consistency models to speech and singing voice synthesis has come out.

#speech #audio_synthesis