https://arxiv.org/abs/2305.06908
CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model (Zhen Ye, Wei Xue, Xu Tan, Jie Chen, Qifeng Liu, Yike Guo)
consistency model을 speech/singing voice synthesis에 적용해본 사례가 나왔네요.
#speech #audio_synthesis