Skip to content

Latest commit

 

History

History
7 lines (4 loc) · 220 Bytes

201125 Streaming end-to-end multi-talker speech recognition.md

File metadata and controls

7 lines (4 loc) · 220 Bytes

https://arxiv.org/abs/2011.13148

Streaming end-to-end multi-talker speech recognition (Liang Lu, Naoyuki Kanda, Jinyu Li, Yifan Gong)

역시나 rnn-t 기반. efficient rnn-t 구현이 필요하다...!

#transducer #asr