You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
First of all, thank you for creating this application as a replacement for Google Translate.
The current language list only supports one type of Chinese, that is the simplified one.
It would be great if more variants of the Chinese language could be added, including Chinese (traditional) and Yue Chinese. By doing so, more Chinese users, especially those from Taiwan and Hong Kong, could benefit from your project.
Both Chinese (Traditional) and Yue Chinese / Chinese (Yue) are supported by NLLB and Whisper.
Thank you once again;)
The text was updated successfully, but these errors were encountered:
Thank you! From what I've seen whisper doesn't have different codes for traditional and simplified english, in fact it often gets confused between the two, NLLB instead should support both but I don't see yue chinese in the list of language codes, however I'll find out more and see what I can do 👍
Think it may be due to the fact that the Traditional/Simplified distinction only applies to the written language and not the spoken one, with Whisper only being able to designate spoken?
Think it may be due to the fact that the Traditional/Simplified distinction only applies to the written language and not the spoken one, with Whisper only being able to designate spoken?
Simply speaking, there are four settings in Chinese:
Mandarin + Simplified Chinese (Mainland China)
Mandarin + Traditional Chinese (Taiwan)
Cantonese (Yue) + Traditional Chinese (Hong Kong)
Cantonese (Yue) + Simplified Chinese (Malaysia)
Meanwhile, although both Taiwan and Hong Kong use traditional Chinese, they are not exactly the same, and many words/grammatical usages (mainly verbal ones) are not the same in both places. You may consider written Chinese a common symbol/ token of Chinese-speaking regions.
Back to our topic, Whisper can detect Cantonese/ Yue; however, the transcribed text would be written in Chinese.
If you check Whisper's Tokenizer, you can see both "zh" (Chinese/ Mandarin) and "yue" (Cantonese).
While I am unsure whether choosing "zh" will have simplified or traditional Chinese as output, choosing "yue" would have traditional Chinese as output.
First of all, thank you for creating this application as a replacement for Google Translate.
The current language list only supports one type of Chinese, that is the simplified one.
It would be great if more variants of the Chinese language could be added, including Chinese (traditional) and Yue Chinese. By doing so, more Chinese users, especially those from Taiwan and Hong Kong, could benefit from your project.
Both Chinese (Traditional) and Yue Chinese / Chinese (Yue) are supported by NLLB and Whisper.
Thank you once again;)
The text was updated successfully, but these errors were encountered: