Is the pyannote diarization pipeline very sensitive to language? #1821

Open
ywangwxd opened this issue Dec 30, 2024 · 0 comments


ywangwxd commented Dec 30, 2024

Tested versions

pyannote/speaker-diarization-3.1

System information

Ubuntu, pyannote/speaker-diarization-3.1

Issue description

How should I improve its performance on Chinese?

Minimal reproduction example (MRE)

I tested the pipeline on a Chinese audio file and found the diarization results poor, even in easy cases with long stretches of female speech and male speech. The same test on an English audio file gives quite good results. To reproduce, take any audio file containing Chinese speech from multiple speakers. To make the diarization easy, choose one with clearly distinct speakers, e.g., one male and one female speaker.
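A minimal sketch of the reproduction described above, in case it helps to compare the Chinese and English runs side by side. The file path, the Hugging Face token, and the helper names `run_diarization` and `speaking_time` are placeholders introduced here for illustration, not part of the original report. The `use_auth_token` keyword is assumed to match pyannote.audio 3.1 and may differ in other versions.

```python
from collections import defaultdict


def run_diarization(audio_path, hf_token, num_speakers=None):
    """Run pyannote/speaker-diarization-3.1 on one file.

    Returns a list of (start_seconds, end_seconds, speaker_label) tuples.
    `audio_path` and `hf_token` are placeholders supplied by the caller.
    """
    # Imported lazily so the helper below can be used without pyannote installed.
    from pyannote.audio import Pipeline

    pipeline = Pipeline.from_pretrained(
        "pyannote/speaker-diarization-3.1",
        use_auth_token=hf_token,  # assumption: keyword name as in pyannote.audio 3.1
    )
    # Optionally constrain the number of speakers (e.g. 2 for one male, one female).
    kwargs = {} if num_speakers is None else {"num_speakers": num_speakers}
    diarization = pipeline(audio_path, **kwargs)
    return [
        (turn.start, turn.end, speaker)
        for turn, _, speaker in diarization.itertracks(yield_label=True)
    ]


def speaking_time(segments):
    """Sum the speaking time per speaker label from (start, end, speaker) tuples."""
    totals = defaultdict(float)
    for start, end, speaker in segments:
        totals[speaker] += end - start
    return dict(totals)
```

Calling `speaking_time(run_diarization("chinese_sample.wav", "<your token>", num_speakers=2))` on both the Chinese and the English file (file names hypothetical) gives a quick per-speaker summary; for the easy two-speaker case described above, a result with many short fragments or more than two labels would make the problem easier to show to the maintainers.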
