[Draft] [Whisper] Add labels' in the whisper output #2252


Open
wu6u3tw wants to merge 1 commit into base: master

Conversation

wu6u3tw (Contributor) commented Jul 14, 2025

The raw text output from whisper-large-v3 includes digits, and text normalization is handled in the accuracy_eval script.
To keep the digits from the output when it is matched against the label dict, I added digits and some symbols to the labels.
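
To illustrate the effect described above, here is a minimal sketch of how a label set can determine which characters survive in the evaluated text. The label lists, the chosen symbols, and the helper `keep_labeled_chars` are all hypothetical and are not taken from the accuracy_eval script; the actual labels in this PR may differ.

```python
import string

# Hypothetical label sets (assumption): the real lists in the
# accuracy_eval script may contain different characters.
BASE_LABELS = list(" '" + string.ascii_lowercase)
EXTENDED_LABELS = BASE_LABELS + list(string.digits) + list(".,-$%")


def keep_labeled_chars(text, labels):
    """Lowercase the text and drop every character that is not a label."""
    allowed = set(labels)
    return "".join(ch for ch in text.lower() if ch in allowed)


raw = "It costs $25.50 for 3 items"
without_digits = keep_labeled_chars(raw, BASE_LABELS)
with_digits = keep_labeled_chars(raw, EXTENDED_LABELS)
print(repr(without_digits))
print(repr(with_digits))
```

With the base labels the digits and symbols are silently dropped, while the extended labels keep them in the text.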

@wu6u3tw wu6u3tw requested a review from a team as a code owner July 14, 2025 06:44
github-actions bot (Contributor) commented Jul 14, 2025

MLCommons CLA bot All contributors have signed the MLCommons CLA ✍️ ✅

@wu6u3tw wu6u3tw force-pushed the dev-tinyinl-add_labels_in_accuracy_eval_whisper branch from bf35648 to b0faf8a Compare July 14, 2025 06:46
wu6u3tw (Contributor, Author) commented Jul 14, 2025

recheck

keithachorn-intel (Contributor) commented

I'm not sure this will align with the text normalization elsewhere in the reference. Numerical values were previously expanded to full words. Will have to check how the OpenAI normalizer handles numeric values.
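
The alignment concern can be sketched with a small word error rate calculation. This is an illustrative example only: `word_error_rate` is a hypothetical helper written for this sketch, not the metric implementation used by the reference, and the sentences are made up.

```python
def word_error_rate(reference, hypothesis):
    """Word-level Levenshtein distance divided by the reference length."""
    ref, hyp = reference.split(), hypothesis.split()
    # prev[j] is the edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1] / len(ref)


# Reference expanded to words, hypothesis left as digits (made-up sentences).
mismatched = word_error_rate("it costs twenty five dollars", "it costs 25 dollars")
# Both sides written the same way.
matched = word_error_rate("it costs twenty five dollars", "it costs twenty five dollars")
print(mismatched, matched)
```

If one side spells numbers out and the other keeps digits, a correct transcription is still scored as errors, so both sides need the same numeric normalization.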

@wu6u3tw wu6u3tw changed the title [Whisper] Add labels' in the whisper output [Draft] [Whisper] Add labels' in the whisper output Jul 15, 2025
hanyunfan (Contributor) commented

> I'm not sure this will align with the text normalization elsewhere in the reference. Numerical values were previously expanded to full words. Will have to check how the OpenAI normalizer handles numeric values.

@wu6u3tw Could you review Keith’s question and provide additional context or a possible response?

3 participants