[Whisper] Conflict of logic in accuracy_eval.py

Conflict of logic in accuracy_eval.py
2 steps are performed in the script (in order) for the text output of whisper.
1. [Filter](https://github.com/mlcommons/inference/blob/1bc3e998cb29a2ccb7635a5c74c875bf0c3b6432/speech2text/accuracy_eval.py#L31): only certain characters are accepted from the text output
e.g. only ‘a-z’, ‘ ’, ‘.’ is accepted

2. [Normalize](https://github.com/mlcommons/inference/blob/1bc3e998cb29a2ccb7635a5c74c875bf0c3b6432/speech2text/accuracy_eval.py#L89): convert certain phrases into another form
e.g. “one hundred dollar” -> “100$”.

Issues: Normalize will violate the filtering logic, in the example above
“100$” should be filtered out, but added because normalization comes 2nd

Fix
- Option 1: accept digits and dollar sign: PR filed https://github.com/mlcommons/inference/pull/2252 
  
- Option 2: do normalization before filtering
Risk: might change ref accuracy. 

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Whisper] Conflict of logic in accuracy_eval.py #2258

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Whisper] Conflict of logic in accuracy_eval.py #2258

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions