ORTOptimizer for wav2vec2-bert #2232

aconeil · 2025-04-16T18:24:37Z

What does this PR do?

Add wav2vec2-bert to the list of possible models for optimization

Fixes #2221

Who can review?

@IlyasMoutawwakil

IlyasMoutawwakil · 2025-04-18T13:51:03Z

Please add it to the tests as well.

eingrid · 2025-04-20T14:55:31Z

Shouldn't we also add
"wav2vec2-bert": NormalizedTextConfig here:

optimum/optimum/utils/normalized_config.py

Lines 232 to 236 in abd7da3

    
    # Contribution note: Please add new models in alphabetical order
    _conf = {
        "albert": NormalizedTextConfig,
        "bart": BartLikeNormalizedTextConfig,
        "bert": NormalizedTextConfig,

IlyasMoutawwakil · 2025-04-21T08:38:49Z

@eingrid yes that as well

aconeil · 2025-04-21T19:46:35Z

Wav2Vec2-Bert actually takes mel-spectrograms as input, which is closer to the SpeechT5 input than to the wav2vec inputs.
I tried the suggested changes with both NormalizedTextConfig and T5LikeNormalizedTextConfig, without success. I then tried adapting optimum/exporters/onnx/model_configs.py to add a dedicated class for the model, but I get an error with both the DummyAudioInputGenerator (since the input is different) and the DummySpeechT5InputGenerator (since there are no speaker embeddings).
Do you have any suggestions?

aconeil · 2025-04-23T19:36:57Z

I tried with Speech2TextDummyAudioInputGenerator as well today without success. The input for Wav2Vec2Bert is input_features with a size of [1, x, 160].
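
To make the expected layout concrete, here is a minimal sketch of dummy inputs shaped as described above: `input_features` of size `[batch, frames, 160]`. The function names and the use of plain nested lists are assumptions for illustration; this is not one of optimum's dummy input generators, which would return framework tensors.

```python
import random

FEATURE_SIZE = 160  # last dimension reported in the comment above


def make_dummy_input_features(batch_size=1, num_frames=100,
                              feature_size=FEATURE_SIZE, seed=0):
    """Build random input_features as nested lists: [batch, frames, feature]."""
    rng = random.Random(seed)
    return [
        [[rng.uniform(-1.0, 1.0) for _ in range(feature_size)]
         for _ in range(num_frames)]
        for _ in range(batch_size)
    ]


def shape(nested):
    """Return the shape of a rectangular nested list."""
    dims = []
    while isinstance(nested, list):
        dims.append(len(nested))
        nested = nested[0]
    return dims


dummy = {"input_features": make_dummy_input_features(num_frames=50)}
print(shape(dummy["input_features"]))  # [1, 50, 160]
```

A generator written for raw waveforms would presumably produce a two-dimensional `[batch, samples]` input instead, which would explain the mismatch reported with the audio generators.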

aconeil added 3 commits April 14, 2025 14:40

ORTOptimizer for wav2vec2-bert

5166532

Add wav2vec2-bert to the list of possible models for optimization

ORTOptimizer for wav2vec2-bert

d59e023

Merge pull request #1 from aconeil/aconeil-patch-1

16a52bc

ORTOptimizer for wav2vec2-bert

aconeil marked this pull request as draft April 16, 2025 18:26
