Range value of arousal, valence, dominance #22

trfnhle · 2023-01-03T11:37:54Z

I wonder what the range value of arousal, valence, and dominance is. As far as I know, model output is a logit vector size of 3 representing that feature and looks like its values range [0, 1]. I see that you use MSP-Conversation Corpus for fine-tuning. But when I looked at The MSP-Conversation Corpus paper paperlink, they mentioned that
"Notice that the values of the traces are in the range between -100 and 100. The figure shows that extreme values are uncommon. Most of the annotations are concentrated between -40 to 40 for valence, -20 to 50 for arousal, and -20 to 40 for dominance"

Do you guys normalize that feature, or do something related?

hagenw · 2023-01-03T12:08:33Z

Yes, databases tend to use different scales for arousal/valence/dominance like 0..5.
We normalize all scales to 0..1 for training. During inference most of the values returned by the model are in this range, but it can happen that you also get some values outside of that range.

bahtuchi · 2025-01-11T14:30:31Z

I just created a 10-second, completely silent .wav file to test the model. If i understood correctly, the values should all be 0.5, i.e. completely neutral, right?
I used the following setting:
interface = audinterface.Feature( model.labels(‘logits’), process_func=model, process_func_args={ ‘outputs’: ‘logits’, }, sampling_rate=sampling_rate, process_func_applies_sliding_window=False, win_dur=1, hop_dur=0.5, resample=True, verbose=True, )
I got 10 values, i.e. arousal, dominance and valence and they were always (0.55; 0.61; 0.64). How can this be explained?
I mean it is good that there isnt any variation but I am a bit confused why they are not all 0.5.

hagenw · 2025-01-13T14:21:49Z

The model was not trained on non-speech input (like silence) and might hence not be able to abstract to other inputs like silence or sound of objects. It is assumed that you always use a voice activity detection (VAD) and pass on only speech to the model.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Range value of arousal, valence, dominance #22

Range value of arousal, valence, dominance #22

trfnhle commented Jan 3, 2023 •

edited

Loading

hagenw commented Jan 3, 2023

bahtuchi commented Jan 11, 2025 •

edited

Loading

hagenw commented Jan 13, 2025

Range value of arousal, valence, dominance #22

Range value of arousal, valence, dominance #22

Comments

trfnhle commented Jan 3, 2023 • edited Loading

hagenw commented Jan 3, 2023

bahtuchi commented Jan 11, 2025 • edited Loading

hagenw commented Jan 13, 2025

trfnhle commented Jan 3, 2023 •

edited

Loading

bahtuchi commented Jan 11, 2025 •

edited

Loading