Train speech task with IBQ, but the codebook may collapse to a few values(quant_loss -> 0) #51

yufan-aslp · 2025-03-11T02:52:24Z

We introduced IBQ into speech-related tasks to model the intermediate hidden layer features of the speech encoder. However, during training, the quantization loss suddenly drops to zero at a certain stage, resulting in codebook collapse. Have you encountered a similar issue on your side?

ShiFengyuan1999 · 2025-03-13T07:57:24Z

Hi @yufan-aslp,
We did not observe this phenomenon during training on images. The sudden drop in quantization loss to zero may indicate that the model is overfitting to a very small subset of codes. To investigate this, you can print the selected indices during training and monitor the codebook usage. If this is indeed the case, we recommend modifying training hyperparameters such as the learning rate and loss weights.

RobertLuo1 closed this as completed Mar 17, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Train speech task with IBQ, but the codebook may collapse to a few values(quant_loss -> 0) #51

Train speech task with IBQ, but the codebook may collapse to a few values(quant_loss -> 0) #51

yufan-aslp commented Mar 11, 2025

ShiFengyuan1999 commented Mar 13, 2025

Train speech task with IBQ, but the codebook may collapse to a few values(quant_loss -> 0) #51

Train speech task with IBQ, but the codebook may collapse to a few values(quant_loss -> 0) #51

Comments

yufan-aslp commented Mar 11, 2025

ShiFengyuan1999 commented Mar 13, 2025