Training Setting #45

HalvesChen · 2024-12-24T10:04:24Z

Hi,
I notice that your method requires much more epoch(330) than other methods. Have you compared the reconstruction performance of the common setting (epoch=50)?

ShiFengyuan1999 · 2024-12-25T04:06:47Z

Hi @HalvesChen, we follow MAGVIT-v2 and adopt a long training. VQ methods with small codebook size (like 1024 and 4096) and low code dimension (e.g., 8 or 16) may saturate quickly in reconstruction performance, thus requiring fewer training epochs. IBQ gains further improvements due to the longer training, benefitting from large model compacity, i.e., large-size codebook, high-dimension codes, large model size, and high codebook utilization.

RobertLuo1 closed this as completed Dec 28, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Training Setting #45

Training Setting #45

HalvesChen commented Dec 24, 2024

ShiFengyuan1999 commented Dec 25, 2024

Training Setting #45

Training Setting #45

Comments

HalvesChen commented Dec 24, 2024

ShiFengyuan1999 commented Dec 25, 2024