Compare to SimVQ #43

Closed
suruoxi opened this issue Dec 12, 2024 · 2 comments
Comments

@suruoxi

suruoxi commented Dec 12, 2024

Have you compared IBQ to SimVQ? I think SimVQ has the similar motivation of "optimize all codebook embeddings".

@ShiFengyuan1999
Collaborator

Hi @suruoxi, thanks for your interest in our work.

SimVQ has a similar motivation to IBQ, but we adopt different methods to optimize all codebook embeddings: SimVQ optimizes a linear transformation $W \in R^{D \times D}$, while our IBQ optimizes the codes $C \in R^{K \times D}$ themselves, where $K \gg D$.
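
For concreteness, here is a minimal PyTorch sketch of the two parameterizations as described above. The class names, code dimension, and random initialization are illustrative assumptions, and neither snippet reproduces the actual SimVQ or IBQ training procedure (e.g., how gradients are routed through the quantization step); it only contrasts which parameters are trained.

```python
import torch
import torch.nn as nn

K, D = 262144, 8  # illustrative codebook size and code dimension

class SimVQStyleCodebook(nn.Module):
    """Keeps a fixed base codebook and learns only a D x D linear map W,
    so all K codes move whenever W is updated (D*D trainable values)."""
    def __init__(self, K, D):
        super().__init__()
        self.register_buffer("base_codes", torch.randn(K, D))  # frozen base codes
        self.W = nn.Linear(D, D, bias=False)                   # learned W in R^{D x D}

    def codebook(self):
        return self.W(self.base_codes)  # effective codes: base_codes @ W^T

class IBQStyleCodebook(nn.Module):
    """Learns the K x D code matrix itself (K*D trainable values, K >> D)."""
    def __init__(self, K, D):
        super().__init__()
        self.codes = nn.Parameter(torch.randn(K, D))  # learned C in R^{K x D}

    def codebook(self):
        return self.codes

simvq_like = SimVQStyleCodebook(K, D)
ibq_like = IBQStyleCodebook(K, D)
print(sum(p.numel() for p in simvq_like.parameters()))  # D*D = 64
print(sum(p.numel() for p in ibq_like.parameters()))    # K*D = 2,097,152
```

The parameter counts printed at the end make the contrast concrete: the SimVQ-style path trains only a $D \times D$ transform, while the IBQ-style path trains all $K \times D$ code entries.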

SimVQ and IBQ cannot be compared directly, because SimVQ is trained on ImageNet 128x128 while IBQ is trained on ImageNet 256x256. Moreover, SimVQ only reports reconstruction performance and does not provide generation results. Still, we can list some numbers from the papers for reference.

| Method | Train Resolution | Codebook Size | rFID $\downarrow$ |
| --- | --- | --- | --- |
| SimVQ | 128 $\times$ 128 | 262144 | 1.99 |
| OpenMAGVIT2 | 128 $\times$ 128 | 262144 | 1.18 |
| OpenMAGVIT2 | 256 $\times$ 256 | 262144 | 1.17 |
| IBQ | 256 $\times$ 256 | 262144 | 1.00 |

@suruoxi
Author

suruoxi commented Dec 16, 2024

Thanks for the reply; the rFID of IBQ is impressive.

@suruoxi suruoxi closed this as completed Dec 16, 2024