Cannot converge on image compression task #31

dwgan · 2025-04-13T06:47:31Z

Hi,

This work looks great.

I was trying to implement it on image compression task. I use this model as follows, and the input shape is 16, 32, 96, 96.

    def forward(self, x):
        b, c, h, w = x.shape
        x = x.reshape(b, c, -1).permute(0, 2, 1)
        x = self.act1(x)
        x = self.drop1(x)
        x = self.fc1(x)
        x = self.act2(x)
        x = self.drop2(x)
        x = self.fc2(x)
        x = x.permute(0, 2, 1).reshape(b, c, h, w)
        return x

However, my model cannot converge during training.

Training:  75%|██████████████                    | 1171/1563 [02:38<00:53,  7.36it/s, loss=nan]

Could you please give me some advice?

Thanks you very much.

The text was updated successfully, but these errors were encountered:

truong04 · 2025-05-13T15:19:02Z

I face the same problem in NLP, where my model loss does not decrease

dwgan · 2025-05-13T15:25:51Z

@truong04 I finally found that the problem was caused by using the wrong function. We should use KAT_Group2D to process the image signal, otherwise it cannot converge.

dwgan closed this as completed May 17, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Cannot converge on image compression task #31

Cannot converge on image compression task #31

dwgan commented Apr 13, 2025

truong04 commented May 13, 2025

Uh oh!

dwgan commented May 13, 2025

Uh oh!

Cannot converge on image compression task #31

Cannot converge on image compression task #31

Comments

dwgan commented Apr 13, 2025

truong04 commented May 13, 2025

Uh oh!

dwgan commented May 13, 2025

Uh oh!