
Gradient shape unexpected #3

Open · snykral opened this issue May 25, 2023 · 7 comments

Comments


snykral commented May 25, 2023

```
File ~\Python\PyTorch\RL\utils\optim.py:61, in Sophia.hutchinson(self, p, grad)
     59 def hutchinson(self, p, grad):
     60     u = torch.randn_like(grad)
---> 61     hessian_vector_product = torch.autograd.grad(grad.dot(u), p, retain_graph=True)[0]
     62     return u * hessian_vector_product

RuntimeError: 1D tensors expected, but got 4D and 4D tensors
```
Does it run on any network architecture?
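(For context: `torch.dot` is defined only for 1D tensors, so this Hutchinson step fails for any parameter whose gradient has more dimensions, e.g. a Conv2d weight. A minimal repro, with a made-up gradient shape:)

```python
import torch

# Hypothetical 4D gradient, shaped like a Conv2d weight (out_ch, in_ch, kH, kW).
grad = torch.randn(8, 3, 3, 3)
u = torch.randn_like(grad)

torch.dot(grad, u)
# RuntimeError: 1D tensors expected, but got 4D and 4D tensors
```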


bbbxyz commented May 25, 2023

Looks like a bug. torch.dot() only works on 1D vectors. You could try using torch.sum(grad * u) instead.
Unless you need this urgently, I'd suggest waiting for the official implementation to be released tomorrow.
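For reference, a minimal sketch of `hutchinson` with that substitution; `torch.sum(grad * u)` is the flattened inner product of `grad` and `u` and works for tensors of any shape. This assumes `grad` is still attached to the autograd graph (i.e. it was produced with `create_graph=True`); it is not the repo's official fix.

```python
import torch

def hutchinson(self, p, grad):
    # Random probe vector with the same shape as the gradient.
    u = torch.randn_like(grad)
    # sum(grad * u) == <grad, u> for any tensor shape (torch.dot is 1D-only).
    # grad must itself require grad, i.e. be computed with create_graph=True.
    hessian_vector_product = torch.autograd.grad(
        torch.sum(grad * u), p, retain_graph=True
    )[0]
    # u * (H u) is a one-sample Hutchinson estimate of the Hessian diagonal.
    return u * hessian_vector_product
```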

kyegomez (Owner) commented:

Just fixed it, try upgrading to the new version please!

kyegomez (Owner) commented:

> Looks like a bug. torch.dot() only works on 1D vectors. You could try using torch.sum(grad * u) instead. Unless you need this urgently, I'd suggest waiting for the official implementation to be released tomorrow.

"we are aiming to release tomorrow" -Lol aiming

snykral (Author) commented May 25, 2023

> Just fixed it, try upgrading to the new version please!

It worked, but now I'm facing the same issue as #7. Somehow, `grad.requires_grad` is `False` when it arrives at the optimizer.

Also, I had to comment out some lines in `__init__.py`, because the referenced files didn't ship with the library:

```python
#from experiments.training import trainer
#from Sophia.decoupled_sophia.decoupled_sophia import DecoupledSophia
```
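On the `requires_grad` point: a plain `loss.backward()` stores gradients that are detached from the graph, so a Hessian-vector product inside the optimizer has nothing to differentiate through. A sketch of a training step that keeps the backward graph alive; the model, the data, and the `Sophia` constructor call are placeholders, not the repo's documented API:

```python
import torch
import torch.nn.functional as F

model = torch.nn.Linear(10, 1)    # placeholder model
opt = Sophia(model.parameters())  # assumes the Sophia class from this repo

x, y = torch.randn(32, 10), torch.randn(32, 1)

opt.zero_grad()
loss = F.mse_loss(model(x), y)
# create_graph=True keeps the backward graph, so each p.grad has
# requires_grad=True and the optimizer's HVP can differentiate through it.
loss.backward(create_graph=True)
opt.step()
```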


bbbxyz commented May 25, 2023

FYI https://github.com/Liuhong99/Sophia

kyegomez (Owner) commented:

I've upgraded it; now try upgrading with pip 😊

Kingsleyandher commented:

Did you solve this issue? I'm also hitting the same problem in Megatron...
