Hello, I ran into an error when using the Sophia optimizer to train GPT-3 with Megatron. The problem is that the `grad` passed into the optimizer does not carry `requires_grad=True`, so the second derivative cannot be computed. Do you know how to solve this?
File "/root/miniconda3/envs/torch18/lib/python3.7/site-packages/torch/autograd/__init__.py", line 277, in grad allow_unused, accumulate_grad=False) # Calls into the C++ engine to run the backward pass RuntimeError: element 0 of tensors does not require grad and does not have a grad_fn.
class HutchinsonEstimator(HessianEstimator):
    def estimate(self, p, grad):
        u = torch.randn_like(grad)
        grad_dot_u = torch.sum(grad * u)
        print(f"grad_dot_u requires grad: {grad_dot_u.requires_grad}")  # -> False
        # ↓ RuntimeError: element 0 of tensors does not require grad and does not have a grad_fn.
        hessian_vector_product = torch.autograd.grad(
            grad_dot_u, p, retain_graph=True)[0]
        return u * hessian_vector_product
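A sketch of the usual fix, under the assumption that the gradients can be produced inside the training step (the names `model`, `loss_fn`, `batch`, and `estimator` are hypothetical stand-ins, not Megatron APIs): the backward pass that produces `grad` has to be run with `create_graph=True`, so the gradient stays attached to the graph and the Hutchinson estimator can differentiate it again.

```python
import torch

def hutchinson_step(model, loss_fn, batch, estimator):
    # `model`, `loss_fn`, `batch`, and `estimator` are placeholders for the
    # real Megatron objects; this only illustrates the create_graph=True idea.
    loss = loss_fn(model, batch)
    params = [p for p in model.parameters() if p.requires_grad]

    # create_graph=True keeps the gradients attached to the autograd graph,
    # so each grad has a grad_fn and can be differentiated a second time.
    grads = torch.autograd.grad(loss, params, create_graph=True)

    hessian_estimates = [estimator.estimate(p, g) for p, g in zip(params, grads)]
    return loss, grads, hessian_estimates
```

Note that Megatron's mixed-precision and distributed-optimizer paths typically copy or all-reduce gradients into separate buffers before the optimizer sees them, so the `grad` handed to Sophia may already be detached even if the backward used `create_graph=True`. In that case the Hessian-vector product has to be computed from the gradients returned by `torch.autograd.grad` directly (as above), or you could consider Sophia's Gauss-Newton-Bartlett estimator, which only needs an extra first-order backward pass.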