Can we only replace part of nn.Linear with te.Linear and others keep unchanged? #1595

zigzagcai · 2025-03-20T11:22:52Z

No description provided.

pggPL · 2025-03-20T12:45:13Z

I'm not sure what you mean - if you want to run some Linear layers in fp8 and the rest in higher precision, or you want to run for example forward in fp8 and backward in high precision. Both of this scenarios will be possible when this PR will be merged (hopefully this week).

zigzagcai · 2025-03-20T14:47:30Z

I'm not sure what you mean - if you want to run some Linear layers in fp8 and the rest in higher precision, or you want to run for example forward in fp8 and backward in high precision. Both of this scenarios will be possible when this PR will be merged (hopefully this week).

Thank you! I mean run some layers in fp8 and other's in high precision.

ptrendx · 2025-03-24T16:56:46Z

Yes, you can do that. You can either just leave some layers as nn.Linear or you can nest the fp8_autocast context manager, something like this:

with fp8_autocast(enabled=True):
    y = te_linear1(x)  # will compute in FP8
    with fp8_autocast(enabled=False):
        z = te_linear2(y)  # will compute in high precision

lengerfulluse · 2025-03-27T17:58:09Z

Both of this scenarios will be possible when this #1441 will be merged (hopefully this week).

Hi @pggPL , looks the original PR has been closed and split into 4 PRs. May i know when can we expected these changes been merged into TE?

pggPL · 2025-03-31T09:09:05Z

I want to merge them as soon as possible, there was temporal shortage of reviewers due to other deadlines with higher priority, but I hope it will be merged soon.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Can we only replace part of nn.Linear with te.Linear and others keep unchanged? #1595

Can we only replace part of nn.Linear with te.Linear and others keep unchanged? #1595

zigzagcai commented Mar 20, 2025

pggPL commented Mar 20, 2025

zigzagcai commented Mar 20, 2025

ptrendx commented Mar 24, 2025

lengerfulluse commented Mar 27, 2025

pggPL commented Mar 31, 2025

Can we only replace part of nn.Linear with te.Linear and others keep unchanged? #1595

Can we only replace part of nn.Linear with te.Linear and others keep unchanged? #1595

Comments

zigzagcai commented Mar 20, 2025

pggPL commented Mar 20, 2025

zigzagcai commented Mar 20, 2025

ptrendx commented Mar 24, 2025

lengerfulluse commented Mar 27, 2025

pggPL commented Mar 31, 2025