
Conversation


@Bruce-x-1997 commented Sep 18, 2025

What does this PR do?

Support W4AFP8 quantization for DeepSeek-V3.1 (UE8M0 scale format).

Usage

Set `gemm_impl` to `fp8` and use `config_v3.1.json`.
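
For orientation, here is a minimal sketch of the dispatch that flag selects in the DeepSeek-V3 reference inference code. This is paraphrased, not the upstream implementation: the real `act_quant` and FP8 GEMM are Triton kernels, and the plain-PyTorch stand-ins below (including the 448.0 E4M3 max and the per-tensor scale) are illustrative assumptions:

```python
import torch
import torch.nn.functional as F

# Sketch of what gemm_impl controls (paraphrased from DeepSeek-V3's
# inference code; stand-ins only, not the real Triton kernels).
gemm_impl = "fp8"  # this PR's usage note; "bf16" would skip activation quant

def act_quant(x: torch.Tensor, scale_fmt: str | None = None):
    """Stand-in: per-tensor FP8 quantization; ue8m0 handling is sketched further down."""
    scale = x.abs().max().float() / 448.0  # 448 = max finite float8_e4m3fn
    return (x / scale).to(torch.float8_e4m3fn), scale

def linear(x: torch.Tensor, weight: torch.Tensor) -> torch.Tensor:
    if gemm_impl == "bf16":
        return F.linear(x, weight)            # dequantized, high-precision path
    x_q, s = act_quant(x, scale_fmt="ue8m0")  # the V3.1 config selects ue8m0
    return F.linear(x_q.float() * s, weight)  # stand-in for the FP8 GEMM kernel

print(linear(torch.randn(2, 64), torch.randn(32, 64)).shape)  # torch.Size([2, 32])
```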

Testing

After applying this patch, our model (DeepSeek-V3.1 with W4AFP8) reaches 50% on AIME25 and 60% on AIME24.

Before your PR is "Ready for review"

  • Make sure you read and follow Contributor guidelines and your commits are signed.
  • Is this change backward compatible?: Yes/No
  • Did you write any new necessary tests?: Yes/No
  • Did you add or update any necessary documentation?: Yes/No
  • Did you update Changelog?: Yes/No

Additional Information

@Bruce-x-1997 requested a review from a team as a code owner, September 18, 2025 12:25

@Bruce-x-1997 (Author) commented:

@cjluo-nv please help review it, thanks

```diff
  assert weight_quantizer is None
  assert act_quantizer is None
- x, scale = act_quant(x, block_size)
+ x, scale = act_quant(x, block_size, scale_fmt)
```
Collaborator commented:

Could you walk through how updating the scale gives you W4A8?

In this case, what's the 4-bit weight? Is it NVFP4 or INT4?

Author commented:

Sorry, I didn't notice the comment earlier.
In my case, the 4-bit weight means INT4.
For the V3.1 case, when I tried to quantize the model there was an interface mismatch with deepseek-v3.git, so I fixed it here. @cjluo-nv
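
To make the mechanism concrete: the weights stay INT4 with their own per-group scales, while activations are quantized to FP8 at runtime, and `scale_fmt="ue8m0"` constrains each activation scale to a power of two. Below is a self-contained sketch of that scale handling; the real `act_quant` in DeepSeek-V3 is a Triton kernel, so the plain-PyTorch body, the 448.0 E4M3 max, and the block layout here are illustrative assumptions:

```python
import torch

E4M3_MAX = 448.0  # largest finite value representable in float8_e4m3fn

def act_quant_sketch(x: torch.Tensor, block_size: int = 128, scale_fmt: str | None = None):
    """Per-block FP8 activation quantization (illustration, not the Triton kernel)."""
    blocks = x.reshape(-1, block_size)
    scale = blocks.abs().amax(dim=-1, keepdim=True).float() / E4M3_MAX
    if scale_fmt == "ue8m0":
        # UE8M0 = unsigned 8-bit exponent, 0 mantissa bits: the scale must be an
        # exact power of two. Round the exponent up so no block overflows E4M3.
        scale = torch.exp2(torch.ceil(torch.log2(scale)))
    y = (blocks / scale).to(torch.float8_e4m3fn)
    return y.reshape(x.shape), scale

x = torch.randn(4, 256)
_, s = act_quant_sketch(x, scale_fmt="ue8m0")
assert torch.all(s == torch.exp2(torch.log2(s).round()))  # scales are powers of two
```

The INT4 weight side is untouched by this diff; only the activation-scale format moves to UE8M0, matching the scale format DeepSeek-V3.1 was trained with.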

```
nvcr.io/nvidia/tensorrt-llm/release
```
Then we can run ModelOpt in the Docker pod as in the [TRT-LLM example](https://github.com/NVIDIA/TensorRT-LLM/blob/main/examples/models/core/deepseek_v3/README.md?plain=1).
Note that you should use the latest DeepSeek-V3.git rather than commit 1398800, which has a dtype bug in the bias proto.
Collaborator commented:

Would you recommend using a more recent commit?
