
Question about dtype check in marlin_qqq validation for w4a8 functionality #2115

Open
@xxw11

Description


Hi torchao developers,

Recently, while experimenting with the w4a8 functionality in torchao, I noticed that the marlin_qqq check function requires

`input_tensor.dtype == torch.float16`

This seems potentially problematic, as most modern models use bf16 or fp32 for activations. Forcing a downcast to float16 can introduce precision loss, or even overflow to inf/NaN, since float16 has a much narrower dynamic range than bf16 or fp32.
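To illustrate the overflow concern: float16's largest finite value is 65504, while bf16 and fp32 can represent far larger magnitudes, so a forced downcast can silently produce inf (and NaN in subsequent arithmetic). A minimal sketch using numpy (which lacks bfloat16, so fp32 stands in for the wider source dtype):

```python
import numpy as np

# A value that is finite in fp32 (and representable in bf16),
# but exceeds float16's maximum finite value of 65504.
x = np.float32(70000.0)

# Forced downcast, as a float16-only kernel path would require.
y = np.float16(x)

print(np.isinf(y))   # the value overflowed to inf
print(y - y)         # inf - inf produces nan in later ops
```

The same effect occurs in PyTorch with `tensor.to(torch.float16)` on bf16/fp32 activations whose magnitudes exceed the float16 range.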

Could you clarify if this dtype check is strictly necessary? Are there specific constraints or optimizations that depend on float16 here?

Thank you for your insights!

