You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In the case of very small numbers input numbers around the subnormal range of torch.float or torch.bfloat16, the scale exponent will take its smallest unbiased value: -127. However, you only allow division with a scale of 2**-126 in line 143 of mx_tensor.py. This is because of an incompatibility with triton.
However, during dequantization you use the the smaller scale of 2**-127 when calling