
Remove set fp32 math mode & increase tolerance #1860

Merged

Conversation

ZzEeKkAa (Contributor)

Partially resolves #1769 by removing the FP32 math-mode change and increasing the absolute tolerance.
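For context, a minimal sketch of the kind of change being described; the tensor shapes, the import of the tutorial's `matmul` wrapper, and the tolerance values are illustrative assumptions, not taken from the PR diff:

```python
import torch

# Hypothetical import of the tutorial's block-pointer matmul wrapper
# (signature matmul(a, b, accum_dtype, res_dtype) as shown in the diff below).
from tutorial_10 import matmul

torch.manual_seed(0)
a = torch.randn((512, 512), device="xpu", dtype=torch.float32)
b = torch.randn((512, 512), device="xpu", dtype=torch.float32)

# Removed: IPEX-only API that forced the torch reference matmul into TF32.
# torch.xpu.set_fp32_math_mode(torch.xpu.utils.FP32MathMode.TF32)

triton_out = matmul(a, b, torch.float32, torch.float32)
torch_out = torch.matmul(a, b)

# Instead, compare against the full-precision torch result with a wider
# absolute tolerance that absorbs the TF32-vs-FP32 difference (values assumed).
torch.testing.assert_close(triton_out, torch_out, atol=1e-2, rtol=0)
```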

whitneywhtsang requested review from jopperm and a team on August 12, 2024
FMarno (Contributor) commented Aug 14, 2024

I'm not really sure why this is happening and it seems weird to me. Do you know if the test ever passed and when it stopped passing?

ZzEeKkAa (Contributor, Author)

> I'm not really sure why this is happening and it seems weird to me. Do you know if the test ever passed and when it stopped passing?

It works with IPEX, but it has never worked with upstream PyTorch.

whitneywhtsang requested review from etiotto and a team on August 14, 2024
vlad-penkin (Contributor)

Review thread on the removed tutorial line:

@@ -327,7 +327,6 @@ def matmul(a, b, accum_dtype, res_dtype):
     # Still we can test our matrix multiplication with block pointers against a native torch implementation (i.e., cuBLAS).

     torch.manual_seed(0)
-    torch.xpu.set_fp32_math_mode(torch.xpu.utils.FP32MathMode.TF32)
Contributor

This should also work when using upstream PyTorch. I do not think we should be fixing the tutorial.

ZzEeKkAa (Contributor, Author)

@guangyey's point is to use this workaround until the feature is implemented upstream.

guangyey commented Aug 15, 2024

Hi @etiotto, set_fp32_math_mode is not yet upstreamed to stock PyTorch, and we need time to redesign this API in line with other backends. So I personally recommend that @ZzEeKkAa use this workaround until we complete the API.
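As an aside, if a script needed to keep the IPEX path working while tolerating its absence on stock PyTorch, an availability guard would be one option. This is a sketch only; the PR instead removes the call outright:

```python
import torch

# torch.xpu.set_fp32_math_mode comes from IPEX and is not yet part of stock
# PyTorch's torch.xpu module, so guard the call instead of assuming it exists.
if hasattr(torch.xpu, "set_fp32_math_mode"):
    torch.xpu.set_fp32_math_mode(torch.xpu.utils.FP32MathMode.TF32)
else:
    # Stock PyTorch: keep the default FP32 math mode and rely on a wider
    # tolerance when comparing against the TF32 Triton kernel.
    pass
```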

Contributor

This tutorial is not currently present upstream.

etiotto (Contributor) commented Aug 16, 2024

> Hi @etiotto, set_fp32_math_mode is not yet upstreamed to stock PyTorch, and we need time to redesign this API in line with other backends. So I personally recommend that @ZzEeKkAa use this workaround until we complete the API.

OK. We can put this in to unblock the work of migrating to PyTorch (instead of IPEX). @ZzEeKkAa please add a FIXME in the code and open an issue so that once PyTorch supports set_fp32_math_mode we can go back and revert this change. Once that is done I will be able to approve the PR. Thanks.

Contributor

Going by Julian's comment, we might also want to add a ticket to support true FP32 matmul. That will of course have the downside of not using DPAS, so it will be slow by default (not ideal, I know).
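For reference, recent Triton versions let a kernel request the dot-product precision per call (older releases expose an `allow_tf32` flag instead). Below is a compact sketch of a matmul kernel asking for IEEE FP32; whether the XPU backend honors this, and at what cost relative to DPAS, is exactly what the proposed ticket would need to answer. Block sizes are assumed to divide the problem sizes, so no masking is done:

```python
import triton
import triton.language as tl


@triton.jit
def matmul_fp32_kernel(a_ptr, b_ptr, c_ptr, M, N, K,
                       stride_am, stride_ak, stride_bk, stride_bn,
                       stride_cm, stride_cn,
                       BLOCK_M: tl.constexpr, BLOCK_N: tl.constexpr,
                       BLOCK_K: tl.constexpr):
    pid_m = tl.program_id(0)
    pid_n = tl.program_id(1)
    offs_m = pid_m * BLOCK_M + tl.arange(0, BLOCK_M)
    offs_n = pid_n * BLOCK_N + tl.arange(0, BLOCK_N)
    offs_k = tl.arange(0, BLOCK_K)
    a_ptrs = a_ptr + offs_m[:, None] * stride_am + offs_k[None, :] * stride_ak
    b_ptrs = b_ptr + offs_k[:, None] * stride_bk + offs_n[None, :] * stride_bn
    acc = tl.zeros((BLOCK_M, BLOCK_N), dtype=tl.float32)
    for _ in range(0, K, BLOCK_K):
        a = tl.load(a_ptrs)
        b = tl.load(b_ptrs)
        # "ieee" requests true FP32 accumulation instead of TF32; on hardware
        # whose DPAS units take TF32 inputs this implies a slower FMA path.
        acc = tl.dot(a, b, acc, input_precision="ieee")
        a_ptrs += BLOCK_K * stride_ak
        b_ptrs += BLOCK_K * stride_bk
    c_ptrs = c_ptr + offs_m[:, None] * stride_cm + offs_n[None, :] * stride_cn
    tl.store(c_ptrs, acc)
```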

Contributor

@ZzEeKkAa Please create the two issues described above.

Contributor

Ping. @ZzEeKkAa, are the two issues mentioned above opened? Links?

ZzEeKkAa (Contributor, Author)

@whitneywhtsang @etiotto I've just opened the issues and updated the PR with a FIXME comment.
#1956
#1957
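The added FIXME presumably looks roughly like the following; the exact wording, and which of the two issues tracks the API gap, are assumptions rather than quotes from the PR:

```python
# FIXME: torch.xpu.set_fp32_math_mode is not yet available in upstream PyTorch.
# Revert the tolerance workaround and restore the call below once it lands
# (see issues #1956 / #1957).
# torch.xpu.set_fp32_math_mode(torch.xpu.utils.FP32MathMode.TF32)
```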

jopperm (Contributor) commented Aug 15, 2024

I think you should at least add a comment explaining why the (torch.float32, torch.float32, torch.float32) test case needs a higher tolerance (the Triton kernel uses TF32 precision, while the reference data is computed with full FP32 precision).

Alternatively, could you detect whether IPEX is available and deactivate this particular test until upstream support is available?
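Both suggestions are easy to sketch. The helper names, tolerance values, and skip marker below are assumptions for illustration, not code from this repository:

```python
import importlib.util

import pytest
import torch

HAS_IPEX = importlib.util.find_spec("intel_extension_for_pytorch") is not None


def tolerance_for(dtype: torch.dtype) -> float:
    # The Triton kernel evaluates the FP32 case in TF32 (10 mantissa bits),
    # while torch.matmul on stock PyTorch keeps full FP32 precision, so the
    # comparison needs a looser absolute tolerance there (values assumed).
    if dtype is torch.float32 and not HAS_IPEX:
        return 1e-2
    return 1e-4


# The alternative: skip the FP32 case entirely until upstream PyTorch grows
# an equivalent of set_fp32_math_mode.
requires_ipex = pytest.mark.skipif(
    not HAS_IPEX,
    reason="needs torch.xpu.set_fp32_math_mode (IPEX-only for now)")
```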

vlad-penkin (Contributor)

> I think you should at least add a comment explaining why the (torch.float32, torch.float32, torch.float32) test case needs a higher tolerance (the Triton kernel uses TF32 precision, while the reference data is computed with full FP32 precision).
>
> Alternatively, could you detect whether IPEX is available and deactivate this particular test until upstream support is available?

The alternative is not an option. The dependency on IPEX will be fully removed from our code base within a week to a week and a half.

anmyachev added a commit that referenced this pull request on Aug 16, 2024

Closes #1840

CI status on PyTorch main with no-op IPEX:
https://github.com/intel/intel-xpu-backend-for-triton/actions/runs/10422253663

For rolling, there is the following error (tutorial 10):
* `AttributeError: module 'torch.xpu' has no attribute 'set_fp32_math_mode'` (should be fixed by #1860)

For LTS:
* `Failed to create Level Zero tracer: 2013265921` (instrumentation/test_gpuhello.py)

Signed-off-by: Anatoly Myachev <[email protected]>
FMarno (Contributor) commented Aug 19, 2024

I think this is a good enough solution for now, but we should create an issue to actually support FP32 precision through the dot operation.

ZzEeKkAa force-pushed the fix/remove_math_mode_in_10th_tutorial branch from aeae30d to 75b3ff7 on August 20, 2024
etiotto merged commit 2d317a5 into intel:llvm-target on Aug 21, 2024
4 checks passed
Successfully merging this pull request may close these issues.

[Upstream Pytorch] Tutorial 10 fails with error - module 'torch.xpu' has no attribute 'set_fp32_math_mode'
7 participants