
[PyTorch] Run all tests, even if one fails. #1501

Merged: 5 commits merged into NVIDIA:main on Feb 24, 2025

Conversation

@pggPL (Collaborator) commented Feb 21, 2025

Description

The PyTorch test scripts currently exit as soon as one testing script returns a non-zero status, so the remaining test scripts are not run. This PR changes that: all test scripts are run, and a non-zero status is returned if at least one of them failed.

I considered other designs, such as running the whole directory at once with pytest, but some scripts need to be run with their own flags. So I ended up with the design proposed by @timmoon10.
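
The resulting pattern is roughly the following (a minimal sketch; the `set +e` line and the flag initialization are illustrative assumptions, while TE_PATH, the test paths, and the `|| FAIL=1` suffix come from the quoted scripts): each pytest invocation records a failure in a shared flag instead of terminating the script, and that flag becomes the script's exit status.

# Minimal sketch of the accumulate-failures pattern (illustrative only).
set +e                                      # assumption: keep going after a failing command
FAIL=0
pytest -v -s $TE_PATH/tests/pytorch/distributed/test_numerics.py    || FAIL=1
pytest -v -s $TE_PATH/tests/pytorch/distributed/test_fusible_ops.py || FAIL=1
# ... remaining test scripts, each suffixed with "|| FAIL=1" ...
exit $FAIL                                  # non-zero if at least one suite failed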

Type of change

  • Documentation change (changes only to the documentation, either a fix or new content)
  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • Infra/Build change
  • Code refactoring

Checklist:

  • I have read and followed the contributing guidelines
  • The functionality is complete
  • I have commented my code, particularly in hard-to-understand areas
  • I have made corresponding changes to the documentation
  • My changes generate no new warnings
  • I have added tests that prove my fix is effective or that my feature works
  • New and existing unit tests pass locally with my changes

pggPL and others added 4 commits on February 21, 2025 at 17:51
Signed-off-by: Pawel Gadzinski <[email protected]>
@pggPL requested a review from timmoon10 on February 21, 2025 at 18:22
@pggPL (Collaborator, Author) commented Feb 21, 2025

/te-ci pytorch L1

@pggPL marked this pull request as ready for review on February 21, 2025 at 18:33
@timmoon10 (Collaborator) left a comment:

LGTM

pytest -v -s $TE_PATH/tests/pytorch/distributed/test_numerics.py || FAIL=1
pytest -v -s $TE_PATH/tests/pytorch/distributed/test_fusible_ops.py || FAIL=1
pytest -v -s $TE_PATH/tests/pytorch/distributed/test_torch_fsdp2.py || FAIL=1
pytest -v -s $TE_PATH/tests/pytorch/distributed/test_comm_gemm_overlap.py || FAIL=1
# pytest -v -s $TE_PATH/tests/pytorch/distributed/test_fusible_ops_with_userbuffers.py ### TODO Debug UB support with te.Sequential
Suggested change
# pytest -v -s $TE_PATH/tests/pytorch/distributed/test_fusible_ops_with_userbuffers.py ### TODO Debug UB support with te.Sequential
# pytest -v -s $TE_PATH/tests/pytorch/distributed/test_fusible_ops_with_userbuffers.py || FAIL=1 ### TODO Debug UB support with te.Sequential

@timmoon10 added the testing label (Improvements to tests or testing infrastructure) on Feb 21, 2025
@ksivaman (Member) left a comment:

LGTM

@pggPL merged commit 229dd04 into NVIDIA:main on Feb 24, 2025
11 checks passed
Labels: testing (Improvements to tests or testing infrastructure)
3 participants