Skip to content

[release/3.2.2](feat) Sync verified commits from dev to release/3.2.2#807

Merged
WuTYSFG merged 7 commits into
triton-lang:release/3.2.2from
SUZUKODAZE:merging_dev_0624
Jun 25, 2026
Merged

[release/3.2.2](feat) Sync verified commits from dev to release/3.2.2#807
WuTYSFG merged 7 commits into
triton-lang:release/3.2.2from
SUZUKODAZE:merging_dev_0624

Conversation

@SUZUKODAZE

Copy link
Copy Markdown

PR Title:
release/3.2.2 Sync verified commits from dev to release/3.2.2

Description:

This PR synchronizes the following verified commits from the dev branch to the release/3.2.2 branch. All commits have passed CI verification and are ready for integration.

Synchronized Commits:

Commit Subject Author
[CI] update default code owners (#754) hongziqi
[ssbuffer] nd2nz matmul C padding (#670) lrf22
[ssbuffer] fix tensor.empty+linalg.fill pos (#759) cxtverygood
[ssbuffer] fix some code warnings (#764) zhuxinguang33
[ssbuffer] Modification of potential code safety issues (#769) Hsyy04
[TileChunkCoalescing] chooseH exceeds maxH causing UB overflow (#779) Popeyesxs
[ssbuffer] process yield op in scf.if (#476) lrf22

Summary of Changes:

  • Bug Fixes:
    • Fixed nd2nz matmul C padding issue in ssbuffer
    • Fixed tensor.empty and linalg.fill position issues
    • Fixed various code warnings in ssbuffer
    • Fixed potential code safety issues in ssbuffer
    • Fixed Two bugs in the H selection logic
    • Fixed yield operation handling in scf.if for ssbuffer
    • Skip ND2NZ MatmulC for non-zero matrices requiring padding
  • Documentation: Updated default code owners in CI
    All changes have been cherry-picked from the dev branch and verified locally. No conflicts were encountered during integration.

hongziqi and others added 6 commits June 25, 2026 17:11
…riton-lang#779)

Two bugs in the H selection logic:

1. The two-loop search in chooseH did not bound the first loop by maxH.
   When numTiles = 2 * p (p prime, e.g. 262142 = 2 * 131071), the smallest
   divisor >= hMin is p itself, vastly exceeding the UB-derived maxH.
   Fix: single downward search from maxH to hMin.

2. The maxH clamp (if maxH < 2, force maxH = 2) could override the UB
   budget conclusion. When footprintUnit is large enough that even H=2
   overflows UB, the clamp forced coalescing anyway.
   Fix: bail out (return without coalescing) when maxH < 2.

Also remove redundant comments throughout the file.

Co-authored-by: xudezheng <xudezheng1@huawei.com>
* [ssbuffer](fix) process yield op in scf.if

* [ssbuffer](fix) standardize attr name in SplitDataflow
@github-actions

Copy link
Copy Markdown

📂 Code Owners — the following files under code ownership were modified.

Please review:

@KanuaK

  • .github/CODEOWNERS
  • third_party/ascend/include/DynamicCVPipeline/AddControlFlowCondition/UpdateConditionInfo.h
  • third_party/ascend/include/DynamicCVPipeline/AddControlFlowCondition/Utils.h
  • third_party/ascend/include/DynamicCVPipeline/Common/Utils.h
  • third_party/ascend/include/DynamicCVPipeline/PlanComputeBlock/PlanCubeBlockPass.h
  • third_party/ascend/include/DynamicCVPipeline/SplitDataflow/InterCoreTransferAndSync.h
  • third_party/ascend/include/DynamicCVPipeline/SplitDataflow/Utils.h
  • third_party/ascend/lib/DynamicCVPipeline/AddControlFlowCondition.cpp
  • third_party/ascend/lib/DynamicCVPipeline/AddControlFlowCondition/InitDependentMap.cpp
  • third_party/ascend/lib/DynamicCVPipeline/AddControlFlowCondition/UpdateConditionInfo.cpp
  • third_party/ascend/lib/DynamicCVPipeline/AddControlFlowCondition/Utils.cpp
  • third_party/ascend/lib/DynamicCVPipeline/AllocMultiCache/AddMultiBufferInnerScope.cpp
  • third_party/ascend/lib/DynamicCVPipeline/Common/Utils.cpp
  • third_party/ascend/lib/DynamicCVPipeline/ComputeBlockOpt/Common.cpp
  • third_party/ascend/lib/DynamicCVPipeline/ComputeBlockOpt/FixpipeOptPass.cpp
  • third_party/ascend/lib/DynamicCVPipeline/ComputeBlockOpt/UnifyAllocBlockPass.cpp
  • third_party/ascend/lib/DynamicCVPipeline/PlanComputeBlock/ComputeBlockIdManager.cpp
  • third_party/ascend/lib/DynamicCVPipeline/PlanComputeBlock/PlanCubeBlock.cpp
  • third_party/ascend/lib/DynamicCVPipeline/SplitDataflow/AddBlockIdForControlOps.cpp
  • third_party/ascend/lib/DynamicCVPipeline/SplitDataflow/DataDependencyAnalysis.cpp
  • third_party/ascend/lib/DynamicCVPipeline/SplitDataflow/InterCoreTransferAndSync.cpp
  • third_party/ascend/lib/DynamicCVPipeline/SplitDataflow/MarkMainLoop.cpp
  • third_party/ascend/lib/DynamicCVPipeline/SplitDataflow/Utils.cpp
  • third_party/ascend/lib/TritonToLinalg/TileChunkCoalescing.cpp
  • third_party/ascend/unittest/Conversion/General/DynamicCVPipeline/SplitDataflow/test_yield_if_dependencies.mlir

@WuTYSFG

  • .github/CODEOWNERS
  • third_party/ascend/include/DynamicCVPipeline/AddControlFlowCondition/UpdateConditionInfo.h
  • third_party/ascend/include/DynamicCVPipeline/AddControlFlowCondition/Utils.h
  • third_party/ascend/include/DynamicCVPipeline/Common/Utils.h
  • third_party/ascend/include/DynamicCVPipeline/PlanComputeBlock/PlanCubeBlockPass.h
  • third_party/ascend/include/DynamicCVPipeline/SplitDataflow/InterCoreTransferAndSync.h
  • third_party/ascend/include/DynamicCVPipeline/SplitDataflow/Utils.h
  • third_party/ascend/lib/DynamicCVPipeline/AddControlFlowCondition.cpp
  • third_party/ascend/lib/DynamicCVPipeline/AddControlFlowCondition/InitDependentMap.cpp
  • third_party/ascend/lib/DynamicCVPipeline/AddControlFlowCondition/UpdateConditionInfo.cpp
  • third_party/ascend/lib/DynamicCVPipeline/AddControlFlowCondition/Utils.cpp
  • third_party/ascend/lib/DynamicCVPipeline/AllocMultiCache/AddMultiBufferInnerScope.cpp
  • third_party/ascend/lib/DynamicCVPipeline/Common/Utils.cpp
  • third_party/ascend/lib/DynamicCVPipeline/ComputeBlockOpt/Common.cpp
  • third_party/ascend/lib/DynamicCVPipeline/ComputeBlockOpt/FixpipeOptPass.cpp
  • third_party/ascend/lib/DynamicCVPipeline/ComputeBlockOpt/UnifyAllocBlockPass.cpp
  • third_party/ascend/lib/DynamicCVPipeline/PlanComputeBlock/ComputeBlockIdManager.cpp
  • third_party/ascend/lib/DynamicCVPipeline/PlanComputeBlock/PlanCubeBlock.cpp
  • third_party/ascend/lib/DynamicCVPipeline/SplitDataflow/AddBlockIdForControlOps.cpp
  • third_party/ascend/lib/DynamicCVPipeline/SplitDataflow/DataDependencyAnalysis.cpp
  • third_party/ascend/lib/DynamicCVPipeline/SplitDataflow/InterCoreTransferAndSync.cpp
  • third_party/ascend/lib/DynamicCVPipeline/SplitDataflow/MarkMainLoop.cpp
  • third_party/ascend/lib/DynamicCVPipeline/SplitDataflow/Utils.cpp
  • third_party/ascend/lib/TritonToLinalg/TileChunkCoalescing.cpp
  • third_party/ascend/unittest/Conversion/General/DynamicCVPipeline/SplitDataflow/test_yield_if_dependencies.mlir

@hongziqi

  • .github/CODEOWNERS
  • third_party/ascend/include/DynamicCVPipeline/AddControlFlowCondition/UpdateConditionInfo.h
  • third_party/ascend/include/DynamicCVPipeline/AddControlFlowCondition/Utils.h
  • third_party/ascend/include/DynamicCVPipeline/Common/Utils.h
  • third_party/ascend/include/DynamicCVPipeline/PlanComputeBlock/PlanCubeBlockPass.h
  • third_party/ascend/include/DynamicCVPipeline/SplitDataflow/InterCoreTransferAndSync.h
  • third_party/ascend/include/DynamicCVPipeline/SplitDataflow/Utils.h
  • third_party/ascend/lib/DynamicCVPipeline/AddControlFlowCondition.cpp
  • third_party/ascend/lib/DynamicCVPipeline/AddControlFlowCondition/InitDependentMap.cpp
  • third_party/ascend/lib/DynamicCVPipeline/AddControlFlowCondition/UpdateConditionInfo.cpp
  • third_party/ascend/lib/DynamicCVPipeline/AddControlFlowCondition/Utils.cpp
  • third_party/ascend/lib/DynamicCVPipeline/AllocMultiCache/AddMultiBufferInnerScope.cpp
  • third_party/ascend/lib/DynamicCVPipeline/Common/Utils.cpp
  • third_party/ascend/lib/DynamicCVPipeline/ComputeBlockOpt/Common.cpp
  • third_party/ascend/lib/DynamicCVPipeline/ComputeBlockOpt/FixpipeOptPass.cpp
  • third_party/ascend/lib/DynamicCVPipeline/ComputeBlockOpt/UnifyAllocBlockPass.cpp
  • third_party/ascend/lib/DynamicCVPipeline/PlanComputeBlock/ComputeBlockIdManager.cpp
  • third_party/ascend/lib/DynamicCVPipeline/PlanComputeBlock/PlanCubeBlock.cpp
  • third_party/ascend/lib/DynamicCVPipeline/SplitDataflow/AddBlockIdForControlOps.cpp
  • third_party/ascend/lib/DynamicCVPipeline/SplitDataflow/DataDependencyAnalysis.cpp
  • third_party/ascend/lib/DynamicCVPipeline/SplitDataflow/InterCoreTransferAndSync.cpp
  • third_party/ascend/lib/DynamicCVPipeline/SplitDataflow/MarkMainLoop.cpp
  • third_party/ascend/lib/DynamicCVPipeline/SplitDataflow/Utils.cpp
  • third_party/ascend/lib/TritonToLinalg/TileChunkCoalescing.cpp
  • third_party/ascend/unittest/Conversion/General/DynamicCVPipeline/SplitDataflow/test_yield_if_dependencies.mlir

🤖 Generated by labeler workflow

@WuTYSFG WuTYSFG merged commit fd1cee7 into triton-lang:release/3.2.2 Jun 25, 2026
9 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

8 participants