Skip to content

Conversation

@PaulZhang12
Copy link
Contributor

@PaulZhang12 PaulZhang12 commented Oct 15, 2025

Stacked PRs:


Co-author: @yf225

Epilogue Subtiling

Add it as an opt-in feature currently, as support for complex epilogues (such as loading a bias + adding to accumulator) is difficult and not currently supported. Furthermore, most kernels do not require epilogue subtiling, as it is generally useful for GEMMs in which the accumulator lives in TMEM for B200.

GEMM CI exhibits ~4% gain, epilogue_subtiling=[2] is often picked as the final config, 0.88x with subtiling, 0.84x without
image

PaulZhang12 added a commit that referenced this pull request Oct 15, 2025
stack-info: PR: #948, branch: PaulZhang12/stack/14
@meta-cla meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Oct 15, 2025
PaulZhang12 added a commit that referenced this pull request Oct 15, 2025
stack-info: PR: #948, branch: PaulZhang12/stack/14
PaulZhang12 added a commit that referenced this pull request Oct 15, 2025
stack-info: PR: #948, branch: PaulZhang12/stack/14
PaulZhang12 added a commit that referenced this pull request Oct 15, 2025
stack-info: PR: #948, branch: PaulZhang12/stack/14
Copy link
Contributor

@jansel jansel left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Does this help with matmul perf?

PaulZhang12 added a commit that referenced this pull request Oct 16, 2025
stack-info: PR: #948, branch: PaulZhang12/stack/14
PaulZhang12 added a commit that referenced this pull request Oct 17, 2025
stack-info: PR: #948, branch: PaulZhang12/stack/14
PaulZhang12 added a commit that referenced this pull request Oct 20, 2025
stack-info: PR: #948, branch: PaulZhang12/stack/14
PaulZhang12 added a commit that referenced this pull request Oct 20, 2025
stack-info: PR: #948, branch: PaulZhang12/stack/14
@PaulZhang12 PaulZhang12 changed the base branch from main to PaulZhang12/stack/16 October 20, 2025 19:20
@PaulZhang12 PaulZhang12 changed the base branch from PaulZhang12/stack/16 to main October 20, 2025 19:22
PaulZhang12 added a commit that referenced this pull request Oct 20, 2025
stack-info: PR: #948, branch: PaulZhang12/stack/14
@PaulZhang12 PaulZhang12 changed the base branch from main to PaulZhang12/stack/16 October 20, 2025 19:22
@PaulZhang12 PaulZhang12 changed the base branch from PaulZhang12/stack/16 to main October 20, 2025 19:24
PaulZhang12 added a commit that referenced this pull request Oct 20, 2025
stack-info: PR: #948, branch: PaulZhang12/stack/14
@PaulZhang12 PaulZhang12 changed the base branch from main to PaulZhang12/stack/16 October 20, 2025 19:25
@PaulZhang12 PaulZhang12 changed the base branch from PaulZhang12/stack/16 to main October 20, 2025 19:28
PaulZhang12 added a commit that referenced this pull request Oct 22, 2025
stack-info: PR: #948, branch: PaulZhang12/stack/14
PaulZhang12 added a commit that referenced this pull request Oct 27, 2025
stack-info: PR: #948, branch: PaulZhang12/stack/14
PaulZhang12 added a commit that referenced this pull request Oct 30, 2025
stack-info: PR: #948, branch: PaulZhang12/stack/14
PaulZhang12 added a commit that referenced this pull request Oct 30, 2025
stack-info: PR: #948, branch: PaulZhang12/stack/14
@jansel
Copy link
Contributor

jansel commented Nov 2, 2025

Any perf data on this one?

PaulZhang12 added a commit that referenced this pull request Nov 3, 2025
stack-info: PR: #948, branch: PaulZhang12/stack/14
PaulZhang12 added a commit that referenced this pull request Nov 5, 2025
stack-info: PR: #948, branch: PaulZhang12/stack/14
PaulZhang12 added a commit that referenced this pull request Nov 5, 2025
stack-info: PR: #948, branch: PaulZhang12/stack/14
PaulZhang12 added a commit that referenced this pull request Nov 5, 2025
stack-info: PR: #948, branch: PaulZhang12/stack/14
PaulZhang12 added a commit that referenced this pull request Nov 5, 2025
stack-info: PR: #948, branch: PaulZhang12/stack/14
PaulZhang12 added a commit that referenced this pull request Nov 5, 2025
stack-info: PR: #948, branch: PaulZhang12/stack/14
PaulZhang12 added a commit that referenced this pull request Nov 5, 2025
stack-info: PR: #948, branch: PaulZhang12/stack/14
PaulZhang12 added a commit that referenced this pull request Nov 5, 2025
stack-info: PR: #948, branch: PaulZhang12/stack/14
PaulZhang12 added a commit that referenced this pull request Nov 5, 2025
stack-info: PR: #948, branch: PaulZhang12/stack/14
stack-info: PR: #948, branch: PaulZhang12/stack/14
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed This label is managed by the Meta Open Source bot.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants