Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Performance] Estimate the cost of Atomic ops with the MMA layout. #2459

Open
LiyangLingIntel opened this issue Oct 10, 2024 · 0 comments
Open

Comments

@LiyangLingIntel
Copy link
Contributor

To address the comment #2312 (comment).

To fix the issue that in certain gemm cases, ConvertLayoutOp requires oversized shared local memory. In #2312, we use Atomic with mma layout to eliminate ConvertLayoutOp.

In this issue, we need to estimate the cost of Atomic ops with the MMA layout, compare it with the cost of ConvertLayout + AtomicRMW. With the knowledge about the performance difference, we can decide if a cost module needed or not in that pass.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants