[TLE][HCU] Support alloc, copy, local_ptr and pipeline #729

Open
alexshuang wants to merge 2 commits into flagos-ai:triton_v3.6.x from alexshuang:tle_basic

@alexshuang

No description provided.

@sunnycase

Collaborator

Thanks for the work here. Before this is finalized, could you please add a concise summary of the TLE primitive implementation plan?

It would be helpful to cover the main design points, such as the abstraction/lowering flow, compiler/runtime integration points, supported operator scope, dtype/shape/backend limitations, and the validation approach. Could you also include performance data for a few representative operators, ideally with baseline vs. TLE primitive numbers, test shapes, hardware/backend configuration, and measurement methodology?

For the expected level of detail and presentation style, PR #617 could be a useful reference.
