Triton v3.6.x iluvatar backend and 5 TLE primitives support#724
Triton v3.6.x iluvatar backend and 5 TLE primitives support#724Salamanca001 wants to merge 2 commits into
Conversation
|
|
| if os.environ.get("FLAGTREE_BACKEND") == "iluvatar": | ||
| triton_pkg_dir = repo_root / "third_party" / "iluvatar" / "python" / "triton" | ||
| else: | ||
| triton_pkg_dir = repo_root / "python" / "triton" |
There was a problem hiding this comment.
| flagtree_add_tle_generated_header_dependencies() | ||
| endif() | ||
| add_subdirectory(test) | ||
| elif (FLAGTREE_BACKEND STREQUAL "iluvatar") |
There was a problem hiding this comment.
elseif(FLAGTREE_BACKEND STREQUAL "iluvatar")
sunnycase
left a comment
There was a problem hiding this comment.
Thanks for the work here. Before this is finalized, could you please add a concise summary of the TLE primitive implementation plan?
It would be helpful to cover the main design points, such as the abstraction/lowering flow, compiler/runtime integration points, supported operator scope, dtype/shape/backend limitations, and the validation approach. Could you also include performance data for a few representative operators, ideally with baseline vs. TLE primitive numbers, test shapes, hardware/backend configuration, and measurement methodology?
For the expected level of detail and presentation style, PR #617 could be a useful reference: #617
This PR brings the Iluvatar backend support onto Triton 3.6 in FlagTree and adds Iluvatar TLE lowering support.
Included commits:
7b4cac885[BACKEND] update iluvatar backend support on triton3.6.13abd15d8[TLE][ILUVATAR] Add TLE support for alloc, local_ptr, copy, extract_tile and insert_tile.Main changes:
third_party/iluvatarbackend integration, including compiler/driver entry points, Iluvatar GPU dialect, lowering passes, target info, utility code, build wiring, and test runner.alloc,local_ptr,copy,extract_tile, andinsert_tile.