-
Notifications
You must be signed in to change notification settings - Fork 1.7k
feat: Minor improvements to build.py #8362
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
1e2cbd3
to
f940474
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Pull Request Overview
This PR enhances the build.py
script with four new command-line flags to increase build flexibility, particularly for development workflows and external contributors working with forks.
- Adds
--no-container-cache
flag to disable Docker cache during container builds - Adds
--default-repo-tag
flag to override calculated default repository tags globally - Adds
--use-buildbase
flag to use the temporary "buildbase" image for backend builds - Extends
--backend
syntax to support custom organizations/repositories
Reviewed Changes
Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.
File | Description |
---|---|
build.py | Implements the four new flags and their associated logic, including regex parsing for backend syntax and image mapping updates |
docs/customization_guide/build.md | Documents the new --use-buildbase flag and extended backend syntax with organization support |
Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.
pattern = r"(https?:\/\/[^\s:]+)|:" | ||
parts = list(filter(None,re.split(pattern, be))) |
Copilot
AI
Sep 17, 2025
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The regex pattern and splitting logic is complex and may be difficult to maintain. Consider extracting this parsing logic into a separate function with clear documentation about the expected input format and returned structure.
Copilot uses AI. Check for mistakes.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Agreed. My followup branch is not quite finished yet, but involves a more thorough redesign that will avoid the need for any custom parsing.
for be in FLAGS.backend: | ||
parts = be.split(":") | ||
pattern = r"(https?:\/\/[^\s:]+)|:" | ||
parts = list(filter(None,re.split(pattern, be))) |
Copilot
AI
Sep 17, 2025
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Missing space after comma in function call. Should be filter(None, re.split(pattern, be))
.
parts = list(filter(None,re.split(pattern, be))) | |
parts = list(filter(None, re.split(pattern, be))) |
Copilot uses AI. Check for mistakes.
Thanks for the contribution! I know we've been distracted, but we're doing our best to catch back up. This looks good to me. @mc-nv is the expert here. I believe he's out of office until next week, hopefully after he's back and the dust settled, I can get his review here. |
Despite change is looks trivial it will touch each and every docker build scenario. I would defer. |
@mc-nv I am working on finishing a branch with more involved changes to try to increase consistency in the command-line interface and make it easier to add new options. I had submitted this PR first because the feature additions were relatively minor and generally useful on their own, separate from the more disruptive reorganization. I am happy to perform any necessary validation of this PR or the next one to the extent that I can. Please let me know what is required. |
Also, if you prefer just to deal with one PR because of the intensive validation process, it is fine with me to wait until the larger PR is ready. |
To apply this change we need to make sure that change doesn't break any instructions which we have in our build process. I'm short on resource capacity to cover all the internal requests, I understand the intention, but current change doesn't show the full picture. From my point of view it's a whole story action, that will touch multiple points including CMake configuration. Idea to add flex in build to source any forked repo may be worth attention, but we need to follow up on it as part of the milestone as well as it may lead to the unexpected behavior. There could be pros and cons, @nvda-mesharma and @dmitry-tokarev-nv can make a decision on this effort. |
I have submitted the larger revamp PR at #8437. I understand this will take some time to review and validate. I am happy to help out in whatever way I can as an external (but longtime) contributor. |
(I am leaving this one open because, if #8437 proves not to be acceptable, I would still like to incorporate the minor features added here.) |
What does the PR do?
This PR adds three new flags and updates one flag in
build.py
. These changes all go toward increasing build flexibility.--no-container-cache
, which propagates todocker --no-cache
.--default-repo-tag <tag>
to override the calculated default value, which is not always appropriate. For example, when trying to build a dev version (currently 25.08), the upstream container version is set to the previous version (25.07), which takes precedence in the calculated default value, but using the corresponding dev versions of component and backend repositories may be intended. Rather than having to override it for each repo, it is useful to be able to override it globally.--use-buildbase
to use the temporary "buildbase" image as the "base" image for backends that need it (e.g. onnxruntime).--backend
syntax to<backend-name>[:<repo-tag>][:<org>]
to allow specifying a different organization/repository, in addition to a different tag/branch. This is useful for external contributors developing in forks.Checklist
Agreement
<commit_type>: <Title>
pre-commit install, pre-commit run --all
)Commit Type:
Check the conventional commit type
box here and add the label to the github PR.
Related PRs:
N/A
Where should the reviewer start?
Changes only affect build.py and associated documentation.
Test plan:
These changes only affect the build script, so there is no impact on server functionality.
An example of using some of these options:
in branch
r25.08
(without these changes):output:
Building Triton Inference Server
platform linux
machine x86_64
version 2.60.0dev
build dir /storage/local/data2/pedrok/sonic/server/build
install dir None
cmake dir None
default repo-tag: r25.07
container version 25.08dev
upstream container version 25.07
endpoint "http"
endpoint "grpc"
endpoint "sagemaker"
endpoint "vertex-ai"
filesystem "gcs"
filesystem "s3"
filesystem "azure_storage"
backend "ensemble" at tag/branch "r25.07"
backend "identity" at tag/branch "r25.07"
backend "square" at tag/branch "r25.07"
backend "repeat" at tag/branch "r25.07"
backend "onnxruntime" at tag/branch "r25.07"
backend "python" at tag/branch "r25.07"
backend "dali" at tag/branch "r25.07"
backend "pytorch" at tag/branch "r25.07"
backend "openvino" at tag/branch "r25.07"
backend "fil" at tag/branch "r25.07"
backend "tensorrt" at tag/branch "r25.07"
repoagent "checksum" at tag/branch "r25.07"
cache "local" at tag/branch "r25.07"
cache "redis" at tag/branch "r25.07"
component "common" at tag/branch "r25.07"
component "core" at tag/branch "r25.07"
component "backend" at tag/branch "r25.07"
component "thirdparty" at tag/branch "r25.07"
After merging this branch into
r25.08
, the following is possible:output:
Building Triton Inference Server
platform linux
machine x86_64
version 2.60.0dev
build dir /storage/local/data2/pedrok/sonic/server/build
install dir None
cmake dir None
default repo-tag: r25.08
container version 25.08dev
upstream container version 25.07
endpoint "http"
endpoint "grpc"
endpoint "sagemaker"
endpoint "vertex-ai"
filesystem "gcs"
filesystem "s3"
filesystem "azure_storage"
backend "onnxruntime" at tag/branch "r25.08_kjp" from org "https://github.com/kpedro88"
backend "ensemble" at tag/branch "r25.08" from org "https://github.com/triton-inference-server"
backend "identity" at tag/branch "r25.08" from org "https://github.com/triton-inference-server"
backend "square" at tag/branch "r25.08" from org "https://github.com/triton-inference-server"
backend "repeat" at tag/branch "r25.08" from org "https://github.com/triton-inference-server"
backend "python" at tag/branch "r25.08" from org "https://github.com/triton-inference-server"
backend "dali" at tag/branch "r25.08" from org "https://github.com/triton-inference-server"
backend "pytorch" at tag/branch "r25.08" from org "https://github.com/triton-inference-server"
backend "openvino" at tag/branch "r25.08" from org "https://github.com/triton-inference-server"
backend "fil" at tag/branch "r25.08" from org "https://github.com/triton-inference-server"
backend "tensorrt" at tag/branch "r25.08" from org "https://github.com/triton-inference-server"
repoagent "checksum" at tag/branch "r25.08"
cache "local" at tag/branch "r25.08"
cache "redis" at tag/branch "r25.08"
component "common" at tag/branch "r25.08"
component "core" at tag/branch "r25.08"
component "backend" at tag/branch "r25.08"
component "thirdparty" at tag/branch "r25.08"
Caveats:
The syntax for changing the backend org could be more elegant, and the feature is not extended to other components besides backends. I am planning a more thorough improvement to
build.py
that will address this in a followup PR, but it will be a more involved change and is still in progress.Background
These changes were useful in building a
25.08dev
server to incorporate #8335, which includes an important bug fix and is not yet in a released version.Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)
N/A