[Feature] Add DeleteWorkersOnFailure deletion policy for RayJob #2765
Comments
cc @andrewsykim any thoughts?
Having both policies probably makes sense. I'm in favor of a new policy like
On second thought, I realized that users may have more combinations. For example,
There are two solutions:
Marking this issue as v1.3.0 because we need to decide on the API before the release.
These are really good considerations. Since we put the feature behind an alpha feature gate, I feel fine about breaking the API in v1.4 if needed.
We can consider an API like this as well:
OK, let's update the API in v1.4.0.
Search before asking
Description
DeleteWorkersOnFailure: Deletes workers only when the Ray job fails, and deletes the entire RayCluster when the Ray job succeeds. This seems to be a more common pattern for users.
Should we add this policy, or rename DeleteWorkers to DeleteWorkersOnFailure? Does it need to be in v1.3.0?
Use case
No response
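To make the semantics in the Description concrete, here is a minimal Go sketch of how the two policies would differ. It is only an illustration: the type names, the Action values, and the resolveAction helper are hypothetical and are not taken from the KubeRay codebase. It also assumes that DeleteWorkers deletes workers regardless of the job outcome, which the issue implies but does not state outright.

```go
package main

import "fmt"

// DeletionPolicy uses the two policy names from this issue.
// Hypothetical type for illustration, not KubeRay's actual API type.
type DeletionPolicy string

const (
	DeleteWorkers          DeletionPolicy = "DeleteWorkers"
	DeleteWorkersOnFailure DeletionPolicy = "DeleteWorkersOnFailure" // proposed in this issue
)

// Action is a hypothetical description of what the controller would clean up.
type Action string

const (
	ActionDeleteWorkers Action = "delete workers only"
	ActionDeleteCluster Action = "delete entire RayCluster"
	ActionNone          Action = "no cleanup"
)

// resolveAction maps a policy and the job outcome to a cleanup action.
// Assumption: DeleteWorkers ignores the job outcome.
func resolveAction(policy DeletionPolicy, jobSucceeded bool) Action {
	switch policy {
	case DeleteWorkers:
		return ActionDeleteWorkers
	case DeleteWorkersOnFailure:
		if jobSucceeded {
			// On success nothing needs to be kept around for debugging.
			return ActionDeleteCluster
		}
		// On failure only the workers are removed.
		return ActionDeleteWorkers
	default:
		return ActionNone
	}
}

func main() {
	for _, p := range []DeletionPolicy{DeleteWorkers, DeleteWorkersOnFailure} {
		for _, ok := range []bool{true, false} {
			fmt.Printf("policy=%s succeeded=%t -> %s\n", p, ok, resolveAction(p, ok))
		}
	}
}
```

The only case where the two policies differ is a successful job, which is why the question of adding a policy versus renaming the existing one is an API decision rather than a purely cosmetic one.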
Related issues
No response
Are you willing to submit a PR?