-
Notifications
You must be signed in to change notification settings - Fork 1.2k
chore(apis): unpublish deprecated /v1/inference apis #3297
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
chore(apis): unpublish deprecated /v1/inference apis #3297
Conversation
…recated inference to openai-compat
|
@raghotham @ashwinb this can go into the 0.3 release and the impls can be removed later |
| @@ -1,187 +0,0 @@ | |||
| # Copyright (c) Meta Platforms, Inc. and affiliates. | |||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
why was this removed?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
it was only testing with the removed api
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@mattf but then would the tests be meaningful if you converted the calls to the openai equivalents?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ported these over.
| @@ -1,545 +0,0 @@ | |||
| # Copyright (c) Meta Platforms, Inc. and affiliates. | |||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
we had a bunch of tests here. just quickly checking if we have coverage for the openai endpoints in place?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ported most of these over. skipped the llama-4 one.
What does this PR do?
unpublish (make unavailable to users) the following apis -
/v1/inference/completion, replaced by/v1/openai/v1/completions/v1/inference/chat-completion, replaced by/v1/openai/v1/chat/completions/v1/inference/embeddings, replaced by/v1/openai/v1/embeddings/v1/inference/batch-completion, replaced by/v1/openai/v1/batches/v1/inference/batch-chat-completion, replaced by/v1/openai/v1/batchesnote: the implementations are still available for internal use, e.g. agents uses chat-completion.