-
Notifications
You must be signed in to change notification settings - Fork 72
[Example] Add vllm semantic router example #135
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
[Example] Add vllm semantic router example #135
Conversation
|
Great! I think the sleep and wakeup functionality has already been merged, right? @jiarong0907 |
I think the traffic monitoring and sleep management are tied with the router. Since we are using the vllm semantic router here instead of the controller/router, sleep management is unsupported. To enable it, we might need to integrate traffic monitoring and sleep management into the semantic router. |
|
@ztang2370 @cui36 Having vllm semantic router is great, but I would suggest we add it as a feature later. For the example, we can just use the current router we have. The example just needs to show the features of the router and sleeping. This will be the |
But my understanding is the features of routing and sleeping are already shown in controller, isn't it? |
Yes, but we need an end to end example to tell users how this works and can be used. |
Oh I see. Will update some info there today. |
Issue: #91
This is WIP, a basic working example.
TODO: