@ztang2370
Contributor

Issue: #91

This is a WIP: a basic working example.

TODO:

  1. Add the sleep and wakeup functionality, driven by the traffic monitoring status.
  2. The vLLM semantic router has no release yet, so the patches will need to be updated once a release is available.

@cui36
Collaborator

cui36 commented Sep 28, 2025

Great! I think the sleep and wakeup functionality has already been merged, right? @jiarong0907

@ztang2370
Contributor Author

ztang2370 commented Sep 29, 2025

> Great! I think the sleep and wakeup functionality has already been merged, right? @jiarong0907

I think traffic monitoring and sleep management are tied to the router. Since we are using the vLLM semantic router here instead of the controller/router, sleep management is unsupported. To enable it, we might need to integrate traffic monitoring and sleep management into the semantic router.

@jiarong0907
Collaborator

@ztang2370 @cui36 Having the vLLM semantic router is great, but I would suggest we add it as a feature later.

For the example, we can just use the current router we have. The example just needs to show the features of the router and sleeping. This will be the 03_model_router_sleep.

@ztang2370
Contributor Author

> @ztang2370 @cui36 Having the vLLM semantic router is great, but I would suggest we add it as a feature later.
>
> For the example, we can just use the current router we have. The example just needs to show the features of the router and sleeping. This will be the 03_model_router_sleep.

But my understanding is that the routing and sleeping features are already shown in the controller, aren't they?

@jiarong0907
Collaborator

> > @ztang2370 @cui36 Having the vLLM semantic router is great, but I would suggest we add it as a feature later.
> > For the example, we can just use the current router we have. The example just needs to show the features of the router and sleeping. This will be the 03_model_router_sleep.
>
> But my understanding is that the routing and sleeping features are already shown in the controller, aren't they?

Yes, but we need an end-to-end example to show users how this works and how it can be used.

@cui36
Collaborator

cui36 commented Sep 30, 2025

> > > @ztang2370 @cui36 Having the vLLM semantic router is great, but I would suggest we add it as a feature later.
> > > For the example, we can just use the current router we have. The example just needs to show the features of the router and sleeping. This will be the 03_model_router_sleep.
> >
> > But my understanding is that the routing and sleeping features are already shown in the controller, aren't they?
>
> Yes, but we need an end-to-end example to show users how this works and how it can be used.

Oh I see. Will update some info there today.


3 participants