Commit

Merge branch 'BerriAI:main' into main
hughcrt authored Jan 14, 2025
2 parents 5d81162 + 293bb4c commit a885d8b
Showing 2 changed files with 17 additions and 7 deletions.
22 changes: 16 additions & 6 deletions docs/my-website/docs/benchmarks.md
@@ -18,13 +18,15 @@ model_list:
api_key: "test"
```
-## 1 Instance LiteLLM Proxy
+### 1 Instance LiteLLM Proxy
In these tests the median latency of directly calling the fake-openai-endpoint is 60ms.
| Metric | Litellm Proxy (1 Instance) |
|--------|------------------------|
-| Median Latency (ms) | 110 |
-| RPS | 250 |
+| RPS | 475 |
+| Median Latency (ms) | 100 |
| Latency overhead added by LiteLLM Proxy | 40ms |
<!-- <Image img={require('../img/1_instance_proxy.png')} /> -->
@@ -33,8 +35,9 @@ model_list:
<Image img={require('../img/instances_vs_rps.png')} /> -->
#### Key Findings
-- Single instance: 250 RPS @ 100ms latency
-- 4 LiteLLM instances: 1000 RPS @ 100ms latency
+- Single instance: 475 RPS @ 100ms latency
+- 2 LiteLLM instances: 950 RPS @ 100ms latency
+- 4 LiteLLM instances: 1900 RPS @ 100ms latency
### 2 Instances
@@ -43,8 +46,15 @@ model_list:
| Metric | Litellm Proxy (2 Instances) |
|--------|------------------------|
| Median Latency (ms) | 100 |
-| RPS | 500 |
+| RPS | 950 |


+## Machine Spec used for testing
+
+Each machine deploying LiteLLM had the following specs:
+
+- 2 CPU
+- 4GB RAM
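
Review note: the updated figures in `benchmarks.md` can be cross-checked with a little arithmetic. The sketch below is not part of the commit; the function and constant names are hypothetical, and it assumes throughput scales linearly with the number of instances, which is what the Key Findings list implies but the diff does not state outright.

```typescript
// Figures copied from the updated benchmarks.md in this commit.
const DIRECT_MEDIAN_LATENCY_MS = 60; // fake-openai-endpoint called directly
const PROXY_MEDIAN_LATENCY_MS = 100; // same call routed through LiteLLM Proxy
const RPS_PER_INSTANCE = 475; // single-instance throughput

// Overhead attributed to the proxy: proxy median minus direct median.
function latencyOverheadMs(proxyMs: number, directMs: number): number {
  return proxyMs - directMs;
}

// Assumption (not stated in the docs): RPS grows linearly with instance count.
function expectedRps(
  instances: number,
  rpsPerInstance: number = RPS_PER_INSTANCE,
): number {
  if (!Number.isInteger(instances) || instances < 1) {
    throw new RangeError("instances must be a positive integer");
  }
  return instances * rpsPerInstance;
}

const overhead = latencyOverheadMs(
  PROXY_MEDIAN_LATENCY_MS,
  DIRECT_MEDIAN_LATENCY_MS,
); // 40, matching the "Latency overhead" row
const twoInstances = expectedRps(2); // 950, matching the 2-instance table
const fourInstances = expectedRps(4); // 1900, matching the Key Findings list
```

Under that linear assumption the new numbers are internally consistent: 100ms minus 60ms gives the 40ms overhead, and 475 RPS per instance gives 950 and 1900 RPS for 2 and 4 instances.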



2 changes: 1 addition & 1 deletion ui/litellm-dashboard/src/components/view_key_table.tsx
@@ -494,7 +494,7 @@ const ViewKeyTable: React.FC<ViewKeyTableProps> = ({
className="mt-8"
label="RPM Limit (requests per minute)"
name="rpm_limit"
-help={`rpm_limit cannot exceed team max tpm_limit: ${keyTeam?.rpm_limit !== null && keyTeam?.rpm_limit !== undefined ? keyTeam?.rpm_limit : 'unlimited'}`}
+help={`rpm_limit cannot exceed team max rpm_limit: ${keyTeam?.rpm_limit !== null && keyTeam?.rpm_limit !== undefined ? keyTeam?.rpm_limit : 'unlimited'}`}
rules={[
{
validator: async (_, value) => {
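
Review note: the corrected help text in `view_key_table.tsx` could be expressed more compactly with the nullish coalescing operator. This is only a sketch of the string-building logic, not code from the commit; `TeamLimits` and `rpmLimitHelp` are hypothetical names, and the interface lists only the one field the message reads.

```typescript
// Hypothetical minimal shape: only the field used by the help message.
interface TeamLimits {
  rpm_limit?: number | null;
}

// Builds the same message as the `help` prop in the diff.
// `??` falls back only on null or undefined, like the explicit checks there,
// so a limit of 0 is still shown as "0" rather than "unlimited".
function rpmLimitHelp(keyTeam?: TeamLimits | null): string {
  const limit = keyTeam?.rpm_limit ?? "unlimited";
  return `rpm_limit cannot exceed team max rpm_limit: ${limit}`;
}

const withLimit = rpmLimitHelp({ rpm_limit: 100 });
const withoutTeam = rpmLimitHelp(undefined);
```

Pulling the message into a helper like this would also make it easy to unit-test that the label and the value both refer to `rpm_limit`, which is the mismatch this commit fixes.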
