Commit 4e84c84

enable istio as a provider + configuring destinationRule
Signed-off-by: greg pereira <[email protected]>
1 parent 32970c0 commit 4e84c84

3 files changed: +33 -2 lines changed

config/charts/inferencepool/README.md

Lines changed: 2 additions & 2 deletions
@@ -16,7 +16,7 @@ To install via the latest published chart in staging (--version v0 indicates la
 ```txt
 $ helm install vllm-llama3-8b-instruct \
 --set inferencePool.modelServers.matchLabels.app=vllm-llama3-8b-instruct \
---set provider.name=[none|gke] \
+--set provider.name=[none|gke|istio] \
 oci://us-central1-docker.pkg.dev/k8s-staging-images/gateway-api-inference-extension/charts/inferencepool --version v0
 ```

@@ -95,7 +95,7 @@ Use `--set inferencePool.modelServerType=triton-tensorrt-llm` to install for Tri
 $ helm install triton-llama3-8b-instruct \
 --set inferencePool.modelServers.matchLabels.app=triton-llama3-8b-instruct \
 --set inferencePool.modelServerType=triton-tensorrt-llm \
---set provider.name=[none|gke] \
+--set provider.name=[none|gke|istio] \
 oci://us-central1-docker.pkg.dev/k8s-staging-images/gateway-api-inference-extension/charts/inferencepool --version v0
 ```

Lines changed: 19 additions & 0 deletions
@@ -0,0 +1,19 @@
+{{- if eq .Values.provider.name "istio" }}
+---
+{{- if .Values.istio.destinationRule.enabled }}
+apiVersion: networking.istio.io/v1beta1
+kind: DestinationRule
+metadata:
+  name: {{ include "gateway-api-inference-extension.name" . }}
+spec:
+  host: {{ .Values.istio.destinationRule.host | default (printf "%s.%s.svc.cluster.local" (include "gateway-api-inference-extension.name" .) .Release.Namespace) }}
+  trafficPolicy:
+    tls:
+      mode: SIMPLE
+      insecureSkipVerify: true
+    {{- if .Values.istio.destinationRule.trafficPolicy.connectionPool }}
+    connectionPool:
+      {{- .Values.istio.destinationRule.trafficPolicy.connectionPool | toYaml | nindent 6 }}
+    {{- end }}
+{{- end }}
+{{- end }}

config/charts/inferencepool/values.yaml

Lines changed: 12 additions & 0 deletions
@@ -67,6 +67,7 @@ inferencePool:
   # This will soon be deprecated when upstream GW providers support v1, just doing something simple for now.
   targetPortNumber: 8000
 
+# Options: ["gke", "istio", "none"]
 provider:
   name: none
 
@@ -75,3 +76,14 @@ provider:
   gke:
     # Set to true if the cluster is an Autopilot cluster.
     autopilot: false
+
+istio:
+  destinationRule:
+    enabled: true
+    # Provide a way to override the default calculated host
+    host: ""
+    # Optional: Enables customization of the traffic policy
+    trafficPolicy: {}
+      # connectionPool:
+      #   http:
+      #     maxRequestsPerConnection: 256000
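
For illustration only, a values override that exercises the options added in this commit might look like the sketch below. It assumes the keys exactly as they appear in the diff. The file name `istio-values.yaml` is hypothetical, and the connection-pool figure is copied from the commented example rather than recommended.

```yaml
# istio-values.yaml (hypothetical file name, not part of the commit)
provider:
  name: istio              # selects the Istio-only template added in this commit

istio:
  destinationRule:
    enabled: true          # set to false to skip rendering the DestinationRule
    # Left empty so the template falls back to its computed default,
    # <chart name>.<release namespace>.svc.cluster.local
    host: ""
    trafficPolicy:
      # Copied into the rendered DestinationRule under trafficPolicy.connectionPool
      connectionPool:
        http:
          maxRequestsPerConnection: 256000   # value from the commented example
```

Such a file would be passed to the `helm install` commands shown in the README hunks with `-f istio-values.yaml`, in place of `--set provider.name=...`. Note that the template always renders `tls.mode: SIMPLE` with `insecureSkipVerify: true`; neither setting is exposed through these values.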
