
Commit 16239ad

fix: Atchernych/5673959 docs (#4482)
Signed-off-by: Anna Tchernych <[email protected]>
1 parent 96afba9 commit 16239ad

2 files changed: +3 −2 lines changed


recipes/README.md

Lines changed: 2 additions & 1 deletion
@@ -176,12 +176,13 @@ For Llama-3-70B with vLLM (Aggregated), an example of integration with the Infer
 
 Follow [Deploy Inference Gateway Section 2](../deploy/inference-gateway/README.md#2-deploy-inference-gateway) to install GAIE, then apply the manifests.
 Update the containers.epp.image in the deployment file, i.e. llama-3-70b/vllm/agg/gaie/k8s-manifests/epp/deployment.yaml
+This should be the same image you have used for your deployment.
 
 ```bash
 export DEPLOY_PATH=llama-3-70b/vllm/agg/
 #DEPLOY_PATH=<model>/<framework>/<mode>/
 kubectl apply -R -f "$DEPLOY_PATH/gaie/k8s-manifests" -n "$NAMESPACE"
-
+
 ```
 
 ### DeepSeek-R1 on GB200 (Multi-node)
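The step above asks you to edit containers.epp.image by hand before applying the manifests. Purely as an illustrative sketch (not part of this commit), the same step can be scripted; the snippet below assumes yq v4 is available, that the epp container is the first entry under containers in the manifest, and that MY_IMAGE is the image you already used for your deployment.

```bash
# Illustrative only: set the epp image to match your deployment, then apply.
# Assumptions: yq v4 is installed, the epp container is containers[0],
# and MY_IMAGE / NAMESPACE are set to your own values.
export DEPLOY_PATH=llama-3-70b/vllm/agg/
export MY_IMAGE=nvcr.io/nvidia/ai-dynamo/vllm-runtime:my-tag   # example tag from this commit

# Point containers.epp.image at the same image used for the deployment.
yq -i '.spec.template.spec.containers[0].image = strenv(MY_IMAGE)' \
  "$DEPLOY_PATH/gaie/k8s-manifests/epp/deployment.yaml"

# Apply the GAIE manifests, exactly as in the README.
kubectl apply -R -f "$DEPLOY_PATH/gaie/k8s-manifests" -n "$NAMESPACE"
```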

recipes/llama-3-70b/vllm/agg/gaie/k8s-manifests/epp/deployment.yaml

Lines changed: 1 addition & 1 deletion
@@ -38,7 +38,7 @@ spec:
 
       containers:
       - name: epp
-        image: nvcr.io/nvstaging/ai-dynamo/dynamo-frontend:0.7.0rc2-amd64
+        image: nvcr.io/nvidia/ai-dynamo/vllm-runtime:my-tag
         imagePullPolicy: IfNotPresent
         resources:
           requests:
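After applying, it can help to confirm that the epp container actually picked up the new image. A minimal verification sketch, assuming the Deployment created by this manifest is named epp (the name is an assumption, not stated in the diff) and runs in $NAMESPACE:

```bash
# Assumption: the Deployment is named "epp"; adjust to your manifest's metadata.name.
# Print the image currently set on the epp container.
kubectl -n "$NAMESPACE" get deployment epp \
  -o jsonpath='{.spec.template.spec.containers[?(@.name=="epp")].image}'
echo

# Wait for the rollout with the updated image to finish.
kubectl -n "$NAMESPACE" rollout status deployment/epp
```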
