Skip to content

Commit b63a7ff

Browse files
committed
docs: add doc for low cost deployment (closes #215)
1 parent 6f073d9 commit b63a7ff

File tree

2 files changed

+36
-0
lines changed

2 files changed

+36
-0
lines changed

README.md

Lines changed: 5 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -27,6 +27,7 @@
2727
- [Resources](#resources)
2828
- [FAQ](#faq)
2929
- [Troubleshooting](#troubleshooting)
30+
- [Reduce deployment costs](#reduce-deployment-costs)
3031
- [Note](#note)
3132

3233
[![Open in GitHub Codespaces](https://img.shields.io/static/v1?style=for-the-badge&label=GitHub+Codespaces&message=Open&color=brightgreen&logo=github)](https://github.com/codespaces/new?hide_repo_select=true&ref=main&repo=684521881&machine=standardLinux32gb&devcontainer_path=.devcontainer%2Fdevcontainer.json&location=WestUs2)
@@ -426,6 +427,10 @@ Here are the most common failure scenarios and solutions:
426427

427428
1. You're getting an error `401 Principal does not have access to API/Operation` while running the project locally or trying to deploy. That's likely because your environment variables include `AZURE_TENANT_ID`, `AZURE_CLIENT_ID` and `AZURE_CLIENT_SECRET`. You should either grant permissions to the related _Service Principal_ or remove these variables from your environment to ensure normal access. For more details, please refer to [Azure identity SDK](https://github.com/Azure/azure-sdk-for-js/blob/main/sdk/identity/identity/README.md#defaultazurecredential).
428429

430+
### Reduce deployment costs
431+
432+
See [this doc for reducing costs](./docs/low-cost.md).
433+
429434
### Note
430435

431436
> Note: The documents used in this demo contain information generated using a language model (Azure OpenAI Service). The information contained in these documents is only for demonstration purposes and does not reflect the opinions or beliefs of Microsoft. Microsoft makes no representations or warranties of any kind, express or implied, about the completeness, accuracy, reliability, suitability or availability with respect to the information contained in this document. All rights reserved to Microsoft.

docs/low-cost.md

Lines changed: 31 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,31 @@
1+
# Reduce deployment costs
2+
3+
This AI RAG chat application is designed to be easily deployed using the Azure Developer CLI, which provisions the infrastructure according to the Bicep files in the `infra` folder. Those files describe each of the Azure resources needed, and configures their SKU (pricing tier) and other parameters. Many Azure services offer a free tier, but the infrastructure files in this project do *not* default to the free tier as there are often limitations in that tier.
4+
5+
However, if your goal is to minimize costs while prototyping your application, follow the steps below *before* running `azd up`. Once you've gone through these steps, return to the [deployment steps](../README.md#azure-deployment).
6+
7+
1. Log in to your Azure account using the Azure Developer CLI:
8+
9+
```shell
10+
azd auth login
11+
```
12+
13+
1. Create a new azd environment for the free resource group:
14+
15+
```shell
16+
azd env new
17+
```
18+
19+
Enter a name that will be used for the resource group.
20+
This will create a new folder in the `.azure` folder, and set it as the active environment for any calls to `azd` going forward.
21+
22+
1. Use the free tier of Azure AI Search:
23+
24+
```shell
25+
azd env set AZURE_SEARCH_SERVICE_SKU free
26+
```
27+
28+
Limitations:
29+
1. You are only allowed one free search service across all regions.
30+
2. The free tier does not support semantic ranker. Note that will generally result in [decreased search relevance](https://techcommunity.microsoft.com/blog/azure-ai-services-blog/azure-ai-search-outperforming-vector-search-with-hybrid-retrieval-and-ranking-ca/3929167).
31+

0 commit comments

Comments
 (0)