Skip to content

Commit ed3391b

Browse files
Merge pull request #6541 from EnterpriseDB/release/2025-02-24a
Release: 2025-02-24a
2 parents 13146d9 + e84bf42 commit ed3391b

File tree

10 files changed

+105
-29
lines changed

10 files changed

+105
-29
lines changed

advocacy_docs/edb-postgres-ai/ai-accelerator/models/openai-api-compatibility.mdx

+4-7
Original file line numberDiff line numberDiff line change
@@ -22,8 +22,7 @@ The starting point for this process is creating a model. When you create a model
2222
select aidb.create_model(
2323
'my_local_ollama',
2424
'embeddings',
25-
'{"model":"llama3.1", "url":"http://llama.local:11434/v1/embeddings", "dimensions":2000}'::JSONB,
26-
'{"api_key":""}'::JSONB);
25+
'{"model":"llama3.1", "url":"http://llama.local:11434/v1/embeddings", "dimensions":2000}'::JSONB);
2726
```
2827

2928
### Model name and model provider
@@ -42,21 +41,19 @@ The next parameter is the configuration. This is a JSON string, which when expan
4241

4342
In this case, we are setting the model to [“llama3.3”](https://ollama.com/library/llama3.3), a relatively new and powerful model. Remember to run `ollama run llama3.3` to pull and start the model on the server.
4443

45-
The next json setting is the important one, overriding the endpoint that the aidb model will use.
44+
The next json setting is the important one, overriding the endpoint that the aidb model will use.
4645

4746
* Our server is running on a machine called `llama.local`.
4847
* It has port 11434 (the default port for Ollama) open to service requests over HTTP (not HTTPS in this case).
4948
* The path to the endpoint on the server `/v1/embeddings`; the same as OpenAI.
5049

51-
Putting those components together we get `[`http://llama.local:11434/v1/embeddings`](http://art.local:11434/v1/embeddings","api_key":"","dimensions":2000}'::JSONB)` as our end point.
50+
Putting those components together we get `http://llama.local:11434/v1/embeddings` as our end point.
5251

5352
The last JSON parameter in this example is “dimensions” which is a hint to the system about how many vector values to expect from the model. If we [look up llama3.3’s properties](https://ollama.com/library/llama3.3/blobs/4824460d29f2) we can see the `llama.embedding_length` value is 8192. The provider defaults to 1536 (with some hard-wired exceptions depending on model) but it doesn’t know about llama3.3's max. Another factor is [pgvector is limited to 2000 dimensions](https://github.com/pgvector/pgvector?tab=readme-ov-file#what-if-i-want-to-index-vectors-with-more-than-2000-dimensions). So we pass a dimension value of 2000 in the configuration, to get the maximum dimensions available with pgvector.
5453

5554
That completes the configuration parameter.
5655

57-
### Credentials
58-
59-
The last parameter is the credentials parameter, which is another JSON string. It’s usually used for carrying the `api_key` for the OpenAI service and any other necessary credential information. It is not part of the configuration and by being separate, it can be securely hidden from users with lesser permissions. For our ollama connection, we don’t need an `api_key`, but the model provider currently requires that one is specified. We can specify an empty string for the `api_key` to satisfy this requirement.
56+
If the endpoint requires an API key, that would be passed in the credentials parameter. As this is a local model, we don’t need to pass any credentials.
6057

6158
## Using the model
6259

advocacy_docs/edb-postgres-ai/ai-accelerator/models/supported-models/completions.mdx

+2-1
Original file line numberDiff line numberDiff line change
@@ -99,6 +99,7 @@ The following configuration settings are available for OpenAI models:
9999

100100
## Model credentials
101101

102-
The following credentials are required for these models:
102+
The following credentials may be required by the service providing these models:
103103

104104
* `api_key` - The API key to use for authentication.
105+

advocacy_docs/edb-postgres-ai/ai-accelerator/models/supported-models/embeddings.mdx

+1-1
Original file line numberDiff line numberDiff line change
@@ -76,6 +76,6 @@ The following configuration settings are available for OpenAI models:
7676

7777
## Model credentials
7878

79-
The following credentials are required for OpenAI models:
79+
The following credentials may be required by the service providing these models:
8080

8181
* `api_key` - The OpenAI API key to use for authentication.

advocacy_docs/edb-postgres-ai/ai-accelerator/models/using-models.mdx

+14-14
Original file line numberDiff line numberDiff line change
@@ -46,20 +46,20 @@ You can also find out what model providers are available by running the followin
4646
```sql
4747
SELECT * FROM aidb.model_providers;
4848
__OUTPUT__
49-
server_name | server_options
50-
--------------------+----------------
51-
t5_local |
52-
embeddings |
53-
completions |
54-
openai_embeddings |
55-
openai_completions |
56-
nim_completions |
57-
nim_embeddings |
58-
nim_clip |
59-
nim_reranking |
60-
bert_local |
61-
clip_local |
62-
dummy |
49+
server_name | server_description | server_options
50+
--------------------+------------------------------------------------------------------------------------------------------+----------------
51+
t5_local | A simple language model, ideal for translation, summarization, and question answering. Runs locally. |
52+
embeddings | For any model that implements the OpenAI embeddings API. |
53+
completions | For any model that implements the OpenAI completions API. |
54+
openai_embeddings | For any embeddings model on the OpenAI platform |
55+
openai_completions | For any completions model on the OpenAI platform. |
56+
nim_completions | For any model that implements the OpenAI completions API on NVIDIA NIM. |
57+
nim_embeddings | For any model that implements the OpenAI embeddings API on NVIDIA NIM. |
58+
nim_clip | Vision/text model, ideal for encoding text and images, runs on NVIDIA NIM. |
59+
nim_reranking | Reranking model, runs on NVIDIA NIM. |
60+
bert_local | Simple language model, ideal for encoding text, runs locally. |
61+
clip_local | A vision/text model, ideal for encoding text and images, runs locally. |
62+
dummy | Provides fake data, ideal for testing. |
6363
(12 rows)
6464
```
6565

advocacy_docs/edb-postgres-ai/ai-accelerator/reference/models.mdx

+5-4
Original file line numberDiff line numberDiff line change
@@ -13,10 +13,11 @@ This section provides reference documentation for EDB Postgres AI AI Accelerator
1313

1414
The `aidb.model_providers` table stores information about the model providers that have been created in the database.
1515

16-
| Column | Type | Description |
17-
|----------------|----------|-------------------------------|
18-
| server_name | name | Name for the model server. |
19-
| server_options | text\[\] | Options for the model server. |
16+
| Column | Type | Description |
17+
|--------------------|----------|----------------------------------|
18+
| server_name | name | Name for the model server. |
19+
| server_description | text | Description of the model server. |
20+
| server_options | text\[\] | Options for the model server. |
2021

2122

2223
## Functions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,35 @@
1+
---
2+
title: AI Accelerator - Pipelines 2.1.2 release notes
3+
navTitle: Version 2.1.2
4+
originalFilePath: advocacy_docs/edb-postgres-ai/ai-accelerator/rel_notes/src/rel_notes_2.1.2.yml
5+
editTarget: originalFilePath
6+
---
7+
8+
Released: 25 February 2025
9+
10+
In this maintenance release, we fix two bugs and improve the model provider listing output.
11+
12+
## Highlights
13+
14+
- Model provider output now contains descriptions of the providers.
15+
16+
## Enhancements
17+
18+
<table class="table w-100"><thead><tr><th>Description</th><th width="10%">Addresses</th></tr></thead><tbody>
19+
<tr><td><details><summary>Added model provider descriptions</summary><hr/><p>The model provider view <code>select * from aidb.model_providers;</code> now includes a description column detailing the purpose and capabilities of each model provider.</p>
20+
</details></td><td></td></tr>
21+
</tbody></table>
22+
23+
24+
## Bug Fixes
25+
26+
<table class="table w-100"><thead><tr><th>Description</th><th width="10%">Addresses</th></tr></thead><tbody>
27+
<tr><td><details><summary>Fixed the volume listing command</summary><hr/><p>The <code>aidb.list_volumes();</code> call is now functional. A bug prevented this call from succeeding (without affecting any other functionality).</p>
28+
</details></td><td></td></tr>
29+
<tr><td><details><summary>Fixed the search path handling</summary><hr/><p>Some internal commands used to append <code>aidb</code> to the search_path. This failed in certain cases depending on the contents of the existing search path. We changed the implementation to no longer modify the search_path in order to avoid this issue. This also means that the existing search_path no longer gets modified by AIDB.</p>
30+
</details></td><td></td></tr>
31+
<tr><td><details><summary>Credentials are now completely optional for OpenAI compatible models</summary><hr/><p>Previously the <code>aidb.create_model()</code> function required a key with an empty value to be passed for the api-key. It now allows you to omit the <code>credentials</code> parameter.</p>
32+
</details></td><td></td></tr>
33+
</tbody></table>
34+
35+

advocacy_docs/edb-postgres-ai/ai-accelerator/rel_notes/index.mdx

+2
Original file line numberDiff line numberDiff line change
@@ -4,6 +4,7 @@ navTitle: Release notes
44
description: Release notes for EDB Postgres AI - AI Accelerator
55
indexCards: none
66
navigation:
7+
- ai-accelerator_2.1.2_rel_notes
78
- ai-accelerator_2.1.1_rel_notes
89
- ai-accelerator_2.0.0_rel_notes
910
- ai-accelerator_1.0.7_rel_notes
@@ -17,6 +18,7 @@ The EDB Postgres AI - AI Accelerator describes the latest version of AI Accelera
1718

1819
| AI Accelerator version | Release Date |
1920
|---|---|
21+
| [2.1.2](./ai-accelerator_2.1.2_rel_notes) | 25 Feb 2025 |
2022
| [2.1.1](./ai-accelerator_2.1.1_rel_notes) | 03 Feb 2025 |
2123
| [2.0.0](./ai-accelerator_2.0.0_rel_notes) | 13 Jan 2025 |
2224
| [1.0.7](./ai-accelerator_1.0.7_rel_notes) | 10 Dec 2024 |
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,40 @@
1+
# yaml-language-server: $schema=https://raw.githubusercontent.com/EnterpriseDB/docs/refs/heads/develop/tools/automation/generators/relgen/relnote-schema.json
2+
product: AI Accelerator - Pipelines
3+
version: 2.1.2
4+
date: 25 February 2025
5+
intro: |
6+
In this maintenance release, we fix two bugs and improve the model provider listing output.
7+
highlights: |
8+
- Model provider output now contains descriptions of the providers.
9+
relnotes:
10+
- relnote: Added model provider descriptions
11+
details: |
12+
The model provider view `select * from aidb.model_providers;` now includes a description column detailing the purpose and capabilities of each model provider.
13+
jira: "AID-176"
14+
addresses: ""
15+
type: Enhancement
16+
impact: High
17+
- relnote: Fixed the volume listing command
18+
details: |
19+
The `aidb.list_volumes();` call is now functional. A bug prevented this call from succeeding (without affecting any other functionality).
20+
jira: "AID-300"
21+
addresses: ""
22+
type: Bug Fix
23+
impact: High
24+
- relnote: Fixed the search path handling
25+
details: |
26+
Some internal commands used to append `aidb` to the search_path. This failed in certain cases depending on the contents of the existing search path. We changed the implementation to no longer modify the search_path in order to avoid this issue. This also means that the existing search_path no longer gets modified by AIDB.
27+
jira: "AID-219"
28+
addresses: ""
29+
type: Bug Fix
30+
impact: High
31+
- relnote: Credentials are now completely optional for OpenAI compatible models
32+
details: |
33+
Previously the `aidb.create_model()` function required a key with an empty value to be passed for the api-key. It now allows you to omit the `credentials` parameter.
34+
jira: ""
35+
addresses: ""
36+
type: Bug Fix
37+
impact: Low
38+
39+
40+

install_template/templates/products/edb*plus/index.njk

+1-1
Original file line numberDiff line numberDiff line change
@@ -20,5 +20,5 @@ redirects:
2020

2121
{% block otherosinstall %}
2222
## Windows
23-
- [Windows Server 2019, 2022, and Windows 11](windows)
23+
- [Windows Server 2022 and Windows 11](windows)
2424
{% endblock otherosinstall %}

product_docs/docs/edb_plus/41/installing/index.mdx

+1-1
Original file line numberDiff line numberDiff line change
@@ -66,4 +66,4 @@ Select a link to access the applicable installation instructions:
6666

6767
## Windows
6868

69-
- [Windows Server 2019, 2022, and Windows 11](windows)
69+
- [Windows Server 2022 and Windows 11](windows)

0 commit comments

Comments
 (0)