Merge pull request #6790 from EnterpriseDB/release-2025-05-12a

djw-m · web-flow · commit 6c5c1c1bcc2b · 2025-05-12T09:29:41.000-04:00
Release 2025-05-12a
diff --git a/advocacy_docs/edb-postgres-ai/ai-accelerator/reference/index.mdx b/advocacy_docs/edb-postgres-ai/ai-accelerator/reference/index.mdx
@@ -21,6 +21,8 @@ navigation:
 * [aidb.create_model](models#aidbcreate_model)
 * [aidb.get_model](models#aidbget_model)
 * [aidb.delete_model](models#aidbdelete_model)
+* [aidb.get_hcp_models](models#aidbget_hcp_models)
+* [aidb.create_hcp_model](models#aidbcreate_hcp_model)
 * [aidb.encode_text](models#aidbencode_text)
 * [aidb.encode_text_batch](models#aidbencode_text_batch)
 * [aidb.decode_text](models#aidbdecode_text)
diff --git a/advocacy_docs/edb-postgres-ai/ai-accelerator/reference/models.mdx b/advocacy_docs/edb-postgres-ai/ai-accelerator/reference/models.mdx
@@ -13,8 +13,8 @@ This reference documentation for EDB Postgres AI - AI Accelerator Pipelines mode
 
 The `aidb.model_providers` table stores information about the model providers that are available.
 
-| Column               | Type     | Description                      |
-|----------------------|----------|----------------------------------|
+| Column               | Type     | Description                     |
+| -------------------- | -------- | ------------------------------- |
 | `server_name`        | name     | Name for the model server       |
 | `server_description` | text     | Description of the model server |
 | `server_options`     | text\[\] | Options for the model server    |
@@ -25,8 +25,8 @@ The `aidb.model_providers` table stores information about the model providers th
 Returns a list of all models in the registry and their configured options, including predefined models and user-created models.
 
 
-| Column     | Type  | Description                                    |
-|------------|-------|------------------------------------------------|
+| Column     | Type  | Description                                   |
+| ---------- | ----- | --------------------------------------------- |
 | `name`     | text  | User-defined name for the model               |
 | `provider` | text  | Name of the model provider                    |
 | `options`  | jsonb | Optional configuration for the model provider |
@@ -54,13 +54,13 @@ Creates a new model in the system by saving its name, provider, and optional con
 
 #### Parameters
 
-| Parameter             | Type    | Default     | Description                                                                                                 |
-|-----------------------|---------|-------------|-------------------------------------------------------------------------------------------------------------|
-| `name`                | text    |             | User-defined name for the model.                                                                            |
-| `provider`            | text    |             | Name of the model provider (as found in [aidb.model_providers](#aidbmodel_providers)).                      |
-| `config`              | jsonb   | '{}'::jsonb | Optional configuration for the model provider. May include model-specific parameters such as `model`, `url`, and TLS options (e.g., `tls_config`).|
-| `credentials`         | jsonb   | '{}'::jsonb | Optional credentials for the model provider.                                                                |
-| `replace_credentials` | boolean | false       | If true, replaces the credentials for the model provider. If false, the credentials aren't overwritten. |
+| Parameter             | Type    | Default     | Description                                                                                                                                        |
+| --------------------- | ------- | ----------- | -------------------------------------------------------------------------------------------------------------------------------------------------- |
+| `name`                | text    |             | User-defined name for the model.                                                                                                                   |
+| `provider`            | text    |             | Name of the model provider (as found in [aidb.model_providers](#aidbmodel_providers)).                                                             |
+| `config`              | jsonb   | '{}'::jsonb | Optional configuration for the model provider. May include model-specific parameters such as `model`, `url`, and TLS options (e.g., `tls_config`). |
+| `credentials`         | jsonb   | '{}'::jsonb | Optional credentials for the model provider.                                                                                                       |
+| `replace_credentials` | boolean | false       | If true, replaces the credentials for the model provider. If false, the credentials aren't overwritten.                                            |
 
 
 #### Example
@@ -110,14 +110,14 @@ Returns the configuration for a model in the registry.
 
 #### Parameters
 
-| Parameter    | Type | Default | Description        |
-|--------------|------|---------|--------------------|
+| Parameter    | Type | Default | Description       |
+| ------------ | ---- | ------- | ----------------- |
 | `model_name` | text |         | Name of the model |
 
 #### Returns
 
-| Column     | Type  | Description                                    |
-|------------|-------|------------------------------------------------|
+| Column     | Type  | Description                                   |
+| ---------- | ----- | --------------------------------------------- |
 | `name`     | text  | User-defined name for the model               |
 | `provider` | text  | Name of the model provider                    |
 | `options`  | jsonb | Optional configuration for the model provider |
@@ -139,8 +139,8 @@ Deletes a model from the registry.
 
 #### Parameters
 
-| Parameter    | Type | Default | Description        |
-|--------------|------|---------|--------------------|
+| Parameter    | Type | Default | Description       |
+| ------------ | ---- | ------- | ----------------- |
 | `model_name` | text |         | Name of the model |
 
 #### Example
@@ -156,18 +156,52 @@ __OUTPUT__
 
 #### Returns
 
-| Column                    | Type  | Description                                              |
-|---------------------------|-------|----------------------------------------------------------|
+| Column         | Type  | Description                                          |
+| -------------- | ----- | ---------------------------------------------------- |
 | `delete_model` | jsonb | The name, provider, and options of the deleted model |
 
+
+### `aidb.get_hcp_models`
+
+Gets models running on the hybrid control plane.
+
+#### Returns
+
+| Column  | Type | Description                                       |
+| ------- | ---- | ------------------------------------------------- |
+| `name`  | text | The name of the model instance running on the HCP |
+| `url`   | text | The API URL of the model running on the HCP       |
+| `model` | text | The name the model running on the HCP             |
+
+#### Example
+```sql
+SELECT * FROM  aidb.get_hcp_models();
+                 name                  |                                       url                                        |               model                
+---------------------------------------+----------------------------------------------------------------------------------+------------------------------------
+ llama-3-1-8b-instruct-1xgpu-g6        | http://llama-3-1-8b-instruct-1xgpu-g6-predictor.default.svc.cluster.local        | meta/llama-3.1-8b-instruct
+ llama-3-2-nv-embedqa-1b-v2            | http://llama-3-2-nv-embedqa-1b-v2-predictor.default.svc.cluster.local            | nvidia/llama-3.2-nv-embedqa-1b-v2
+ meta-nim-llama3-70b-instruct-8xgpu-g5 | http://meta-nim-llama3-70b-instruct-8xgpu-g5-predictor.default.svc.cluster.local | meta/llama3-70b-instruct
+(3 rows)
+```
+### `aidb.create_hcp_model`
+
+Creates a new model in the system by referencing a running instance in the HCP
+
+#### Parameters
+
+| Parameter        | Type | Default | Description                               |
+| ---------------- | ---- | ------- | ----------------------------------------- |
+| `name`           | text |         | User-defined name of the model            |
+| `hcp_model_name` | text |         | Name of the model instance running on HCP |
+
 ### `aidb.encode_text`
 
 Encodes text using a model, generating a vector representation of a given text input.
 
 #### Parameters
 
-| Parameter    | Type | Default | Description                       |
-|--------------|------|---------|-----------------------------------|
+| Parameter    | Type | Default | Description                      |
+| ------------ | ---- | ------- | -------------------------------- |
 | `model_name` | text |         | Name of the model to encode with |
 | `text`       | text |         | Text to encode                   |
 
@@ -177,10 +211,10 @@ Encodes a batch of text using a model, generating a vector representation of a g
 
 #### Parameters
 
-| Parameter    | Type     | Default | Description                       |
-|--------------|----------|---------|-----------------------------------|
+| Parameter    | Type     | Default | Description                      |
+| ------------ | -------- | ------- | -------------------------------- |
 | `model_name` | text     |         | Name of the model to encode with |
-| `text`       | text\[\] |         | Array of text to encode                   |
+| `text`       | text\[\] |         | Array of text to encode          |
 
 
 ### `aidb.decode_text`
@@ -189,15 +223,15 @@ Decodes text using a model, generating a vector representation of a given text i
 
 #### Parameters
 
-| Parameter    | Type | Default | Description                       |
-|--------------|------|---------|-----------------------------------|
+| Parameter    | Type | Default | Description                      |
+| ------------ | ---- | ------- | -------------------------------- |
 | `model_name` | text |         | Name of the model to decode with |
 | `text`       | text |         | Text to decode                   |
 
 #### Returns
 
 | Column        | Type | Description      |
-|---------------|------|------------------|
+| ------------- | ---- | ---------------- |
 | `decode_text` | text | The decoded text |
 
 ### `aidb.decode_text_batch`
@@ -207,14 +241,14 @@ Decodes a batch of text using a model, generating a representation of a given te
 #### Parameters
 
 | Parameter    | Type     | Default | Description                      |
-|--------------|----------|---------|----------------------------------|
+| ------------ | -------- | ------- | -------------------------------- |
 | `model_name` | text     |         | Name of the model to decode with |
 | `text`       | text\[\] |         | Array of text to decode          |
 
 #### Returns
 
 | Column        | Type | Description      |
-|---------------|------|------------------|
+| ------------- | ---- | ---------------- |
 | `decode_text` | text | The decoded text |
 
 ### `aidb.encode_image`
@@ -224,14 +258,14 @@ Encodes an image using a model, generating a vector representation of a given im
 #### Parameters
 
 | Parameter    | Type  | Default | Description                      |
-|--------------|-------|---------|----------------------------------|
+| ------------ | ----- | ------- | -------------------------------- |
 | `model_name` | text  |         | Name of the model to encode with |
 | `image`      | bytea |         | Image to encode                  |
 
 #### Returns
 
 | Column         | Type  | Description       |
-|----------------|-------|-------------------|
+| -------------- | ----- | ----------------- |
 | `encode_image` | bytea | The encoded image |
 
 ### `aidb.rerank_text`
@@ -241,15 +275,15 @@ Reranks text using a model, generating a vector representation of a given text i
 #### Parameters
 
 | Parameter    | Type     | Default | Description                                   |
-|--------------|----------|---------|-----------------------------------------------|
+| ------------ | -------- | ------- | --------------------------------------------- |
 | `model_name` | text     |         | Name of the model to rerank with              |
 | `query`      | text     |         | Query based on which the input will be ranked |
 | `input`      | text\[\] | \[\]    | Inputs to be ranked                           |
 
 #### Returns
 
 | Column        | Type             | Description                                |
-|---------------|------------------|--------------------------------------------|
+| ------------- | ---------------- | ------------------------------------------ |
 | `text`        | text             | The text from "input"                      |
 | `logit_score` | double precision | Score/rank of this text                    |
 | `id`          | int              | index that the text had in the input array |
diff --git a/advocacy_docs/edb-postgres-ai/ai-accelerator/rel_notes/ai-accelerator_4.0.1_rel_notes.mdx b/advocacy_docs/edb-postgres-ai/ai-accelerator/rel_notes/ai-accelerator_4.0.1_rel_notes.mdx
@@ -0,0 +1,33 @@
+---
+title: AI Accelerator - Pipelines 4.0.1 release notes
+navTitle: Version 4.0.1
+originalFilePath: advocacy_docs/edb-postgres-ai/ai-accelerator/rel_notes/src/rel_notes_4.0.1.yml
+editTarget: originalFilePath
+---
+
+Released: 9 May 2025
+
+This is a minor release that includes a few bug fixes and enhancements to the knowledge base pipeline.
+
+## Highlights
+
+- Bug fixes and performance enhancements.
+- Simplified model integration for HCP.
+
+## Enhancements
+
+<table class="table w-100"><thead><tr><th>Description</th><th width="10%">Addresses</th></tr></thead><tbody>
+<tr><td><details><summary>Bug fixes for knowledge base and preparer pipeline schema handling.</summary><hr/><p>The knowledge base and preparer pipeline now support arbitrary Postgres schemas for source and destination tables/volumes.
+A bug prevented users for configuring explicit schemas via Postgres qualified identifiers (<code>schema.name</code>) when referencing source or destination tables/volumes in the create pipeline calls. This bug is now fixed.</p>
+</details></td><td></td></tr>
+<tr><td><details><summary>Bug fix for knowledge base result accuracy.</summary><hr/><p>A bug in the batch-processing code for knowledge base pipelines would, under certain circumstances, lead to inaccurate results during retrieval.
+This bug was introduced in the 4.0.0 release and is now fixed.</p>
+</details></td><td></td></tr>
+<tr><td><details><summary>Simplified model integration for HCP.</summary><hr/><p>External models, running on the HCP Model Serving infrastructure, can now be listed and integrated into AIDB with new helper functions <code>aidb.get_hcp_models()</code> and <code>aidb.create_hcp_model()</code>.</p>
+</details></td><td></td></tr>
+<tr><td><details><summary>Performance enhancement for embeddings processing with external models.</summary><hr/><p>When using external models for embeddings (e.g. with the nim_embeddings model adapter), AIDB performs additional API calls in order to probe the model service for the response type.
+These calls are now skipped in most situations to reduce the overhead to a minimum when running embedding processing. AIDB will now use actual results to determine the response type.</p>
+</details></td><td></td></tr>
+</tbody></table>
+
+
diff --git a/advocacy_docs/edb-postgres-ai/ai-accelerator/rel_notes/index.mdx b/advocacy_docs/edb-postgres-ai/ai-accelerator/rel_notes/index.mdx
@@ -4,6 +4,7 @@ navTitle: Release notes
 description: Release notes for EDB Postgres AI - AI Accelerator
 indexCards: none
 navigation:
+  - ai-accelerator_4.0.1_rel_notes
   - ai-accelerator_4.0.0_rel_notes
   - ai-accelerator_3.0.1_rel_notes
   - ai-accelerator_2.2.1_rel_notes
@@ -21,6 +22,7 @@ The EDB Postgres AI - AI Accelerator describes the latest version of AI Accelera
 
 | AI Accelerator version | Release Date |
 |---|---|
+| [4.0.1](./ai-accelerator_4.0.1_rel_notes) | 09 May 2025 |
 | [4.0.0](./ai-accelerator_4.0.0_rel_notes) | 05 May 2025 |
 | [3.0.1](./ai-accelerator_3.0.1_rel_notes) | 03 Apr 2025 |
 | [2.2.1](./ai-accelerator_2.2.1_rel_notes) | 13 Mar 2025 |
diff --git a/advocacy_docs/edb-postgres-ai/ai-accelerator/rel_notes/src/rel_notes_4.0.1.yml b/advocacy_docs/edb-postgres-ai/ai-accelerator/rel_notes/src/rel_notes_4.0.1.yml
@@ -0,0 +1,41 @@
+# yaml-language-server: $schema=https://raw.githubusercontent.com/EnterpriseDB/docs/refs/heads/develop/tools/automation/generators/relgen/relnote-schema.json
+product: AI Accelerator - Pipelines
+version: 4.0.1
+date: 9 May 2025
+intro: |
+  This is a minor release that includes a few bug fixes and enhancements to the knowledge base pipeline.
+highlights: |
+   - Bug fixes and performance enhancements.
+   - Simplified model integration for HCP.
+relnotes:
+- relnote: Bug fixes for knowledge base and preparer pipeline schema handling.
+  details: |
+    The knowledge base and preparer pipeline now support arbitrary Postgres schemas for source and destination tables/volumes.
+    A bug prevented users for configuring explicit schemas via Postgres qualified identifiers (`schema.name`) when referencing source or destination tables/volumes in the create pipeline calls. This bug is now fixed.
+  jira: ""
+  addresses: ""
+  type: Enhancement
+  impact: Medium
+- relnote: Bug fix for knowledge base result accuracy.
+  details: |
+    A bug in the batch-processing code for knowledge base pipelines would, under certain circumstances, lead to inaccurate results during retrieval.
+    This bug was introduced in the 4.0.0 release and is now fixed.
+  jira: "AID-425"
+  addresses: ""
+  type: Enhancement
+  impact: Medium
+- relnote: Simplified model integration for HCP.
+  details: |
+    External models, running on the HCP Model Serving infrastructure, can now be listed and integrated into AIDB with new helper functions `aidb.get_hcp_models()` and `aidb.create_hcp_model()`.
+  jira: "AID-416"
+  addresses: ""
+  type: Enhancement
+  impact: Medium
+- relnote: Performance enhancement for embeddings processing with external models.
+  details: |
+    When using external models for embeddings (e.g. with the nim_embeddings model adapter), AIDB performs additional API calls in order to probe the model service for the response type.
+    These calls are now skipped in most situations to reduce the overhead to a minimum when running embedding processing. AIDB will now use actual results to determine the response type.
+  jira: "AID-424"
+  addresses: ""
+  type: Enhancement
+  impact: Medium