
Commit 3f91fd4

Merge pull request #6339 from EnterpriseDB/release-2024-12-10a
Production release - 2024-12-10a
2 parents 5ea3a32 + 58b79df commit 3f91fd4

68 files changed: +5202 −712 lines changed
Lines changed: 68 additions & 0 deletions
@@ -0,0 +1,68 @@
---
title: "Capabilities"
navTitle: "Capabilities"
description: "Capabilities of the EDB Postgres AI - AI Accelerator Pipelines."
---

## Pipeline Lifecycle

This is a high-level overview of the lifecycle of a pipeline in the Pipelines system.

### A storage location is created (optional)

This step is optional and is only needed when accessing data in external storage.

Data for processing can be stored in a database table or in an external storage location. If you want to use an external storage location, you must create a storage location to access the data. This storage location can be an S3 bucket or a local file system.

A storage location can be used to create a volume, which a retriever can then use to access the data it contains.

### A model is registered

A [model](models) is registered with the Pipelines system. This model can be a machine learning model, a deep learning model, or any other type of model that can be used for AI tasks.

### A retriever is registered

A retriever is registered with the Pipelines system. A retriever is a function that retrieves data from a table or volume and returns it in a format that the model can use.

By default, a retriever only needs:

* a name
* the name of a registered model to use

If the retriever is for a table, it also needs:

* the name of the source table
* the name of the column in the source table that contains the data
* the data type of the column

If, on the other hand, the retriever is for a volume, it needs:

* the name of the volume
* the name of the column in the volume that contains the data

When a retriever is registered, by default it creates a vector table to store the embeddings of the data that's retrieved. This table has a column to store the embeddings and a column to store the key of the data.

The names of the vector table, the vector column, and the key column can all be specified when the retriever is registered. This is useful if you're migrating to aidb and want to use an existing vector table.

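For example, a retriever for a table can be registered with the `aidb.register_retriever_for_table` function, passing the parameters listed above in order. The retriever, model, and table names here are illustrative:

```sql
-- Register a retriever named products_retriever that uses the registered
-- t5 model to embed the text in the description column of the products table.
SELECT aidb.register_retriever_for_table(
    'products_retriever',  -- retriever name
    't5',                  -- name of a registered model to use
    'products',            -- source table
    'description',         -- column containing the data
    'Text'                 -- data type of the column
);
```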
### Embeddings are created

During embedding, data is retrieved from the source table or volume, encoded into a vector data type, and stored in the vector table.

If the source table already has data/rows at the time the retriever is created, a manual "bulk embedding" call must be made. This generates the embeddings for all the existing data in the source table.

Auto embedding can then be activated to keep the embeddings in sync going forward. Auto embedding uses Postgres triggers to detect insertions and updates to the source table and automatically generates embeddings for the new data.

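These two steps map onto two aidb calls; the retriever name here is illustrative:

```sql
-- Generate embeddings for rows that already exist in the source table
SELECT aidb.bulk_embedding('products_retriever');

-- Use Postgres triggers to keep embeddings in sync as rows are
-- inserted or updated from now on
SELECT aidb.enable_auto_embedding_for_table('products_retriever');
```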
### Data is queried

The embedded data can be queried using the retriever. The retriever can return the key to the data or the data itself, depending on the query. The data can be queried using a text query or an image query, depending on the type of data being retrieved.

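For example, with the retrieval functions used throughout this documentation (retriever name and query text illustrative):

```sql
-- Return the keys of the 5 closest matches to the query text
SELECT * FROM aidb.retrieve_key('products_retriever', 'I like it', 5);

-- Return the matching text itself, along with the distance
SELECT * FROM aidb.retrieve_text('products_retriever', 'I like it', 5);
```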
### Next steps

While auto embedding is enabled, the embeddings are always up to date, and applications can use the retriever to query the data as needed.

### Cleanup

If the embeddings are no longer required, the retriever can be unregistered, the vector table can be dropped, and the model can be unregistered.
Lines changed: 32 additions & 0 deletions
@@ -0,0 +1,32 @@
---
title: Compatibility
navTitle: Compatibility
description: Compatibility information for the EDB Postgres AI - AI Accelerator Pipelines.
---

## Supported platforms

### aidb

* Ubuntu 22.04 LTS on x86-64
* Debian 12 (Bookworm) on x86-64

### pgfs

* Ubuntu 22.04 LTS on x86-64
* Debian 12 (Bookworm) on x86-64

## Not currently supported

* Red Hat/RHEL 8 and 9 on x86-64
* Arm architectures
* SLES
* Debian versions before the current version 12
* Ubuntu 24.04 LTS
* Non-Linux platforms

## Supported PostgreSQL versions

* EDB Postgres Advanced Server versions 14, 15, 16, and 17
* EDB Postgres Extended versions 14, 15, 16, and 17
* PostgreSQL 14, 15, 16, and 17
Lines changed: 194 additions & 0 deletions
@@ -0,0 +1,194 @@
---
title: "Getting Started with Pipelines"
navTitle: "Getting Started"
description: "How to get started with AI Accelerator Pipelines."
redirects:
- /purl/aidb/gettingstarted
---

## Where to Start

The best place to start is with the [Pipelines Overview](/edb-postgres-ai/ai-accelerator/overview) to get an understanding of what Pipelines is and how it works.

## Installation

Pipelines is included with the EDB Postgres AI - AI Accelerator suite of tools. To install Pipelines, follow the instructions in the [AI Accelerator Installation Guide](/edb-postgres-ai/ai-accelerator/installing).

## Using Pipelines

Once you have Pipelines installed, you can start using it to work with your data.

Log in to your Postgres server and ensure the Pipelines extension is installed:

```sql
CREATE EXTENSION aidb CASCADE;
```

We'll be working solely with Postgres table data in this example, so we don't need to install the pgfs extension.

Let's also create an example table to work with:

```sql
CREATE TABLE products (
    id SERIAL PRIMARY KEY,
    product_name TEXT NOT NULL,
    description TEXT,
    last_updated_at TIMESTAMP WITH TIME ZONE DEFAULT CURRENT_TIMESTAMP
);
__OUTPUT__
CREATE TABLE
```

And let's insert some data:

```sql
INSERT INTO products (product_name, description) VALUES
('Hamburger', 'A delicious combination of bread and meat'),
('Cheesburger', 'Improving on a classic, the cheese brings favorite flavors'),
('Fish n Chips', 'The fish is a little greasy and the chips do not help'),
('Fries', 'Never sure about these on their own, needs seasoning'),
('Burrito', 'Always ready for this parcel of edible wonder'),
('Pizza', 'It is very much a staple, but the rolled dough with toppings does not inspire'),
('Sandwich', 'The blandest of offerings, the sandwich is predominantly boring bread'),
('Veggie Burger', 'The ultra-processed vegetable product in this is neither healthy nor delicious'),
('Kebab', 'Maybe one of the great edible treats, sliced lamb, salad and crisp pitta');
__OUTPUT__
INSERT 0 9
```

So now we have a table with some data in it: food products and some very personal opinions about them.

## Registering a Retriever

The first step to using Pipelines with this data is to register a retriever. A retriever is a way to access the data in the table and use it in AI workflows.

```sql
select aidb.register_retriever_for_table('products_retriever', 't5', 'products', 'description', 'Text');
__OUTPUT__
 register_retriever_for_table
------------------------------
 products_retriever
(1 row)
```

## Querying the retriever

Now that we have a retriever registered, we can query it to get similar results based on the data in the table.

```sql
select * from aidb.retrieve_key('products_retriever','I like it',5);
__OUTPUT__
ERROR: Query returned no data. Hint: The "products_retriever_vector" table is likely empty. Make sure the embeddings have been computed.
```

That's because we haven't computed embeddings for our retriever yet. The `products_retriever_vector` table is where aidb keeps the computed embeddings for the retriever. Let's compute those embeddings now using `aidb.bulk_embedding`:

```sql
select aidb.bulk_embedding('products_retriever');
__OUTPUT__
INFO: bulk_embedding_text found 9 rows in retriever products_retriever
 bulk_embedding
----------------

(1 row)
```

Now we can query the retriever again:

```sql
select * from aidb.retrieve_key('products_retriever','I like it',4);
__OUTPUT__
 key |      distance
-----+--------------------
   4 | 1.0369428080621286
   3 | 1.03737124138149
   2 | 1.0839594107837638
   5 | 1.0869412071766262
(4 rows)
```

Now we have some results. The `key` column is the primary key of the row in the `products` table, and the `distance` column is the distance between the query and the result. The lower the distance, the more similar the result is to the query.

What we really want is the actual matching text, not just the key. We can use `aidb.retrieve_text` for that:

```sql
select * from aidb.retrieve_text('products_retriever','I like it',4);
__OUTPUT__
 key |                           value                            |      distance
-----+------------------------------------------------------------+--------------------
   4 | Never sure about these on their own, needs seasoning       | 1.0369428080621286
   3 | The fish is a little greasy and the chips do not help      | 1.03737124138149
   2 | Improving on a classic, the cheese brings favorite flavors | 1.0839594107837638
   5 | Always ready for this parcel of edible wonder              | 1.0869412071766262
(4 rows)
```

Now we have the actual data from the table that matches the query.

You may want the row data from the `products` table instead of the `products_retriever_vector` table. You can get that by joining the two tables:

```sql
select * from aidb.retrieve_key('products_retriever','I like it',4) as a
left join products as b
on a.key=b.id;
__OUTPUT__
 key |      distance      | id | product_name |                        description                         |         last_updated_at
-----+--------------------+----+--------------+------------------------------------------------------------+----------------------------------
   2 | 1.0839594107837638 |  2 | Cheesburger  | Improving on a classic, the cheese brings favorite flavors | 04-DEC-24 16:48:52.599806 +00:00
   3 | 1.03737124138149   |  3 | Fish n Chips | The fish is a little greasy and the chips do not help      | 04-DEC-24 16:48:52.599806 +00:00
   4 | 1.0369428080621286 |  4 | Fries        | Never sure about these on their own, needs seasoning       | 04-DEC-24 16:48:52.599806 +00:00
   5 | 1.0869412071766262 |  5 | Burrito      | Always ready for this parcel of edible wonder              | 04-DEC-24 16:48:52.599806 +00:00
(4 rows)
```

Now you have the actual data from the `products` table that matches the query. As you can see, the full power of Postgres is available to you for your AI workflows.

## One more thing: auto-embedding

As it stands, embeddings have been calculated for our data, but if we added rows to the table, they wouldn't be automatically embedded, and the retriever would go out of sync.

To keep the embeddings up to date, we can enable auto-embedding:

```sql
select aidb.enable_auto_embedding_for_table('products_retriever');
__OUTPUT__
 enable_auto_embedding_for_table
---------------------------------

(1 row)
```

Now, if we add data to the table, the embeddings are automatically calculated. We can quickly test this:

```sql
INSERT INTO products (product_name, description) VALUES
('Pasta', 'A carb-heavy delight that is always welcome, especially with a good sauce'),
('Salad', 'Meh, it is what it is and it is not much. Occasionally saved by a good dressing');
__OUTPUT__
NOTICE: Running auto embedding for retriever products. key: "10" content: "A carb-heavy delight that is always welcome, especially with a good sauce"
NOTICE: Running auto embedding for retriever products. key: "11" content: "Meh, it is what it is and it is not much. Occasionally saved by a good dressing"
INSERT 0 2
```

```sql
select * from aidb.retrieve_key('products_retriever','I like it',4) as a
left join products as b
on a.key=b.id;
__OUTPUT__
 key |      distance      | id | product_name |                                   description                                   |         last_updated_at
-----+--------------------+----+--------------+---------------------------------------------------------------------------------+----------------------------------
  10 | 1.0351907976251493 | 10 | Pasta        | A carb-heavy delight that is always welcome, especially with a good sauce       | 04-DEC-24 17:09:44.97484 +00:00
  11 | 0.979874632270706  | 11 | Salad        | Meh, it is what it is and it is not much. Occasionally saved by a good dressing | 04-DEC-24 17:09:44.97484 +00:00
   3 | 1.03737124138149   |  3 | Fish n Chips | The fish is a little greasy and the chips do not help                           | 04-DEC-24 16:48:52.599806 +00:00
   4 | 1.0369428080621286 |  4 | Fries        | Never sure about these on their own, needs seasoning                            | 04-DEC-24 16:48:52.599806 +00:00
(4 rows)
```

## Further reading

In the [Models](../models) section, you can learn how to register more models with Pipelines, including external models from OpenAI-API-compatible services.

In the [Retrievers](../retrievers) section, you can learn more about how to use retrievers with external data sources, local files, or S3 storage, and how to use the retriever functions to get the data you need.
Lines changed: 32 additions & 0 deletions
@@ -0,0 +1,32 @@
---
title: "EDB Postgres AI - AI Accelerator"
navTitle: "AI Accelerator"
directoryDefaults:
  product: "EDB Postgres AI"
  iconName: BrainCircuit
  indexCards: simple
description: "All about the EDB Postgres AI - AI Accelerator suite of tools, including Pipelines and PGvector."
navigation:
- overview
- gettingstarted
- "#Introducing Pipelines"
- pipelines-overview
- capabilities
- limitations
- compatibility
- installing
- "#Pipelines components"
- models
- retrievers
- pgfs
- "#Pipelines resources"
- reference
- rel_notes
- licenses
- "#Other components"
- pgvector
redirects:
- /edb-postgres-ai/ai-ml/
---

As part of the EDB Postgres AI platform, Pipelines abstracts away the complexity of working with AI data. It transforms Postgres into a powerful platform for AI data management, combining vector search from PGvector with automation for complex AI workflows.
Lines changed: 42 additions & 0 deletions
@@ -0,0 +1,42 @@
---
title: "Completing and verifying the extension installation"
navTitle: "Completing the installation"
description: "Completing and verifying the installation of the AI Database and File System extensions."
---

### Installing the AI Database extension

The AI Database (aidb) extension provides a set of functions to run AI/ML models in the database. Install it using the `CREATE EXTENSION` command:

```sql
edb=# CREATE EXTENSION aidb CASCADE;
NOTICE: installing required extension "vector"
CREATE EXTENSION
edb=#
```

### Installing the File System extension

The File System (pgfs) extension provides a set of functions to interact with the file system from within the database. Install it using the `CREATE EXTENSION` command:

```sql
edb=# CREATE EXTENSION pgfs;
CREATE EXTENSION
```

### Validating the installation

You can check that the extensions have been installed by running the `\dx` command in `psql`:

```sql
edb=# \dx
__OUTPUT__
                                List of installed extensions
  Name  | Version | Schema |                        Description
--------+---------+--------+------------------------------------------------------------
 aidb   | 1.0.7   | aidb   | aidb: makes it easy to build AI applications with postgres
 pgfs   | 1.0.4   | pgfs   | pgfs: enables access to filesystem-like storage locations
 vector | 0.8.0   | public | vector data type and ivfflat and hnsw access methods
```

Typically, other extensions will also be listed in this view, but the `aidb`, `pgfs`, and `vector` extensions should be among them.
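If you prefer a plain SQL check over psql's `\dx` meta-command (for example, from a client application), you can query the standard `pg_extension` catalog instead. This is a general Postgres technique, not an aidb-specific function:

```sql
-- List the Pipelines-related extensions and their installed versions
SELECT extname, extversion
FROM pg_extension
WHERE extname IN ('aidb', 'pgfs', 'vector')
ORDER BY extname;
```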
Lines changed: 15 additions & 0 deletions
@@ -0,0 +1,15 @@
---
title: "Installing AI Accelerator Pipelines"
navTitle: "Installing"
description: "How to install AI Accelerator Pipelines."
navigation:
- packages
- complete
---

Pipelines is delivered as a set of extensions. Depending on how you are deploying Pipelines, these extensions may be installed by your deployment platform (such as EDB Cloud Service). If you deploy your own Postgres server, you need to install them manually.

- [Manually installing Pipelines packages](packages)

Once the packages are installed, you can [complete the installation](complete) by activating the extensions within Postgres.