@@ -42,21 +41,19 @@ The next parameter is the configuration. This is a JSON string, which when expan
In this case, we are setting the model to [“llama3.3”](https://ollama.com/library/llama3.3), a relatively new and powerful model. Remember to run `ollama run llama3.3` to pull and start the model on the server.
-
The next JSON setting is the important one, overriding the endpoint that the aidb model will use.
+
The next JSON setting is the important one, overriding the endpoint that the aidb model will use.
* Our server is running on a machine called `llama.local`.
* It has port 11434 (the default port for Ollama) open to service requests over HTTP (not HTTPS in this case).
* The path to the endpoint on the server is `/v1/embeddings`, the same as OpenAI's.
-
Putting those components together we get `[`http://llama.local:11434/v1/embeddings`](http://art.local:11434/v1/embeddings","api_key":"","dimensions":2000}'::JSONB)` as our end point.
+
Putting those components together, we get `http://llama.local:11434/v1/embeddings` as our endpoint.
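As a rough illustration of how the pieces listed above combine, the following Python sketch assembles the endpoint with the standard library. The variable names are illustrative only and are not part of aidb or Ollama.

```python
from urllib.parse import urlunsplit

# Components named in the text above.
scheme = "http"          # plain HTTP, not HTTPS, in this setup
host = "llama.local"     # machine running Ollama
port = 11434             # Ollama's default port
path = "/v1/embeddings"  # same path as OpenAI's embeddings endpoint

# urlunsplit takes (scheme, netloc, path, query, fragment).
endpoint = urlunsplit((scheme, f"{host}:{port}", path, "", ""))
print(endpoint)
```

Building the URL from named parts makes it easier to see which piece to change if the server moves to another host, port, or HTTPS.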
The last JSON parameter in this example is “dimensions”, which is a hint to the system about how many vector values to expect from the model. If we [look up llama3.3’s properties](https://ollama.com/library/llama3.3/blobs/4824460d29f2), we can see the `llama.embedding_length` value is 8192. The provider defaults to 1536 (with some hard-wired exceptions depending on the model), but it doesn’t know about llama3.3’s maximum. Another factor is that [pgvector can index at most 2000 dimensions](https://github.com/pgvector/pgvector?tab=readme-ov-file#what-if-i-want-to-index-vectors-with-more-than-2000-dimensions). So we pass a dimension value of 2000 in the configuration to get the maximum number of dimensions that pgvector can index.
That completes the configuration parameter.
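The configuration described above could be assembled along the following lines. This is a minimal sketch: only `dimensions` is named as a key in the text, so the `model` and `url` key names here are assumptions and should be checked against the aidb documentation before use.

```python
import json

MODEL_EMBEDDING_LENGTH = 8192  # llama.embedding_length reported for llama3.3
PGVECTOR_INDEX_LIMIT = 2000    # most dimensions pgvector can index

# Use the smaller of what the model offers and what pgvector can index.
dimensions = min(MODEL_EMBEDDING_LENGTH, PGVECTOR_INDEX_LIMIT)

config = {
    "model": "llama3.3",                              # assumed key name
    "url": "http://llama.local:11434/v1/embeddings",  # assumed key name
    "dimensions": dimensions,                         # key named in the text
}

# Serialize to the JSON string that would be passed as the configuration.
config_json = json.dumps(config)
print(config_json)
```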
-
### Credentials
-
-
The last parameter is the credentials parameter, which is another JSON string. It’s usually used for carrying the `api_key` for the OpenAI service and any other necessary credential information. It is not part of the configuration and by being separate, it can be securely hidden from users with lesser permissions. For our ollama connection, we don’t need an `api_key`, but the model provider currently requires that one is specified. We can specify an empty string for the `api_key` to satisfy this requirement.
+
If the endpoint requires an API key, that would be passed in the credentials parameter. As this is a local model, we don’t need to pass any credentials.
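One way to express "credentials only when needed" in client code is sketched below. `build_credentials` is a hypothetical helper, not an aidb function; it assumes the credentials JSON carries an `api_key` entry, as the earlier wording of this section described.

```python
import json
from typing import Optional


def build_credentials(api_key: Optional[str] = None) -> Optional[str]:
    """Return a credentials JSON string, or None when no key is needed."""
    if not api_key:
        # Local model: nothing to pass, so the parameter can be omitted.
        return None
    return json.dumps({"api_key": api_key})


local_credentials = build_credentials()               # local Ollama server
hosted_credentials = build_credentials("sk-example")  # placeholder key
print(local_credentials, hosted_credentials)
```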
<tr><td><details><summary>Added model provider descriptions</summary><hr/><p>The model provider view <code>select * from aidb.model_providers;</code> now includes a description column detailing the purpose and capabilities of each model provider.</p>
<tr><td><details><summary>Fixed the volume listing command</summary><hr/><p>The <code>aidb.list_volumes();</code> call is now functional. A bug prevented this call from succeeding (without affecting any other functionality).</p>
+
</details></td><td></td></tr>
+
<tr><td><details><summary>Fixed the search path handling</summary><hr/><p>Some internal commands used to append <code>aidb</code> to the search_path. This failed in certain cases depending on the contents of the existing search path. We changed the implementation to no longer modify the search_path in order to avoid this issue. This also means that the existing search_path no longer gets modified by AIDB.</p>
+
</details></td><td></td></tr>
+
<tr><td><details><summary>Credentials are now completely optional for OpenAI compatible models</summary><hr/><p>Previously the <code>aidb.create_model()</code> function required a key with an empty value to be passed for the api-key. It now allows you to omit the <code>credentials</code> parameter.</p>
In this maintenance release, we fix two bugs and improve the model provider listing output.
+
highlights: |
+
- Model provider output now contains descriptions of the providers.
+
relnotes:
+
- relnote: Added model provider descriptions
+
details: |
+
The model provider view `select * from aidb.model_providers;` now includes a description column detailing the purpose and capabilities of each model provider.
+
jira: "AID-176"
+
addresses: ""
+
type: Enhancement
+
impact: High
+
- relnote: Fixed the volume listing command
+
details: |
+
The `aidb.list_volumes();` call is now functional. A bug prevented this call from succeeding (without affecting any other functionality).
+
jira: "AID-300"
+
addresses: ""
+
type: Bug Fix
+
impact: High
+
- relnote: Fixed the search path handling
+
details: |
+
Some internal commands used to append `aidb` to the search_path. This failed in certain cases depending on the contents of the existing search path. We changed the implementation to no longer modify the search_path in order to avoid this issue. This also means that the existing search_path no longer gets modified by AIDB.
+
jira: "AID-219"
+
addresses: ""
+
type: Bug Fix
+
impact: High
+
- relnote: Credentials are now completely optional for OpenAI compatible models
+
details: |
+
Previously the `aidb.create_model()` function required a key with an empty value to be passed for the api-key. It now allows you to omit the `credentials` parameter.