Adding ollama as a service to docker compose file #3


Open · krasch wants to merge 3 commits into main from ollama-in-docker-compose

Conversation

krasch (Collaborator) commented Jun 4, 2025

This PR adds ollama to the docker compose file, i.e. ollama is automatically started by docker-compose up
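
In outline, the kind of service entry this adds looks roughly like the following (a minimal sketch; the image tag, port mapping, and volume name here are illustrative rather than copied from the PR):

```yaml
# Sketch only - not necessarily the exact compose file in this PR.
services:
  ollama:
    image: ollama/ollama          # official Ollama image
    ports:
      - "11434:11434"             # Ollama's default HTTP API port
    volumes:
      - ollama:/root/.ollama      # persist pulled models across restarts

volumes:
  ollama:
```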

krasch (Collaborator, Author) commented Jun 4, 2025

With this PR, the README structure does not really work that well anymore. ollama is a prerequisite only in the non-docker case, and the troubleshooting advice is also specific to the non-docker case. Perhaps there should be two big "When not using docker" and "When using docker" sections?

Also, there is some repetition with the model names. And as a user I would actually like some guidance on when to use which model (e.g. when I only have a CPU). Perhaps just give one example for each model and then have a table with recommended alternatives and the situations they are suitable for? This table could then be referenced from both the docker and non-docker sections.

README.md Outdated
docker-compose up

# Install embedding model (required)
docker exec -it ollama ollama pull nomic-embed-text
krasch (Collaborator, Author) commented:

Here the first ollama is the name of the container and the second is the name of the command. This is a bit cryptic, but hopefully understandable enough for users of this first version?

clstaudt (Owner) commented Jun 5, 2025

@krasch Thank you for the PR. Is this ready to merge, so that I can update the README?

krasch (Collaborator, Author) commented Jun 5, 2025

> @krasch Thank you for the PR. Is this ready to merge, so that I can update the README?

There are still two issues that need to be investigated.

  1. Do we need a volume for the ollama service (see above)?
  2. For GPU support, do we need to add anything w.r.t. nvidia-docker, or does it just work out of the box? I have not yet tested this on a GPU machine. (A rough sketch of both points is below.)

If you want, we can create issues for these two and merge this one, so that you can update the README.
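
A rough sketch of what both points could look like in the compose file (a named volume for point 1, and the standard compose device reservation for NVIDIA GPUs for point 2; untested, as said):

```yaml
# Sketch only: the volume keeps pulled models across container restarts (point 1);
# the device reservation needs the NVIDIA Container Toolkit on the host (point 2).
services:
  ollama:
    volumes:
      - ollama:/root/.ollama
    deploy:
      resources:
        reservations:
          devices:
            - driver: nvidia
              count: 1
              capabilities: [gpu]

volumes:
  ollama:
```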

clstaudt (Owner) commented Jun 5, 2025

@krasch This is why I was unsure about adding ollama to the docker setup. I guess the PR needs to wait until this is understood.

krasch (Collaborator, Author) commented Jun 6, 2025

> @krasch This is why I was unsure about adding ollama to the docker setup. I guess the PR needs to wait until this is understood.

I understand that this is what you are worried about, but I know 100% that it is possible; I have done it before. It just might need some additional configuration.

Just make whatever changes you need in the README and I will get this branch synced later.

krasch force-pushed the ollama-in-docker-compose branch from 2e37997 to 87fcb5f on June 6, 2025, 13:13
krasch force-pushed the ollama-in-docker-compose branch from 87fcb5f to 26cda89 on June 6, 2025, 13:16
krasch force-pushed the ollama-in-docker-compose branch from 7dd82d1 to 9c9cc24 on June 6, 2025, 13:55
krasch changed the title from "[WIP] Adding ollama as a service to docker compose file" to "Adding ollama as a service to docker compose file" on June 6, 2025
krasch (Collaborator, Author) commented Jun 6, 2025

@clstaudt This is now ready to merge. I confirmed that ollama does indeed use the GPU in this setup.

I moved things around a bit in the README, please review.
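
For anyone who wants to double-check the GPU usage themselves, one way to do so (assuming the container is named ollama, as in the pull command above):

```sh
# Should show the NVIDIA GPU inside the container if the device reservation works;
# "ollama ps" reports whether a loaded model is running on GPU or CPU.
docker exec -it ollama nvidia-smi
docker exec -it ollama ollama ps
```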

clstaudt (Owner) commented Jun 7, 2025

@krasch Thank you for moving this forward!

  1. May be nitpicking, but why do I need to remember a more complicated docker command if I have a GPU? (A GPU is not required, but highly recommended to run this app smoothly.)

     services:
       ollama:
         deploy:
           resources:
             reservations:
               devices:
                 - driver: nvidia
                   count: 1
                   capabilities:
                     - gpu

  2. What if I have a GPU, but it is not an NVIDIA GPU?

krasch (Collaborator, Author) commented Jun 12, 2025

> 1. May be nitpicking, but why do I need to remember a more complicated docker command if I have a GPU? (A GPU is not required, but highly recommended to run this app smoothly.)

Because I have not found an easier way to make that happen with docker compose; unfortunately, there does not seem to be a way to simply set a "use GPU" flag on the command line.

What could be done to simplify things for GPU users is to move the content of docker-compose.gpu.yml directly into the main docker-compose.yml. Then GPU users would only need to run docker compose up. CPU users, on the other hand, would need to open the docker-compose.yml file and comment out those lines, so it is much worse for CPU users. Let me know if you want me to make that change.
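
For context, the override-file approach currently in the PR would presumably be invoked like this (assuming the file is named docker-compose.gpu.yml as mentioned above; the exact command in the README may differ):

```sh
# CPU-only users
docker compose up

# GPU users: layer the GPU override on top of the base file
docker compose -f docker-compose.yml -f docker-compose.gpu.yml up
```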

> 2. What if I have a GPU, but it is not an NVIDIA GPU?

There isn't really CUDA support for other GPUs, so these are usually not supported anyway. Deep learning happens on nvidia.


So I have been reading up a little on the Apple side, and I have found that what I have here in the PR will actually only work on Linux (see https://chariotsolutions.com/blog/post/apple-silicon-gpus-docker-and-ollama-pick-two/), so perhaps it is best if you just close the PR.

Be aware, though, that your main branch might not work for Linux users instead. Accessing a host HTTP port (ollama) from within a docker container (your app) is a bit unusual, and the last time I tried it I could not make it work on Linux. There is this answer, https://stackoverflow.com/a/24326540, which I believe I tried last time and failed with, and then just refactored to do things in the standard manner (i.e. either everything in docker or nothing).
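
For reference, the usual workaround on Linux for reaching a host port from inside a container is the host-gateway mapping below. This is a sketch only, not verified for this app; the service name and the OLLAMA_URL variable are placeholders, not taken from this project:

```yaml
# Sketch: let the containerized app reach an ollama server running on the host.
# Requires Docker 20.10+; "app" and OLLAMA_URL are placeholder names.
services:
  app:
    extra_hosts:
      - "host.docker.internal:host-gateway"
    environment:
      - OLLAMA_URL=http://host.docker.internal:11434
```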

clstaudt (Owner) commented:
> There isn't really CUDA support for other GPUs, so these are usually not supported anyway. Deep learning happens on nvidia.

With Ollama, LLM inference works just fine on Apple Silicon GPUs - out of the box when Mac users install and start Ollama.app.

At this point I am not even sure that a Docker configuration simplifies anything for this application.

krasch (Collaborator, Author) commented Jun 15, 2025

> At this point I am not even sure that a Docker configuration simplifies anything for this application.

So that Windows and Linux users, and Mac users with older hardware, can try out your application extremely quickly without having to install anything (if they already have docker).

But up to you, your application, your decision.
