RunPod Serverless Worker for ComfyUI

This project implements a custom RunPod serverless worker that forwards requests from the RunPod platform to a ComfyUI server. It provides the required RunPod API endpoints (/run, /runsync, /health) while leveraging ComfyUI's powerful workflow execution capabilities.

Architecture

The solution consists of:

Custom Handler (src/handler.py) - Enhanced RunPod serverless API with validation and monitoring
Start Script (src/start.sh) - Production startup with symlinks to shared models
Docker Configuration (Dockerfile.runpod.serverless) - Builds the complete container image
Dependencies (requirements.txt) - Python packages for the handler

Key Features

✅ RunPod API Compliance: Implements /run, /runsync, and /health endpoints
✅ Workflow Submission: Accepts ComfyUI workflows in API format
✅ Image Upload Support: Handles base64-encoded input images
✅ Real-time Monitoring: Uses ComfyUI websocket API for job progress
✅ Flexible Output: Returns images as base64 or uploads to S3
✅ Error Handling: Comprehensive error reporting and logging
✅ Custom Node Support: Includes all necessary ComfyUI custom nodes

Quick Start

1. Build the Docker Image

This repository supports a two-image workflow to make Python-only changes fast:

Base image (:base-1.0): CUDA + OS deps + torch + ComfyUI + custom nodes (rare rebuild)
App image (e.g. :1.5.0): worker code + schemas + workflows (frequent rebuild)

The build entrypoint is build_custom.ps1.

Build (base/app split)

# Build base image (rare)
./build_custom.ps1 -Target base -ImageName "fcaldas/tabario.com" -BaseImageTag "base-1.0"

# Build app image (fast for python-only changes)
./build_custom.ps1 -Target app -ImageName "fcaldas/tabario.com" -BaseImageTag "base-1.0" -ImageTag "1.5.0"

Build + push

docker login

# Build + push base image
./build_custom.ps1 -Target base -ImageName "fcaldas/tabario.com" -BaseImageTag "base-1.0" -Push

# Build + push app image
./build_custom.ps1 -Target app -ImageName "fcaldas/tabario.com" -BaseImageTag "base-1.0" -ImageTag "1.5.0" -Push

Legacy single Dockerfile build

cd y:\projects\runpod-serverless

docker build `
  -f .\Dockerfile.runpod.serverless `
  -t fcaldas/tabario.com:1.3 `
  .

2. Push to Docker Registry

docker login
docker push fcaldas/tabario.com:1.3

3. Deploy to RunPod

Go to RunPod Console → Serverless → Templates
Create new template with:
- Container Image: fcaldas/tabario.com:1.3
- Container Disk: 100 GB
- GPU: 16-24 GB VRAM recommended
- Environment Variables:
  - CIVITAI_API_KEY=<token> when using Civitai-hosted models (only needed if you later enable runtime downloads)
Deploy the template as a serverless endpoint

Models via Network Volume

This container uses a RunPod Network Volume mounted at /runpod-volume. On startup, it creates symlinks from /runpod-volume/comfyui/models/<subfolder> to /comfyui/models/<subfolder> so ComfyUI can load shared models without downloading.

Ensure your volume contains the expected ComfyUI subfolders and files, e.g. vae, clip, unet, loras, checkpoints, etc.

API Usage

Submit Workflow (Synchronous)

curl -X POST \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d @test_input.json \
  https://api.runpod.ai/v2/YOUR_ENDPOINT_ID/runsync

Submit Workflow (Asynchronous)

curl -X POST \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d @test_input.json \
  https://api.runpod.ai/v2/YOUR_ENDPOINT_ID/run

Response Format

{
  "id": "sync-uuid-string",
  "status": "COMPLETED", 
  "output": {
    "images": [
      {
        "filename": "ComfyUI_00001_.png",
        "type": "base64",
        "data": "iVBORw0KGgoAAAANSUhEUg..."
      }
    ]
  },
  "delayTime": 123,
  "executionTime": 4567
}

Configuration

Environment Variables

Variable	Default	Description
`COMFY_HOST`	`127.0.0.1:8188`	ComfyUI server address
`COMFY_LOG_LEVEL`	`DEBUG`	ComfyUI logging level
`BUCKET_ENDPOINT_URL`	-	S3 endpoint for image uploads
`CIVITAI_API_KEY`	-	Required to download Civitai-hosted models (e.g., `t2i-chroma-anime`)

S3 Configuration (Optional)

To enable S3 uploads instead of base64 responses:

# Set these environment variables in your RunPod template
BUCKET_ENDPOINT_URL=https://s3.amazonaws.com
AWS_ACCESS_KEY_ID=your_access_key
AWS_SECRET_ACCESS_KEY=your_secret_key
BUCKET_NAME=your_bucket_name

Input Format

The handler supports multiple ComfyUI workflow templates via the comfyui_workflow_name input.

Workflow: Wan 2.2 Image-to-Video (I2V)

Use comfyui_workflow_name: "video_wan2_2_14B_i2v" (default). This workflow requires an input image and uses width, height, and length.

{
  "input": {
    "prompt": "A beautiful sunset over the ocean with waves crashing",
    "image": "data:image/png;base64,iVBORw0KGgoAAAANSUhEUg...",
    "width": 480,
    "height": 640,
    "length": 81,
    "comfyui_workflow_name": "video_wan2_2_14B_i2v"
  }
}

Workflow: Qwen Text-to-Image (T2I)

Use comfyui_workflow_name: "image_qwen_t2i". This workflow does not require image. The handler injects:

prompt into {{ IMAGE_PROMPT }}
width into {{ IMAGE_WIDTH }}
height into {{ IMAGE_HEIGHT }}

as defined in workflows/image_qwen_image_distill_official_comfyui.json (notably the EmptySD3LatentImage node).

{
  "input": {
    "prompt": "Slow cinematic push-in through the forest, 4K",
    "width": 720,
    "height": 1280,
    "comfyui_workflow_name": "image_qwen_t2i"
  }
}

Required Fields

prompt (string): Text prompt used by the selected workflow

Optional Fields

image (string): Base64-encoded input image (with or without data URI prefix). Required only for workflows that include {{ INPUT_IMAGE }} (e.g. Wan 2.2 I2V).
width (int): Output width in pixels (default: 480)
height (int): Output height in pixels (default: 640)
length (int): Number of frames to generate (default: 81). Used by video workflows.
comfyui_workflow_name (string): Workflow template key (default: video_wan2_2_14B_i2v). For Qwen T2I use image_qwen_t2i.

The handler will:

Load the selected workflow template (comfyui_workflow_name)
If the workflow contains {{ INPUT_IMAGE }}, upload the provided image to ComfyUI and inject the uploaded filename
Inject the prompt into the appropriate placeholder (e.g. {{ VIDEO_PROMPT }}, {{ POSITIVE_PROMPT }}, or {{ IMAGE_PROMPT }} depending on the workflow)
Inject width/height (and length for video workflows) into the workflow
Queue the workflow for processing

Custom Nodes Included

ComfyUI-GGUF (GGUF model support)
ComfyUI-Frame-Interpolation (Video interpolation)
ComfyUI-VideoHelperSuite (Video processing utilities)
ComfyUI_ExtraModels (Additional model formats)
ComfyUI-Unload-Model (Memory management)
ComfyUI-Easy-Use (Utility nodes)

Models Included

Stable Diffusion: v1-5-pruned-emaonly-fp16.safetensors
Wan 2.2: I2V models (High/Low noise GGUF)
VAEs: wan_2.1_vae.safetensors, wan2.2_vae.safetensors, qwen_image_vae.safetensors
Text Encoders: umt5-xxl-encoder-Q5_K_M.gguf, qwen_2.5_vl_7b_fp8_scaled.safetensors
LoRAs: Wan2.2 Lightning LoRAs
Upscalers: RealESRGAN_x2plus.pth
Frame Interpolation: rife47.pth

Adding Custom Models/Nodes

Modify Dockerfile.runpod.serverless:

# Add custom nodes
RUN comfy-node-install https://github.com/your-repo/custom-node

# Download models
RUN wget -O models/checkpoints/your_model.safetensors https://url-to-model

Troubleshooting

Common Issues

ComfyUI fails to start
- Check logs for model loading errors
- Verify GPU memory is sufficient
- Ensure all models downloaded successfully
Handler not responding
- Verify ComfyUI is accessible: curl http://127.0.0.1:8188/system_stats
- Check websocket connection: ws://127.0.0.1:8188/ws
S3 upload failures
- Verify AWS credentials and permissions
- Check bucket name and region
- Ensure endpoint URL is correct
Video workflows stuck IN_PROGRESS even though the mp4 exists in /comfyui/output
- ComfyUI history is keyed by the ComfyUI prompt_id (returned from POST /prompt), not the RunPod job id.
- To inspect history, query:
  - curl http://127.0.0.1:8188/history
  - curl http://127.0.0.1:8188/history/<prompt_id>
- With ComfyUI-VideoHelperSuite, final mp4 outputs may appear under outputs.<node>.gifs (even though the file is an .mp4). Older worker versions that only looked for images/videos in history could miss the mp4 and keep polling for “final assets”.

Debug Mode

Enable detailed logging:

# Set environment variable
COMFY_LOG_LEVEL=DEBUG

# Or modify Dockerfile
ENV COMFY_LOG_LEVEL=DEBUG

File Structure

runpod-serverless/
├── src/
│   ├── handler.py              # Enhanced RunPod handler with validation
│   └── start.sh                # Production startup script
├── Dockerfile.runpod.serverless # Docker build configuration
├── requirements.txt            # Python dependencies
├── test_input.json            # Example workflow for testing
├── save_base64_image.py        # Helper: decode base64 output image from JSON and save locally
└── README.md                  # This file

License

This project extends the RunPod worker-comfyui base image. Please refer to the respective licenses of the underlying components.

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
.windsurf		.windsurf
docs		docs
schemas		schemas
scripts		scripts
src		src
tests		tests
workflows		workflows
.gitattributes		.gitattributes
.gitignore		.gitignore
Dockerfile.app		Dockerfile.app
Dockerfile.base		Dockerfile.base
Dockerfile.custom		Dockerfile.custom
README.md		README.md
UPGRADE_CUDA13.md		UPGRADE_CUDA13.md
build_custom.sh		build_custom.sh
build_runpod_pod.md		build_runpod_pod.md
docker-commands.md		docker-commands.md
models-to-download.md		models-to-download.md
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

RunPod Serverless Worker for ComfyUI

Architecture

Key Features

Quick Start

1. Build the Docker Image

Build (base/app split)

Build + push

Legacy single Dockerfile build

2. Push to Docker Registry

3. Deploy to RunPod

Models via Network Volume

API Usage

Submit Workflow (Synchronous)

Submit Workflow (Asynchronous)

Response Format

Configuration

Environment Variables

S3 Configuration (Optional)

Input Format

Workflow: Wan 2.2 Image-to-Video (I2V)

Workflow: Qwen Text-to-Image (T2I)

Required Fields

Optional Fields

Custom Nodes Included

Models Included

Adding Custom Models/Nodes

Troubleshooting

Common Issues

Debug Mode

File Structure

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages