Install ComfyUI on Docker: Stable Diffusion via Visual Workflows

Table of Contents

The Messy Reality of Local AI Environments

Running Stable Diffusion locally often starts as a fun project but quickly turns into a DevOps nightmare. Most tutorials suggest a direct installation: clone a repo, create a virtual environment, and run a shell script. While this works for the first twenty minutes, it eventually breaks. You install a new Custom Node for ComfyUI, and suddenly your Python dependencies are conflicted, your CUDA version is mismatched, or a global library update wipes out your torch configuration.

I spent weeks troubleshooting environments every time I wanted to try a new model or workflow. Sharing a workflow with a colleague was nearly impossible because their local machine configuration never matched mine. The friction of maintaining a Python environment specifically for generative AI becomes a significant bottleneck when you want to focus on creativity or production-grade automation.

The Root Cause: Dependency Hell and GPU Bridging

The core issue isn’t ComfyUI itself; it’s the underlying stack. Stable Diffusion relies on a specific intersection of NVIDIA drivers, CUDA toolkits, PyTorch versions, and dozens of Python packages. ComfyUI’s greatest strength—its modularity via custom nodes—is also its architectural weakness for local installs. Each node might require different versions of libraries like insightface or opencv.

When you run this directly on your OS, you are tethering your AI tools to your system’s global state. If you update your system drivers or Python version, you risk breaking your entire image generation pipeline. Furthermore, mapping a physical GPU to a software layer requires precise configuration that standard virtual environments don’t fully isolate.

The Solution: Containerizing ComfyUI

To solve this, we move the entire stack into Docker. By using the NVIDIA Container Toolkit, we can pass the GPU’s power into an isolated container while keeping the host system clean. This approach ensures that the environment inside the container is identical every time it boots, regardless of what happens on your main OS.

I have applied this approach in production and the results have been consistently stable. It allows for easy backups, quick migrations between servers, and a “reset” button if a custom node installation goes sideways.

Step 1: Preparing the Host System

Before touching Docker, your host machine needs the correct NVIDIA drivers. You don’t need the full CUDA toolkit installed on the host (the container handles that), but the drivers must be present. On a Linux system (Ubuntu/Debian), ensure your drivers are up to date:

nvidia-smi

If that command returns your GPU details, you’re ready. Next, install the NVIDIA Container Toolkit. This is the bridge that allows Docker to “see” your hardware:

curl -fsSL https://nvidia.github.io/libnvidia-container/gpgkey | sudo gpg --dearmor -o /usr/share/keyrings/nvidia-container-toolkit-keyring.gpg \
  && curl -s -L https://nvidia.github.io/libnvidia-container/stable/deb/nvidia-container-toolkit.list | \
    sed 's#deb https://#deb [signed-by=/usr/share/keyrings/nvidia-container-toolkit-keyring.gpg] https://#g' | \
    sudo tee /etc/apt/sources.list.d/nvidia-container-toolkit.list

sudo apt-get update
sudo apt-get install -y nvidia-container-toolkit
sudo systemctl restart docker

Step 2: Designing the Docker Compose Configuration

Instead of running a long, messy docker run command, we use Docker Compose. This allows us to define our volumes (where models and outputs live) and environment variables in a clean YAML file. Create a directory named comfyui-docker and add a docker-compose.yml file:

services:
  comfyui:
    image: yanagiwara/comfyui-docker:latest-cuda12.1 # Or a similar reputable image
    container_name: comfyui
    runtime: nvidia
    environment:
      - NVIDIA_VISIBLE_DEVICES=all
    ports:
      - "8188:8188"
    volumes:
      - ./storage/models:/home/user/ComfyUI/models
      - ./storage/output:/home/user/ComfyUI/output
      - ./storage/input:/home/user/ComfyUI/input
      - ./storage/custom_nodes:/home/user/ComfyUI/custom_nodes
    deploy:
      resources:
        reservations:
          devices:
            - driver: nvidia
              count: all
              capabilities: [gpu]
    restart: unless-stopped

This configuration maps local folders on your hard drive to the internal ComfyUI folders. This is critical: you don’t want your 5GB models stored inside the container’s ephemeral storage. If the container is deleted, your models and generated images remain safe in your ./storage directory.

Step 3: Launching and Initial Setup

Run the stack with a single command:

docker compose up -d

Once the image downloads and starts, open your browser and navigate to http://localhost:8188. You’ll see the default ComfyUI graph. However, it won’t work yet because you haven’t added a checkpoint model.

Move your .safetensors files (like SDXL or SD1.5) into the ./storage/models/checkpoints folder. Refresh the UI, and the model will appear in the “Load Checkpoint” node. By keeping the custom_nodes folder as a mapped volume, you can still use the ComfyUI Manager to install nodes, and they will persist even if you update the Docker image later.

Managing State and Custom Nodes

One common hurdle with Docker is handling Python requirements for new nodes. Most modern ComfyUI Docker images are set up to auto-install requirements found in a node’s requirements.txt upon startup. If you install a complex node that fails to load, simply restart the container:

docker compose restart comfyui

The logs will show you exactly which library is missing, allowing you to debug without polluting your host machine’s OS. This isolation is my favorite part of the workflow. If a specific node version breaks everything, I can just delete that specific folder in custom_nodes and I’m back to a clean state.

Final Thoughts

Moving ComfyUI to Docker transforms it from a brittle script into a reliable piece of infrastructure. You gain the ability to move your entire AI workspace between a local workstation and a cloud GPU instance by simply syncing the storage folder and the docker-compose.yml file. It removes the “it works on my machine” excuse and lets you focus on building complex, node-based workflows for Stable Diffusion without worrying about the next pip install breaking your day.