Docker Model Runner
-
Dec 16, 2025 (Docker Captain)
Develop and deploy voice AI apps using Docker
Build real-time voice agents with Docker. Use EchoKit, Model Runner, and the MCP Toolkit to run ASR/LLM/TTS locally or in the cloud.
-
Dec 16, 2025
Docker Model Runner now included with the Universal Blue family
Docker Model Runner now ships with Universal Blue (Aurora, Bluefin), delivering an out-of-the-box, GPU-ready AI development environment.
-
Dec 11, 2025
Docker Model Runner now supports vLLM on Windows
Run vLLM with GPU acceleration on Windows using Docker Model Runner and WSL2. Fast AI inference is here.
-
Dec 5, 2025
Announcing vLLM v0.12.0, Ministral 3 and DeepSeek-V3.2 for Docker Model Runner
Run Ministral 3 and DeepSeek-V3.2 on Docker Model Runner with vLLM 0.12. Test-drive the latest open-weights models as soon as they’re released.
-
Dec 1, 2025
Run Embedding Models and Unlock Semantic Search with Docker Model Runner
In this guide, we’ll cover how to use embedding models for semantic search and how to run them with Docker Model Runner.
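For a taste of that guide, here is a minimal semantic-search sketch against Model Runner's OpenAI-compatible /embeddings endpoint. The base URL (host TCP access on port 12434) and the model tag are assumptions for illustration; swap in whatever `docker model list` shows on your machine.

```python
# Semantic search sketch against Docker Model Runner's OpenAI-compatible
# /embeddings endpoint. The endpoint URL and model tag are assumptions;
# adjust them to your local setup.
import math
import requests

BASE_URL = "http://localhost:12434/engines/v1"  # assumed host-side endpoint
MODEL = "ai/mxbai-embed-large"                  # placeholder embedding model tag

def embed(texts):
    """Return one embedding vector per input text."""
    resp = requests.post(
        f"{BASE_URL}/embeddings",
        json={"model": MODEL, "input": texts},
        timeout=60,
    )
    resp.raise_for_status()
    return [item["embedding"] for item in resp.json()["data"]]

def cosine(a, b):
    """Cosine similarity between two vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

docs = [
    "Docker Model Runner serves models through an OpenAI-compatible API.",
    "Compose files describe multi-container applications.",
    "Embeddings map text to vectors so similar meanings land close together.",
]
query = "How do I do semantic search with embeddings?"

doc_vecs = embed(docs)
query_vec = embed([query])[0]

# Rank documents by similarity to the query and print the best matches first.
scores = [(cosine(query_vec, vec), doc) for vec, doc in zip(doc_vecs, docs)]
for score, doc in sorted(scores, reverse=True):
    print(f"{score:.3f}  {doc}")
```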
-
Nov 20, 2025
Docker Model Runner Integrates vLLM for High-Throughput Inference
New: vLLM in Docker Model Runner brings high-throughput inference for safetensors models, with automatic engine routing on NVIDIA GPUs.
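The client side of that integration stays the same no matter which engine serves the request: Model Runner speaks the OpenAI API shape, so a standard OpenAI client can simply point at it. A hedged sketch, where the base URL, the placeholder model tag, and the unchecked API key are assumptions:

```python
# Drop-in use of the standard OpenAI Python client against Model Runner's
# OpenAI-compatible server. Base URL, API key value, and model tag are
# assumptions for illustration.
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:12434/engines/v1",  # assumed host-side endpoint
    api_key="not-needed",                          # assumed: local server ignores the key
)

resp = client.chat.completions.create(
    model="ai/llama3.2",  # placeholder tag; use a model you have pulled
    messages=[
        {"role": "system", "content": "You are a concise assistant."},
        {"role": "user", "content": "In one sentence, what does vLLM optimize for?"},
    ],
)
print(resp.choices[0].message.content)
```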
-
Nov 18, 2025
Docker + Unsloth: Build Custom Models, Faster
Building and running custom models is still hard. Even as open-source LLMs grow more capable, actually getting them to run on your machine, with the right dependencies, remains slow, fragile, and inconsistent. There are two sides to this challenge: model creation and optimization (making fine-tuning and quantization efficient), and model…
-
Nov 3, 2025
How to Use Multimodal AI Models With Docker Model Runner
Run multimodal AI models that understand text, images, and audio with Docker Model Runner. Explore CLI and API examples, run Hugging Face models, and try a real-time webcam vision demo.
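As a rough companion to that post, this is what an image-plus-text request can look like through the OpenAI-compatible chat endpoint. The endpoint, the vision-capable model tag, the file path, and the assumption that the server accepts the standard data-URL message shape for multimodal models are all illustrative, not confirmed specifics:

```python
# Send an image plus a text prompt to a multimodal model through the
# OpenAI-compatible chat endpoint. Endpoint, model tag, and file path are
# assumptions for illustration.
import base64
import requests

BASE_URL = "http://localhost:12434/engines/v1"  # assumed host-side endpoint
MODEL = "ai/gemma3"                             # placeholder vision-capable model tag
IMAGE_PATH = "photo.jpg"                        # any local image

with open(IMAGE_PATH, "rb") as f:
    image_b64 = base64.b64encode(f.read()).decode("utf-8")

resp = requests.post(
    f"{BASE_URL}/chat/completions",
    json={
        "model": MODEL,
        "messages": [
            {
                "role": "user",
                "content": [
                    {"type": "text", "text": "Describe what is in this image."},
                    {
                        "type": "image_url",
                        "image_url": {"url": f"data:image/jpeg;base64,{image_b64}"},
                    },
                ],
            }
        ],
        "max_tokens": 200,
    },
    timeout=300,
)
resp.raise_for_status()
print(resp.json()["choices"][0]["message"]["content"])
```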