Dorin Geman
Sr Software Engineer, Docker
More by Dorin
Run and Iterate on LLMs Faster with Docker Model Runner on DGX Station
Docker Model Runner now supports NVIDIA DGX Station GB300. Run bigger models with familiar Docker commands and effortless setup.
Read now
Docker Model Runner Brings vLLM to macOS with Apple Silicon
Run vLLM on your Mac with Docker Model Runner. The vllm-metal backend enables high-performance LLM inference on Apple Silicon with Metal GPU acceleration.
Read now
Run Claude Code Locally with Docker Model Runner
Get Claude Code working with Docker Model Runner—free, on-device, and private. Your cloud bill stays at $0.
Read now
Docker Model Runner now supports vLLM on Windows
Run vLLM with GPU acceleration on Windows using Docker Model Runner and WSL2. Fast AI inference is here.
Read now
Docker Model Runner Integrates vLLM for High-Throughput Inference
New: vLLM in Docker Model Runner. High-throughput inference for safetensors models with auto engine routing for NVIDIA GPUs using Docker.
Read now
Run and Iterate on LLMs Faster with Docker Model Runner on DGX Station
Docker Model Runner now supports NVIDIA DGX Station GB300. Run bigger models with familiar Docker commands and effortless setup.
Read now
Docker Model Runner Brings vLLM to macOS with Apple Silicon
Run vLLM on your Mac with Docker Model Runner. The vllm-metal backend enables high-performance LLM inference on Apple Silicon with Metal GPU acceleration.
Read now
Run Claude Code Locally with Docker Model Runner
Get Claude Code working with Docker Model Runner—free, on-device, and private. Your cloud bill stays at $0.
Read now
Docker Model Runner now supports vLLM on Windows
Run vLLM with GPU acceleration on Windows using Docker Model Runner and WSL2. Fast AI inference is here.
Read now
Docker Model Runner Integrates vLLM for High-Throughput Inference
New: vLLM in Docker Model Runner. High-throughput inference for safetensors models with auto engine routing for NVIDIA GPUs using Docker.
Read now