Apr 21, 2026 · 3 min read · ai-agents Ollama on Kubernetes: Recreate Strategy and Single-GPU Deadlock Deploying Ollama on Kubernetes can lead to GPU deadlocks. Here's how to avoid them. ollamakubernetesgpu-deadlockrecreate-strategynvidia-runtimepvc-sizing