Ollama on 黄文卓 | DevOps Engineer

Ollama on 黄文卓 | DevOps Engineer https://socake.github.io/tags/ollama/ Recent content in Ollama on 黄文卓 | DevOps Engineer Hugo -- gohugo.io zh-CN 17691281867@163.com (Wenzhuo Huang) 17691281867@163.com (Wenzhuo Huang) © 2026 Wenzhuo Huang Mon, 30 Mar 2026 09:08:00 +0800 Ollama 在 K8s 上跑大模型：本地 LLM 的运维实践 https://socake.github.io/posts/ollama-kubernetes-llm/ Mon, 30 Mar 2026 09:08:00 +0800 17691281867@163.com (Wenzhuo Huang) https://socake.github.io/posts/ollama-kubernetes-llm/ 在 Kubernetes 上部署 Ollama 运行本地大模型，从 GPU 调度到 CPU 推理降级，再到运维场景的实际集成，记录完整的踩坑与实践过程。