<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>Grafana on 黄文卓 | DevOps Engineer</title>
    <link>https://socake.github.io/tags/grafana/</link>
    <description>Recent content in Grafana on 黄文卓 | DevOps Engineer</description>
    <generator>Hugo -- gohugo.io</generator>
    <language>zh-CN</language>
    <managingEditor>17691281867@163.com (Wenzhuo Huang)</managingEditor>
    <webMaster>17691281867@163.com (Wenzhuo Huang)</webMaster>
    <copyright>© 2026 Wenzhuo Huang</copyright>
    <lastBuildDate>Sun, 12 Apr 2026 14:00:00 +0800</lastBuildDate><atom:link href="https://socake.github.io/tags/grafana/index.xml" rel="self" type="application/rss+xml" />
    
    <item>
      <title>OpenCost 实战：Kubernetes 成本可见性与多团队费用分摊</title>
      <link>https://socake.github.io/posts/opencost-kubernetes-cost-visibility/</link>
      <pubDate>Sun, 12 Apr 2026 14:00:00 +0800</pubDate>
      <author>17691281867@163.com (Wenzhuo Huang)</author>
      <guid>https://socake.github.io/posts/opencost-kubernetes-cost-visibility/</guid>
      <description>Kubernetes 成本不透明是 FinOps 落地的最大障碍。本文通过 OpenCost 构建完整的成本可见性体系，涵盖部署集成、云厂商价格接入、按团队分摊、Grafana 看板、超预算告警和自动周报推送，提供可直接复用的配置。</description>
      <media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://socake.github.io/posts/opencost-kubernetes-cost-visibility/featured.jpg" />
    </item>
    
    <item>
      <title>告警带图实战：Grafana Render &#43; 钉钉推送趋势图</title>
      <link>https://socake.github.io/posts/prometheus-alert-with-image/</link>
      <pubDate>Tue, 23 Dec 2025 09:54:00 +0800</pubDate>
      <author>17691281867@163.com (Wenzhuo Huang)</author>
      <guid>https://socake.github.io/posts/prometheus-alert-with-image/</guid>
      <description>收到告警只有一行数字，还要登录 Grafana 才能看趋势图——这是告警体验最大的痛点之一。本文介绍如何将 Grafana Image Renderer 与 Alertmanager Webhook 结合，实现告警消息自动附带趋势图的完整方案。</description>
      <media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://socake.github.io/posts/prometheus-alert-with-image/featured.jpg" />
    </item>
    
    <item>
      <title>Prometheus &#43; Grafana &#43; Loki 可观测性体系建设</title>
      <link>https://socake.github.io/docs/kubernetes/%E5%8F%AF%E8%A7%82%E6%B5%8B%E6%80%A7%E5%BB%BA%E8%AE%BE/</link>
      <pubDate>Mon, 08 Dec 2025 15:00:00 +0800</pubDate>
      <author>17691281867@163.com (Wenzhuo Huang)</author>
      <guid>https://socake.github.io/docs/kubernetes/%E5%8F%AF%E8%A7%82%E6%B5%8B%E6%80%A7%E5%BB%BA%E8%AE%BE/</guid>
      <description>记录在多套 K8s 集群上建立统一可观测性平台的实践经验，包含 Prometheus 采集配置、告警规则设计、Grafana Dashboard 组织方式，以及跨集群日志聚合的 Loki 部署方案。</description>
      
    </item>
    
    <item>
      <title>k6 压测实战：从脚本编写到性能分析</title>
      <link>https://socake.github.io/posts/k6-load-testing-practice/</link>
      <pubDate>Tue, 21 Oct 2025 12:44:00 +0800</pubDate>
      <author>17691281867@163.com (Wenzhuo Huang)</author>
      <guid>https://socake.github.io/posts/k6-load-testing-practice/</guid>
      <description>压测不是跑一个脚本看能不能撑住，而是通过有设计的负载模型暴露系统瓶颈。本文记录了我用 k6 做生产级性能测试的完整实践：脚本设计、阈值配置、与 Grafana 集成，以及几个典型性能问题的定位过程。</description>
      <media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://socake.github.io/posts/k6-load-testing-practice/featured.jpg" />
    </item>
    
    <item>
      <title>ELK 集群监控：用 Prometheus &#43; Grafana 监控 Elasticsearch 健康</title>
      <link>https://socake.github.io/posts/elk-prometheus-monitoring/</link>
      <pubDate>Wed, 08 Oct 2025 11:33:00 +0800</pubDate>
      <author>17691281867@163.com (Wenzhuo Huang)</author>
      <guid>https://socake.github.io/posts/elk-prometheus-monitoring/</guid>
      <description>Kibana 内置的 Stack Monitoring 免费功能有限，告警媒介也受商业授权约束。我们最终选择 Prometheus + Grafana 方案监控 ELK 集群，这篇文章记录完整的落地过程和踩坑。</description>
      <media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://socake.github.io/posts/elk-prometheus-monitoring/featured.jpg" />
    </item>
    
    <item>
      <title>SLO/SLI/Error Budget 从理论到落地：SRE 可靠性工程实战</title>
      <link>https://socake.github.io/posts/slo-sli-error-budget-practice/</link>
      <pubDate>Fri, 01 Aug 2025 13:37:00 +0800</pubDate>
      <author>17691281867@163.com (Wenzhuo Huang)</author>
      <guid>https://socake.github.io/posts/slo-sli-error-budget-practice/</guid>
      <description>从 SLI 指标选取到 Error Budget 消耗速率告警，系统讲解 SRE 可靠性工程体系的落地实践，包括 Prometheus recording rules 计算 SLI、多窗口 burn rate 告警规则配置、SLO 违规复盘流程，以及与开发团队的协作策略。</description>
      <media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://socake.github.io/posts/slo-sli-error-budget-practice/featured.jpg" />
    </item>
    
    <item>
      <title>OpenTelemetry 落地实践：统一采集 Traces、Metrics、Logs</title>
      <link>https://socake.github.io/posts/opentelemetry-practice/</link>
      <pubDate>Sun, 20 Jul 2025 11:41:00 +0800</pubDate>
      <author>17691281867@163.com (Wenzhuo Huang)</author>
      <guid>https://socake.github.io/posts/opentelemetry-practice/</guid>
      <description>从为什么选 OpenTelemetry 讲起，给出 DaemonSet + Gateway 的 Collector 部署架构、关键配置和实际踩坑记录。</description>
      <media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://socake.github.io/posts/opentelemetry-practice/featured.jpg" />
    </item>
    
    <item>
      <title>可观测性三支柱实战：Metrics/Logs/Traces 联动</title>
      <link>https://socake.github.io/posts/observability-three-pillars/</link>
      <pubDate>Mon, 14 Jul 2025 09:52:00 +0800</pubDate>
      <author>17691281867@163.com (Wenzhuo Huang)</author>
      <guid>https://socake.github.io/posts/observability-three-pillars/</guid>
      <description>监控告诉你系统挂了，可观测性告诉你为什么挂。本文从三支柱的核心差异出发，讲透 Prometheus+Loki+Tempo 的联动排障流程，覆盖 OpenTelemetry 采集标准、Exemplar 原理与配置，以及可观测性建设的优先级策略。</description>
      <media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://socake.github.io/posts/observability-three-pillars/featured.jpg" />
    </item>
    
    <item>
      <title>Grafana API 自动化：用代码管理 Dashboard、数据源和告警</title>
      <link>https://socake.github.io/posts/grafana-api-automation/</link>
      <pubDate>Tue, 18 Mar 2025 11:26:00 +0800</pubDate>
      <author>17691281867@163.com (Wenzhuo Huang)</author>
      <guid>https://socake.github.io/posts/grafana-api-automation/</guid>
      <description>手动点 UI 管理 Grafana Dashboard 在多环境场景下是噩梦。用 API 把 Dashboard 代码化，实现版本控制和环境同步，才是正确姿势。本文提供完整的 Python 工具脚本和实战踩坑。</description>
      <media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://socake.github.io/posts/grafana-api-automation/featured.jpg" />
    </item>
    
    <item>
      <title>可观测性建设：从 Prometheus 采集到 Grafana 告警联动</title>
      <link>https://socake.github.io/posts/prometheus-grafana/</link>
      <pubDate>Fri, 06 Dec 2024 09:30:00 +0800</pubDate>
      <author>17691281867@163.com (Wenzhuo Huang)</author>
      <guid>https://socake.github.io/posts/prometheus-grafana/</guid>
      <description>可观测性不是装几个监控工具，而是让系统在出问题时能快速定位根因。这篇文章从采集架构到 PromQL 到告警路由，覆盖我们在生产环境中实际遇到的 cardinality 爆炸、告警噪音等问题。</description>
      <media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://socake.github.io/posts/prometheus-grafana/featured.jpg" />
    </item>
    
  </channel>
</rss>
