<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>Token on 黄文卓 | DevOps Engineer</title>
    <link>https://socake.github.io/tags/token/</link>
    <description>Recent content in Token on 黄文卓 | DevOps Engineer</description>
    <generator>Hugo -- gohugo.io</generator>
    <language>zh-CN</language>
    <managingEditor>17691281867@163.com (Wenzhuo Huang)</managingEditor>
    <webMaster>17691281867@163.com (Wenzhuo Huang)</webMaster>
    <copyright>© 2026 Wenzhuo Huang</copyright>
    <lastBuildDate>Mon, 19 Jan 2026 13:03:00 +0800</lastBuildDate><atom:link href="https://socake.github.io/tags/token/index.xml" rel="self" type="application/rss+xml" />
    
    <item>
      <title>LLM 成本优化实战：从 Token 预算到模型路由</title>
      <link>https://socake.github.io/posts/llm-cost-optimization/</link>
      <pubDate>Mon, 19 Jan 2026 13:03:00 +0800</pubDate>
      <author>17691281867@163.com (Wenzhuo Huang)</author>
      <guid>https://socake.github.io/posts/llm-cost-optimization/</guid>
      <description>我们的 AI 功能上线第一个月，LLM API 账单是 $18,000。通过模型路由、Prompt Caching 和 Batch API，第三个月降到了 $3,200。这篇文章记录具体怎么做到的。</description>
      <media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://socake.github.io/posts/llm-cost-optimization/featured.jpg" />
    </item>
    
    <item>
      <title>大模型核心概念：工程师需要理解的 LLM 基础</title>
      <link>https://socake.github.io/posts/llm-core-concepts/</link>
      <pubDate>Mon, 17 Nov 2025 11:37:00 +0800</pubDate>
      <author>17691281867@163.com (Wenzhuo Huang)</author>
      <guid>https://socake.github.io/posts/llm-core-concepts/</guid>
      <description>同事第一次用 GPT-4 API 写代码时问我：为什么我发了一段中文，token 消耗比英文多那么多？为什么模型有时候会一本正经地胡说八道？这篇文章把我认为工程师必须理解的 LLM 概念系统整理了一遍，不涉及 Transformer 数学，只讲对你写代码有帮助的部分。</description>
      <media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://socake.github.io/posts/llm-core-concepts/featured.jpg" />
    </item>
    
  </channel>
</rss>
