<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>AI Engineering on Wenzhuo Huang | DevOps Engineer</title>
    <link>https://socake.github.io/categories/ai-%E5%B7%A5%E7%A8%8B/</link>
    <description>Recent content in AI Engineering on Wenzhuo Huang | DevOps Engineer</description>
    <generator>Hugo -- gohugo.io</generator>
    <language>zh-CN</language>
    <managingEditor>17691281867@163.com (Wenzhuo Huang)</managingEditor>
    <webMaster>17691281867@163.com (Wenzhuo Huang)</webMaster>
    <copyright>© 2026 Wenzhuo Huang</copyright>
    <lastBuildDate>Mon, 06 Apr 2026 11:30:00 +0800</lastBuildDate>
    <atom:link href="https://socake.github.io/categories/ai-%E5%B7%A5%E7%A8%8B/index.xml" rel="self" type="application/rss+xml" />
    
    <item>
      <title>AutoGen Multi-Agent Collaboration in Practice: From Group Chat to Production</title>
      <link>https://socake.github.io/posts/autogen-multi-agent-practice/</link>
      <pubDate>Mon, 06 Apr 2026 11:30:00 +0800</pubDate>
      <author>17691281867@163.com (Wenzhuo Huang)</author>
      <guid>https://socake.github.io/posts/autogen-multi-agent-practice/</guid>
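      <description>AutoGen moves multi-agent collaboration from toy demos toward production. This post explains its core abstractions (Conversable Agent, Group Chat, tool calling) and what has to be handled on the way from demo to production.</description>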
      <media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://socake.github.io/posts/autogen-multi-agent-practice/featured.jpg" />
    </item>
    
    <item>
      <title>LiteLLM Gateway in Practice: Unified Multi-Model Access, Rate Limiting, Cost Tracking, and Failover</title>
      <link>https://socake.github.io/posts/litellm-gateway-proxy/</link>
      <pubDate>Thu, 02 Apr 2026 14:00:00 +0800</pubDate>
      <author>17691281867@163.com (Wenzhuo Huang)</author>
      <guid>https://socake.github.io/posts/litellm-gateway-proxy/</guid>
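      <description>LiteLLM is the de facto standard for multi-model LLM access. This post covers its Proxy-mode deployment, Model Config, Virtual Key, Router Fallback, cost tracking, and pitfalls encountered in practice.</description>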
      <media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://socake.github.io/posts/litellm-gateway-proxy/featured.jpg" />
    </item>
    
    <item>
      <title>Unsloth Efficient Fine-Tuning in Practice: Peak Single-GPU QLoRA Performance and Internals</title>
      <link>https://socake.github.io/posts/unsloth-efficient-finetuning/</link>
      <pubDate>Sun, 22 Mar 2026 09:15:00 +0800</pubDate>
      <author>17691281867@163.com (Wenzhuo Huang)</author>
      <guid>https://socake.github.io/posts/unsloth-efficient-finetuning/</guid>
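      <description>Unsloth uses hand-written Triton kernels to push single-GPU LoRA fine-tuning speed and memory usage to the limit. This post explains how Unsloth works, how to combine it with LLaMA Factory/TRL, and the pitfalls of real-world use.</description>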
      <media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://socake.github.io/posts/unsloth-efficient-finetuning/featured.jpg" />
    </item>
    
    <item>
      <title>LLaMA Factory Fine-Tuning Toolchain in Practice: The Full Workflow from Data Preparation to LoRA Merging</title>
      <link>https://socake.github.io/posts/llamafactory-finetuning/</link>
      <pubDate>Wed, 18 Mar 2026 11:20:00 +0800</pubDate>
      <author>17691281867@163.com (Wenzhuo Huang)</author>
      <guid>https://socake.github.io/posts/llamafactory-finetuning/</guid>
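      <description>LLaMA Factory turns many large-model fine-tuning tricks into engineered, repeatable tooling. This post follows the rhythm of a complete project: data, SFT, LoRA, DPO, merging, evaluation, and common pitfalls.</description>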
      <media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://socake.github.io/posts/llamafactory-finetuning/featured.jpg" />
    </item>
    
  </channel>
</rss>
