<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Quality Evaluation :: Tags :: x7peeps</title><link>https://x7peeps.com/tags/%E8%B4%A8%E9%87%8F%E8%AF%84%E4%BC%B0/index.html</link><description/><generator>Hugo</generator><language>zh-CN</language><lastBuildDate>Fri, 03 Jul 2026 07:21:56 +0000</lastBuildDate><atom:link href="https://x7peeps.com/tags/%E8%B4%A8%E9%87%8F%E8%AF%84%E4%BC%B0/index.xml" rel="self" type="application/rss+xml"/><item><title>LLM-as-Judge: Principles, Bias Analysis, and Practical Configuration</title><link>https://x7peeps.com/AI/05-Agent%E8%AF%84%E6%B5%8B%E4%B8%8E%E8%B4%A8%E9%87%8F%E4%BF%9D%E9%9A%9C/LLM-as-Judge%E5%8E%9F%E7%90%86%E5%81%8F%E5%B7%AE%E5%88%86%E6%9E%90%E4%B8%8E%E5%AE%9E%E6%88%98%E9%85%8D%E7%BD%AE/index.html</link><pubDate>Fri, 03 Jul 2026 07:21:56 +0000</pubDate><guid>https://x7peeps.com/AI/05-Agent%E8%AF%84%E6%B5%8B%E4%B8%8E%E8%B4%A8%E9%87%8F%E4%BF%9D%E9%9A%9C/LLM-as-Judge%E5%8E%9F%E7%90%86%E5%81%8F%E5%B7%AE%E5%88%86%E6%9E%90%E4%B8%8E%E5%AE%9E%E6%88%98%E9%85%8D%E7%BD%AE/index.html</guid><description>How LLM-as-Judge Works: LLM-as-Judge is a paradigm that uses a large language model as an automated evaluator: the output of one LLM is handed to another (usually stronger) LLM, which judges its quality. This approach is rapidly replacing traditional human evaluation and rule-based matching, and is becoming a core means of quality assurance for Agent systems.</description></item></channel></rss>