<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/">
    <channel>
        <title>OldWang TechTalk</title>
        <link>https://www.illusionjourney.com/</link>
        <description>老王TechTalk</description>
        <lastBuildDate>Mon, 13 Apr 2026 14:41:05 GMT</lastBuildDate>
        <docs>https://validator.w3.org/feed/docs/rss2.html</docs>
        <generator>https://github.com/jpmonette/feed</generator>
        <language>zh-CN</language>
        <copyright>All rights reserved 2026, 老王TechTalk</copyright>
        <item>
            <title><![CDATA[LLM-白泽🐲]]></title>
            <link>https://www.illusionjourney.com/article/baize</link>
            <guid>https://www.illusionjourney.com/article/baize</guid>
            <pubDate>Fri, 30 Jun 2023 00:00:00 GMT</pubDate>
            <description><![CDATA[用chatgpt生成多轮完整对话数据，而不是QA对，从而有更好的效果。同时使用chatgpt做了一个对齐方式代替人类的对齐方式（SDF，Self-Distillation with Feedback）]]></description>
            <content:encoded><![CDATA[<div id="notion-article" class="mx-auto overflow-hidden "><main class="notion light-mode notion-page notion-block-540b4fa7e80447f992c3852892aa2e77"><div class="notion-viewport"></div><div class="notion-collection-page-properties"></div><div class="notion-callout notion-gray_background_co notion-block-86ab54b3d2f8479ca789da19a7211b4e"><div class="notion-page-icon-inline notion-page-icon-span"><span class="notion-page-icon" role="img" aria-label="😀">😀</span></div><div class="notion-callout-text">用chatgpt生成多轮完整对话数据，而不是QA对，从而有更好的效果。同时使用chatgpt做了一个对齐方式代替人类的对齐方式（SDF，Self-Distillation with Feedback）</div></div><figure class="notion-asset-wrapper notion-asset-wrapper-image notion-block-5b0cf8d896f34acea5131c4535a44f53"><div style="position:relative;display:flex;justify-content:center;align-self:center;width:100%;max-width:100%;flex-direction:column;height:100%"><img style="object-fit:cover" src="https://www.notion.so/image/https%3A%2F%2Fs3-us-west-2.amazonaws.com%2Fsecure.notion-static.com%2F672bec60-7592-4b77-9502-0af59cd0330b%2FUntitled.png?table=block&amp;id=5b0cf8d8-96f3-4ace-a513-1c4535a44f53&amp;t=5b0cf8d8-96f3-4ace-a513-1c4535a44f53&amp;width=1898&amp;cache=v2" alt="notion image" loading="lazy" decoding="async"/></div></figure><h2 class="notion-h notion-h1 notion-h-indent-0 notion-block-3fbf5f6166a7495395ab287ef82c8bf8" data-id="3fbf5f6166a7495395ab287ef82c8bf8"><span><div id="3fbf5f6166a7495395ab287ef82c8bf8" class="notion-header-anchor"></div><a class="notion-hash-link" href="#3fbf5f6166a7495395ab287ef82c8bf8" title="📝 方法"><svg viewBox="0 0 16 16" width="16" height="16"><path fill-rule="evenodd" d="M7.775 3.275a.75.75 0 001.06 1.06l1.25-1.25a2 2 0 112.83 2.83l-2.5 2.5a2 2 0 01-2.83 0 .75.75 0 00-1.06 1.06 3.5 3.5 0 004.95 0l2.5-2.5a3.5 3.5 0 00-4.95-4.95l-1.25 1.25zm-4.69 9.64a2 2 0 010-2.83l2.5-2.5a2 2 0 012.83 0 .75.75 0 001.06-1.06 3.5 3.5 0 00-4.95 0l-2.5 2.5a3.5 3.5 0 004.95 4.95l1.25-1.25a.75.75 0 00-1.06-1.06l-1.25 1.25a2 2 0 01-2.83 0z"></path></svg></a><span class="notion-h-title">📝 方法</span></span></h2><div class="notion-text notion-block-b18aa13194304ee595b622ef4257c998">使用自我对话的方式来完成一个多轮对话，具体案例参考。需要一个seed（Quora和stackoverflow问题作为seed）,和一个标准模版来帮助chatgpt产生此次对话。</div><figure class="notion-asset-wrapper notion-asset-wrapper-image notion-block-2bbbc0d0cbcc42cca5847ba1d060f6c1"><div style="position:relative;display:flex;justify-content:center;align-self:center;width:100%;max-width:100%;flex-direction:column;height:100%"><img style="object-fit:cover" src="https://www.notion.so/image/https%3A%2F%2Fs3-us-west-2.amazonaws.com%2Fsecure.notion-static.com%2F3a08ebbd-8b4a-4775-a8c1-c83d8f914915%2FUntitled.png?table=block&amp;id=2bbbc0d0-cbcc-42cc-a584-7ba1d060f6c1&amp;t=2bbbc0d0-cbcc-42cc-a584-7ba1d060f6c1&amp;width=2088&amp;cache=v2" alt="notion image" loading="lazy" decoding="async"/></div></figure><div class="notion-text notion-block-7a76695379894ecf81ce73f119758813">使用lora对所有线性层微调，使用chatgpt来蒸馏模型（让模型生成4个答案，然后用chatgpt来选一个最好的答案作为label），</div><h2 class="notion-h notion-h1 notion-h-indent-0 notion-block-33024b36e3824782bf7d378842b302f5" data-id="33024b36e3824782bf7d378842b302f5"><span><div id="33024b36e3824782bf7d378842b302f5" class="notion-header-anchor"></div><a class="notion-hash-link" href="#33024b36e3824782bf7d378842b302f5" title="🤗 可借鉴"><svg viewBox="0 0 16 16" width="16" height="16"><path fill-rule="evenodd" d="M7.775 3.275a.75.75 0 001.06 1.06l1.25-1.25a2 2 0 112.83 2.83l-2.5 2.5a2 2 0 01-2.83 0 .75.75 0 00-1.06 1.06 3.5 3.5 0 004.95 0l2.5-2.5a3.5 3.5 0 00-4.95-4.95l-1.25 1.25zm-4.69 9.64a2 2 0 010-2.83l2.5-2.5a2 2 0 012.83 0 .75.75 0 001.06-1.06 3.5 3.5 0 00-4.95 0l-2.5 2.5a3.5 3.5 0 004.95 4.95l1.25-1.25a.75.75 0 00-1.06-1.06l-1.25 1.25a2 2 0 01-2.83 0z"></path></svg></a><span class="notion-h-title">🤗 可借鉴</span></span></h2><div class="notion-text notion-block-29be1e77116c4aa18d3abd40be8ba872">模型规模：</div><div class="notion-text notion-block-b660f5b241b04fe08c4b666108314e3d">7B,13B,30B</div><div class="notion-text notion-block-85669bb9a2ed4bc2845af86fabe3fe3f">V1（base），V1.5（两个chatGPT,减少prompt对对话长度的占用）,V2（加入SDF）版本模型</div><div class="notion-text notion-block-4157a3c774314664971f250c41c09e62">参数配置：</div><div class="notion-text notion-block-46f728a8a43242cd800d5054f2cba838">lora dim = 8</div><h2 class="notion-h notion-h1 notion-h-indent-0 notion-block-b1aff477a55f4e70a1569a20bb124ba9" data-id="b1aff477a55f4e70a1569a20bb124ba9"><span><div id="b1aff477a55f4e70a1569a20bb124ba9" class="notion-header-anchor"></div><a class="notion-hash-link" href="#b1aff477a55f4e70a1569a20bb124ba9" title="🧬 效果"><svg viewBox="0 0 16 16" width="16" height="16"><path fill-rule="evenodd" d="M7.775 3.275a.75.75 0 001.06 1.06l1.25-1.25a2 2 0 112.83 2.83l-2.5 2.5a2 2 0 01-2.83 0 .75.75 0 00-1.06 1.06 3.5 3.5 0 004.95 0l2.5-2.5a3.5 3.5 0 00-4.95-4.95l-1.25 1.25zm-4.69 9.64a2 2 0 010-2.83l2.5-2.5a2 2 0 012.83 0 .75.75 0 001.06-1.06 3.5 3.5 0 00-4.95 0l-2.5 2.5a3.5 3.5 0 004.95 4.95l1.25-1.25a.75.75 0 00-1.06-1.06l-1.25 1.25a2 2 0 01-2.83 0z"></path></svg></a><span class="notion-h-title">🧬 效果</span></span></h2><figure class="notion-asset-wrapper notion-asset-wrapper-image notion-block-8bfb40aed02a4222ab531ccc22e17b9f"><div style="position:relative;display:flex;justify-content:center;align-self:center;width:100%;max-width:100%;flex-direction:column;height:100%"><img style="object-fit:cover" src="https://www.notion.so/image/https%3A%2F%2Fs3-us-west-2.amazonaws.com%2Fsecure.notion-static.com%2F0da0e995-9f90-4a54-a378-a0defa0067da%2FUntitled.png?table=block&amp;id=8bfb40ae-d02a-4222-ab53-1ccc22e17b9f&amp;t=8bfb40ae-d02a-4222-ab53-1ccc22e17b9f&amp;width=968&amp;cache=v2" alt="notion image" loading="lazy" decoding="async"/></div></figure><h2 class="notion-h notion-h1 notion-h-indent-0 notion-block-000b8cf2662c4429af1e1f44b697e162" data-id="000b8cf2662c4429af1e1f44b697e162"><span><div id="000b8cf2662c4429af1e1f44b697e162" class="notion-header-anchor"></div><a class="notion-hash-link" href="#000b8cf2662c4429af1e1f44b697e162" title="📎 核心代码"><svg viewBox="0 0 16 16" width="16" height="16"><path fill-rule="evenodd" d="M7.775 3.275a.75.75 0 001.06 1.06l1.25-1.25a2 2 0 112.83 2.83l-2.5 2.5a2 2 0 01-2.83 0 .75.75 0 00-1.06 1.06 3.5 3.5 0 004.95 0l2.5-2.5a3.5 3.5 0 00-4.95-4.95l-1.25 1.25zm-4.69 9.64a2 2 0 010-2.83l2.5-2.5a2 2 0 012.83 0 .75.75 0 001.06-1.06 3.5 3.5 0 00-4.95 0l-2.5 2.5a3.5 3.5 0 004.95 4.95l1.25-1.25a.75.75 0 00-1.06-1.06l-1.25 1.25a2 2 0 01-2.83 0z"></path></svg></a><span class="notion-h-title">📎 核心代码</span></span></h2><h4 class="notion-h notion-h3 notion-h-indent-1 notion-block-fde6187d06a948c492406a208d30ea03" data-id="fde6187d06a948c492406a208d30ea03"><span><div id="fde6187d06a948c492406a208d30ea03" class="notion-header-anchor"></div><a class="notion-hash-link" href="#fde6187d06a948c492406a208d30ea03" title="数据形式"><svg viewBox="0 0 16 16" width="16" height="16"><path fill-rule="evenodd" d="M7.775 3.275a.75.75 0 001.06 1.06l1.25-1.25a2 2 0 112.83 2.83l-2.5 2.5a2 2 0 01-2.83 0 .75.75 0 00-1.06 1.06 3.5 3.5 0 004.95 0l2.5-2.5a3.5 3.5 0 00-4.95-4.95l-1.25 1.25zm-4.69 9.64a2 2 0 010-2.83l2.5-2.5a2 2 0 012.83 0 .75.75 0 001.06-1.06 3.5 3.5 0 00-4.95 0l-2.5 2.5a3.5 3.5 0 004.95 4.95l1.25-1.25a.75.75 0 00-1.06-1.06l-1.25 1.25a2 2 0 01-2.83 0z"></path></svg></a><span class="notion-h-title">数据形式</span></span></h4><figure class="notion-asset-wrapper notion-asset-wrapper-image notion-block-82090450dc93496fbe3b1031a058722a"><div style="position:relative;display:flex;justify-content:center;align-self:center;width:100%;max-width:100%;flex-direction:column;height:100%"><img style="object-fit:cover" src="https://www.notion.so/image/https%3A%2F%2Fs3-us-west-2.amazonaws.com%2Fsecure.notion-static.com%2Fe9b2e513-94fd-44da-975d-2eb9a470fae0%2FUntitled.png?table=block&amp;id=82090450-dc93-496f-be3b-1031a058722a&amp;t=82090450-dc93-496f-be3b-1031a058722a&amp;width=2908&amp;cache=v2" alt="notion image" loading="lazy" decoding="async"/></div></figure><div class="notion-blank notion-block-ca36c9e3f99d408f9dba1c66e800edb8"> </div><div class="notion-blank notion-block-e8734055c7f94582a82cec38d6a6b309"> </div></main></div>]]></content:encoded>
        </item>
    </channel>
</rss>