如何用Unsloth创建领域专用模型？医疗文本微调案例

2026-01-29 12:39:10作者：薛曦旖Francesca

在人工智能快速发展的今天，领域专用模型在医疗、法律等专业领域发挥着越来越重要的作用。Unsloth作为一款高效的微调工具，能够帮助开发者快速构建适用于特定领域的模型。本文将以医疗文本微调为例，详细介绍如何使用Unsloth创建医疗领域专用模型，让AI更好地服务于医疗行业。

为什么选择Unsloth进行领域模型微调？

Unsloth是一个专为大语言模型微调设计的工具，它具有以下优势：

高效性：Unsloth采用先进的优化技术，能够显著提高微调速度，减少训练时间和资源消耗。
易用性：提供简洁的API和详细的文档，即使是新手也能快速上手。
灵活性：支持多种模型和微调方法，可根据不同领域需求进行定制。
开源免费：完全开源，无需担心版权和费用问题。

准备工作：环境搭建与数据准备

在开始微调之前，需要先搭建Unsloth的运行环境并准备医疗领域的数据集。

环境搭建

首先，通过以下命令克隆Unsloth仓库并安装相关依赖：

git clone https://gitcode.com/GitHub_Trending/notebooks24/notebooks
cd notebooks
pip install unsloth

数据集准备

医疗领域的微调需要高质量的标注数据。我们可以使用医疗影像报告数据集，例如包含X光片、CT扫描等医学图像及其对应的诊断报告。在项目中，我们使用了Radiology_mini数据集，该数据集包含X光片、CT扫描和超声波图像及其专家描述。

微调步骤详解

1. 加载预训练模型

使用Unsloth提供的FastVisionModel加载预训练的视觉语言模型，例如Llama-3.2-11B-Vision-Instruct：

from unsloth import FastVisionModel

model, tokenizer = FastVisionModel.from_pretrained(
    "unsloth/Llama-3.2-11B-Vision-Instruct",
    load_in_4bit=True,
    use_gradient_checkpointing="unsloth"
)

2. 数据预处理

将医疗影像数据和对应的文本描述转换为模型需要的格式。例如，将数据组织成对话形式：

def convert_to_conversation(sample):
    conversation = [
        { "role": "user",
          "content" : [
            {"type" : "text",  "text"  : "You are an expert radiographer. Describe accurately what you see in this image."},
            {"type" : "image", "image" : sample["image"]}
          ]
        },
        { "role" : "assistant",
          "content" : [
            {"type" : "text",  "text"  : sample["caption"]}
          ]
        },
    ]
    return { "messages" : conversation }

converted_dataset = [convert_to_conversation(sample) for sample in dataset]

3. 配置训练参数

使用SFTTrainer进行训练配置，设置批次大小、学习率等参数：

from unsloth.trainer import UnslothVisionDataCollator
from trl import SFTTrainer, SFTConfig

trainer = SFTTrainer(
    model=model,
    tokenizer=tokenizer,
    data_collator=UnslothVisionDataCollator(model, tokenizer),
    train_dataset=converted_dataset,
    args=SFTConfig(
        per_device_train_batch_size=2,
        gradient_accumulation_steps=4,
        warmup_steps=5,
        max_steps=30,
        learning_rate=2e-4,
        logging_steps=1,
        optim="adamw_8bit",
        weight_decay=0.001,
        lr_scheduler_type="linear",
        seed=3407,
        output_dir="outputs",
        remove_unused_columns=False,
        dataset_text_field="",
        dataset_kwargs={"skip_prepare_dataset": True},
        max_length=2048,
    ),
)

4. 开始微调训练

运行训练命令，开始模型微调：

trainer.train()

模型评估与应用

微调完成后，需要对模型进行评估，确保其在医疗影像描述任务上的准确性。可以使用测试集对模型进行测试，检查模型生成的诊断报告是否准确、专业。

例如，输入一张X光片，模型能够准确描述其中的异常区域：

image = dataset[0]["image"]
instruction = "You are an expert radiographer. Describe accurately what you see in this image."

messages = [
    {"role": "user", "content": [
        {"type": "image"},
        {"type": "text", "text": instruction}
    ]}
]
input_text = tokenizer.apply_chat_template(messages, add_generation_prompt=True)
inputs = tokenizer(
    image,
    input_text,
    add_special_tokens=False,
    return_tensors="pt",
).to("cuda")

outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))