YOLOE: A Tutorial on the Real-Time Object Detection and Segmentation Model

2026-01-30 04:37:39 Author: 郦嵘贵Just

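1. Project Introduction

YOLOE (ye) is an efficient, unified, and open object detection and segmentation model. Like the human eye, it can "see" anything in real time under different prompt mechanisms: text prompts, visual prompts, and a prompt-free mode. Unlike traditional YOLO models, YOLOE is not restricted to a predefined set of categories, which makes it more adaptable in open-world scenarios. It combines detection and segmentation in a single model and supports several open prompt mechanisms with no additional inference or transfer overhead relative to closed-set YOLO models.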

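2. Quick Start

Environment Setup

First, create a Python virtual environment and install the required dependencies. Run the last command from the root of a local clone of the repository, where requirements.txt is located.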

conda create -n yoloe python=3.10 -y
conda activate yoloe
pip install -r requirements.txt

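Alternatively, you can install the project directly with the following commands. The final command downloads the MobileCLIP checkpoint mobileclip_blt.pt into the current directory.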

pip install git+https://github.com/THU-MIG/yoloe.git#subdirectory=third_party/CLIP
pip install git+https://github.com/THU-MIG/yoloe.git#subdirectory=third_party/ml-mobileclip
pip install git+https://github.com/THU-MIG/yoloe.git#subdirectory=third_party/lvis-api
pip install git+https://github.com/THU-MIG/yoloe.git
wget https://docs-assets.developer.apple.com/ml-research/datasets/mobileclip/mobileclip_blt.pt
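
After installation it can help to confirm that everything landed where expected. The following is a minimal sketch using only the Python standard library. The import names in REQUIRED_MODULES (torch, ultralytics, clip, mobileclip, lvis) are assumptions about how the packages installed above expose themselves, and check_environment is a hypothetical helper, not part of YOLOE.

```python
import importlib.util
from pathlib import Path

# Assumed top-level import names for the packages installed above;
# adjust this list if your installation exposes different names.
REQUIRED_MODULES = ["torch", "ultralytics", "clip", "mobileclip", "lvis"]


def check_environment(modules=REQUIRED_MODULES, checkpoint="mobileclip_blt.pt"):
    """Return a dict mapping each requirement to True (found) or False (missing)."""
    status = {}
    for name in modules:
        # find_spec returns None when a top-level module cannot be located
        status[name] = importlib.util.find_spec(name) is not None
    # The checkpoint is expected in the current working directory
    status[checkpoint] = Path(checkpoint).is_file()
    return status


if __name__ == "__main__":
    for item, ok in check_environment().items():
        print(f"{'OK     ' if ok else 'MISSING'} {item}")
```

Run the script from the directory where you executed the wget command, inside the activated yoloe environment. Any line marked MISSING points to a step that needs to be repeated.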

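Model Prediction

The following is a basic example of running prediction with YOLOE:

# Import the YOLOE model (the repository installs it as part of the ultralytics package)
from ultralytics import YOLOE

# Create a model instance from a local checkpoint (.pt file)
model = YOLOE('path/to/weights.pt')

# Run prediction on an image
results = model.predict('path/to/image.jpg')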
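
Because YOLOE is not tied to predefined categories, you usually tell it what to look for before predicting. The sketch below shows one way to do this with a text prompt. It assumes the installed package exposes YOLOE through the ultralytics namespace and provides set_classes and get_text_pe, as the Ultralytics-style API does. parse_class_names and predict_with_text_prompt are hypothetical helpers introduced here for illustration.

```python
def parse_class_names(prompt: str) -> list[str]:
    """Split a comma-separated prompt such as "person, bus" into clean class names."""
    names = [part.strip() for part in prompt.split(",")]
    return [name for name in names if name]


def predict_with_text_prompt(weights: str, image_path: str, prompt: str):
    """Load a YOLOE checkpoint and detect only the classes named in the prompt."""
    # Imported lazily so this file can be loaded without the package installed
    from ultralytics import YOLOE

    names = parse_class_names(prompt)
    if not names:
        raise ValueError("prompt must contain at least one class name")

    model = YOLOE(weights)
    # Assumed API: encode the names as text embeddings and register them
    # as the classes the model should detect
    model.set_classes(names, model.get_text_pe(names))
    return model.predict(image_path)
```

A call would look like predict_with_text_prompt('path/to/weights.pt', 'path/to/image.jpg', 'person, bus'). Encoding the text prompt is expected to rely on the MobileCLIP checkpoint downloaded during setup.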

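Model Transfer

If you need to transfer the model to your own data, you can start from the steps below, which load the weights, preprocess an image by hand, and run the model on the resulting tensor:

# Import the required libraries
from ultralytics import YOLOE
from torchvision import transforms
from PIL import Image

# Load the model
model = YOLOE('path/to/weights.pt')

# Load the image as RGB
image = Image.open('path/to/image.jpg').convert('RGB')

# Convert the image to a tensor
# Tensor inputs are expected in the [0, 1] range, with height and width
# divisible by the model stride (typically 32), so resize to 640x640
# and do not apply ImageNet mean/std normalization
transform = transforms.Compose([
    transforms.Resize((640, 640)),
    transforms.ToTensor(),
])
image_tensor = transform(image).unsqueeze(0)

# Run prediction on the tensor
results = model(image_tensor)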
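
The returned results still need to be turned into something you can work with. The sketch below filters and sorts detections using plain Python lists, so it does not depend on any particular library. It assumes the results follow the Ultralytics Results layout, where results[0].boxes.xyxy, results[0].boxes.conf, and results[0].boxes.cls hold boxes, scores, and class indices, and results[0].names maps indices to labels. filter_detections is a hypothetical helper.

```python
def filter_detections(boxes, scores, class_ids, names, min_score=0.25):
    """Pair up parallel detection lists and keep those scoring at least min_score.

    boxes:     list of [x1, y1, x2, y2] coordinates
    scores:    list of confidence values
    class_ids: list of class indices (ints or floats)
    names:     dict or list mapping class index to label
    """
    kept = []
    for box, score, class_id in zip(boxes, scores, class_ids):
        if score < min_score:
            continue
        kept.append({
            "label": names[int(class_id)],
            "score": float(score),
            "box": [float(v) for v in box],
        })
    # Highest-confidence detections first
    kept.sort(key=lambda det: det["score"], reverse=True)
    return kept
```

Under the assumed layout, the inputs could be obtained with r = results[0], then r.boxes.xyxy.tolist(), r.boxes.conf.tolist(), r.boxes.cls.tolist(), and r.names.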