Labelme转YOLO：3步搞定目标检测数据格式转换难题

2026-02-08 04:04:19作者：余洋婵Anita

Help converting LabelMe Annotation Tool JSON format to YOLO text file format. If you've already marked your segmentation dataset by LabelMe, it's easy to use this tool to help converting to YOLO format dataset.

项目地址：https://gitcode.com/gh_mirrors/la/Labelme2YOLO

在计算机视觉项目中，Labelme转YOLO格式转换是每个开发者都会遇到的必备技能。Labelme2YOLO工具能够快速高效地将Labelme标注数据转换为YOLO格式，让数据预处理变得简单快捷。本文将为你详细解析这一转换过程的核心要点。

🎯 准备工作与环境搭建

开始转换前，首先需要获取项目代码并配置运行环境：

git clone https://gitcode.com/gh_mirrors/la/Labelme2YOLO
cd Labelme2YOLO
pip install -r requirements.txt

项目依赖的核心包包括OpenCV、Pillow、scikit-learn等，这些库确保了图像处理和坐标转换的准确性。

🔄 数据转换的核心流程

第一步：整理原始标注数据

确保你的Labelme标注文件都存放在同一个目录下，文件结构应该清晰有序：

annotations/
├── image1.json
├── image2.json
├── image3.json
└── ...

每个JSON文件都包含了完整的图像信息和标注数据，这是转换的基础。

第二步：执行一键转换命令

使用简单的命令行即可完成格式转换：

python labelme2yolo.py --json_dir ./annotations --val_size 0.2

参数说明：

--json_dir：指定Labelme JSON文件所在目录
--val_size：设置验证集比例，0.2表示20%数据用于验证

第三步：验证转换结果质量

转换完成后，检查生成的YOLO格式数据集：

YOLODataset/
├── labels/
│   ├── train/
│   └── val/
├── images/
│   ├── train/
│   └── val/
└── dataset.yaml

📋 转换过程中的关键检查点

坐标归一化验证：确保所有YOLO坐标值都在0-1范围内 类别标签一致性：检查不同JSON文件中相同类别的标签名称是否一致 图像路径完整性：确认转换后的图像文件都能正常访问

🚀 高级功能与实用技巧

实例分割数据集转换

对于需要实例分割的项目，添加--seg参数即可：

python labelme2yolo.py --json_dir ./annotations --seg

大型数据集分批处理

处理数千个标注文件时，建议分批进行以避免内存问题：

import os
import shutil

# 分批处理逻辑
batch_size = 500
json_files = [f for f in os.listdir('annotations') if f.endswith('.json')]

for i in range(0, len(json_files), batch_size):
    batch_files = json_files[i:i+batch_size]
    temp_dir = f"temp_batch_{i//batch_size}"
    os.makedirs(temp_dir, exist_ok=True)
    
    for file in batch_files:
        shutil.copy(f"annotations/{file}", f"{temp_dir}/{file}")
    
    # 对每个批次执行转换
    os.system(f"python labelme2yolo.py --json_dir {temp_dir}")