Label Studio 关键点标注数据转换为 YOLO 格式的完整指南

2025-05-10 21:21:24作者：薛曦旖Francesca

前言

在计算机视觉领域，关键点检测是一项重要的任务，广泛应用于姿态估计、目标跟踪等场景。Label Studio 作为一款流行的数据标注工具，提供了灵活的关键点标注功能。本文将详细介绍如何将 Label Studio 中的关键点标注数据转换为 YOLO 格式，以便用于 YOLOv8 等模型的训练。

关键点标注数据格式对比

Label Studio 格式特点

Label Studio 的关键点标注数据通常包含以下特征：

使用 JSON 格式存储标注信息
关键点坐标以百分比形式表示（0-100%）
每个关键点可以关联到特定的边界框（通过 parentID 或 relation）
支持为关键点定义不同的标签类别

YOLO 关键点格式要求

YOLO 的关键点格式具有以下规范：

每个图像对应一个文本文件
每行表示一个对象实例
格式包含：类别索引、边界框中心坐标、宽高、关键点坐标
所有坐标值归一化到 0-1 范围

数据转换的核心思路

转换过程主要包含以下几个关键步骤：

解析原始数据：读取 Label Studio 导出的 JSON 文件
建立关联关系：将关键点与对应的边界框进行匹配
坐标转换：将百分比坐标转换为归一化坐标
格式重组：按照 YOLO 要求的顺序组织数据
输出结果：生成 YOLO 格式的文本文件

详细转换方案

1. 配置 Label Studio 标注模板

为了获得最佳转换效果，建议使用以下标注模板配置：

<View>
  <Image name="image" value="$image"/>
  
  <RectangleLabels name="bbox" toName="image">
    <Label value="目标类别"/>
  </RectangleLabels>
  
  <KeyPointLabels name="keypoints" toName="image" smart="true">
    <Label value="关键点1"/>
    <Label value="关键点2"/>
    <!-- 其他关键点定义 -->
  </KeyPointLabels>
</View>

这种配置会自动建立关键点与边界框的父子关系，便于后续处理。

2. Python 转换脚本实现

以下是完整的转换脚本，包含详细注释：

import json
import os

def convert_labelstudio_to_yolo(ls_json_path, output_dir, class_map):
    """
    将Label Studio标注数据转换为YOLO关键点格式
    
    参数:
        ls_json_path: Label Studio导出的JSON文件路径
        output_dir: 输出目录
        class_map: 类别名称到索引的映射字典
    """
    # 创建输出目录
    os.makedirs(output_dir, exist_ok=True)
    
    with open(ls_json_path, 'r') as f:
        data = json.load(f)
    
    for task in data:
        # 获取图像基本信息
        image_path = task['data'].get('image') or task['data'].get('img')
        if not image_path:
            continue
            
        image_name = os.path.basename(image_path)
        txt_filename = os.path.splitext(image_name)[0] + '.txt'
        txt_path = os.path.join(output_dir, txt_filename)
        
        # 处理标注结果
        annotations = task.get('annotations', [])
        if not annotations:
            continue
            
        annotation = annotations[0]['result']
        
        # 存储边界框和关键点信息
        bboxes = {}
        keypoints = []
        
        # 首先收集所有边界框
        for item in annotation:
            if item['type'] == 'rectanglelabels':
                bbox_id = item['id']
                label = item['value']['rectanglelabels'][0]
                
                # 获取图像原始尺寸
                width = item['original_width']
                height = item['original_height']
                
                # 转换边界框坐标
                x = item['value']['x'] / 100
                y = item['value']['y'] / 100
                w = item['value']['width'] / 100
                h = item['value']['height'] / 100
                x_center = x + w/2
                y_center = y + h/2
                
                bboxes[bbox_id] = {
                    'class_idx': class_map.get(label, 0),
                    'x_center': x_center,
                    'y_center': y_center,
                    'width': w,
                    'height': h,
                    'keypoints': []
                }
        
        # 然后收集关键点并关联到边界框
        for item in annotation:
            if item['type'] == 'keypointlabels':
                # 通过parentID或relation关联
                parent_id = item.get('parentID')
                if not parent_id:
                    # 如果没有parentID，尝试从relation中获取
                    for rel in annotation:
                        if rel.get('type') == 'relation' and rel['to_id'] == item['id']:
                            parent_id = rel['from_id']
                            break
                
                if parent_id in bboxes:
                    # 转换关键点坐标
                    kp_x = item['value']['x'] / 100
                    kp_y = item['value']['y'] / 100
                    
                    # 添加到对应边界框的关键点列表
                    bboxes[parent_id]['keypoints'].extend([kp_x, kp_y, 2])  # 2表示可见
        
        # 生成YOLO格式内容
        yolo_lines = []
        for bbox in bboxes.values():
            line = [
                bbox['class_idx'],
                bbox['x_center'],
                bbox['y_center'],
                bbox['width'],
                bbox['height']
            ]
            line.extend(bbox['keypoints'])
            yolo_lines.append(' '.join(map(str, line)))
        
        # 写入文件
        with open(txt_path, 'w') as f:
            f.write('\n'.join(yolo_lines))

3. 脚本使用示例

# 定义类别映射
class_map = {
    'person': 0,
    'fish': 1
    # 添加其他类别...
}

# 执行转换
convert_labelstudio_to_yolo(
    ls_json_path='label_studio_export.json',
    output_dir='yolo_labels',
    class_map=class_map
)

常见问题与解决方案

关键点与边界框关联失败
- 检查标注时是否正确建立了父子关系
- 确保在Label Studio中使用了正确的标注模板
- 可以尝试手动拖动关键点到边界框内
坐标转换错误
- 确认原始JSON中包含original_width和original_height
- 检查坐标值是否在预期范围内(0-100)
类别映射缺失
- 确保class_map包含所有出现的类别
- 可以为未知类别设置默认值
多对象实例处理
- 脚本已支持多个边界框及其关联关键点
- 每个实例会生成独立的行