Label Studio ML后端开发：解决预测结果格式错误问题

2025-05-09 23:09:35作者：蔡丛锟

在Label Studio ML后端开发过程中，开发者经常会遇到预测结果格式不符合要求的问题。本文将深入分析这类问题的成因，并提供完整的解决方案。

问题现象分析

当使用自定义ML模型与Label Studio集成时，后端服务可能会报错："ML backend returns an incorrect response, results field must be a list with at least one item"。这个错误表明ML后端返回的预测结果格式不符合Label Studio的接口规范。

核心问题解析

Label Studio对ML后端返回的预测结果有严格的格式要求：

必须包含results字段
results字段必须是一个列表
列表至少要包含一个预测项

常见错误原因包括：

预测结果直接返回了检测结果而没有包装成Label Studio要求的格式
当模型没有检测到任何目标时，返回了空列表或None
结果字典中缺少必要的字段

解决方案实现

1. 基础模型类结构

创建一个继承自LabelStudioMLBase的基础类，确保初始化时正确设置标签配置：

from label_studio_ml.model import LabelStudioMLBase

class CustomDetector(LabelStudioMLBase):
    def __init__(self, **kwargs):
        self.labels = ["hornet", "nest"]  # 定义标签列表
        self.label_map = {0: "hornet", 1: "nest"}  # 类别映射
        self.label_config = {"labels": self.labels}  # 标签配置
        super().__init__(**kwargs)

2. 预测方法实现

预测方法需要正确处理各种边界情况：

def predict(self, tasks, **kwargs):
    predictions = []
    
    for task in tasks:
        try:
            # 获取图像并进行预测
            image_url = task['data'].get('image', '')
            img_path = self.get_local_path(image_url)
            
            if not img_path:
                predictions.append(self._create_default_result())
                continue
                
            # 执行模型预测
            detections = self.model(img_path)[0]
            
            if not hasattr(detections, "boxes") or len(detections.boxes) == 0:
                predictions.append(self._create_default_result())
                continue
                
            # 转换预测结果为Label Studio格式
            task_results = []
            for box in detections.boxes:
                result = self._convert_detection(box, detections.orig_shape)
                if result:
                    task_results.append(result)
                    
            # 确保至少返回一个结果
            if not task_results:
                task_results = [self._create_default_result()]
                
            predictions.append({"results": task_results})
            
        except Exception as e:
            predictions.append({"results": [self._create_default_result()]})
    
    return predictions

3. 辅助方法实现

def _convert_detection(self, box, image_size):
    """将检测框转换为Label Studio格式"""
    img_width, img_height = image_size[1], image_size[0]
    cls_id = int(box.cls.cpu().numpy())
    conf = float(box.conf.cpu().numpy())
    xyxy = box.xyxy.cpu().numpy()[0]
    
    # 坐标转换
    x_min, y_min, x_max, y_max = xyxy
    x = max(0, (x_min / img_width) * 100)
    y = max(0, (y_min / img_height) * 100)
    width = min(100, ((x_max - x_min) / img_width) * 100)
    height = min(100, ((y_max - y_min) / img_height) * 100)
    
    label_name = self.label_map.get(cls_id, "unknown")
    
    return {
        "from_name": "label",
        "to_name": "image",
        "type": "rectanglelabels",
        "original_width": img_width,
        "original_height": img_height,
        "image_rotation": 0,
        "value": {
            "rectanglelabels": [label_name],
            "x": x,
            "y": y,
            "width": width,
            "height": height
        },
        "score": conf
    }

def _create_default_result(self, image_size=(100, 100)):
    """创建默认结果，确保格式正确"""
    return {
        "from_name": "label",
        "to_name": "image",
        "type": "rectanglelabels",
        "original_width": image_size[0],
        "original_height": image_size[1],
        "image_rotation": 0,
        "value": {
            "rectanglelabels": [self.labels[0]],
            "x": 0,
            "y": 0,
            "width": 10,
            "height": 10
        },
        "score": 0.0
    }