PaddleOCR版面分析模型转ONNX格式的注意事项

2025-05-01 15:18:40作者：董灵辛Dennis

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

项目地址：https://gitcode.com/GitHub_Trending/pa/PaddleOCR

在PaddleOCR项目中使用版面分析功能时，将训练好的PaddlePaddle模型转换为ONNX格式是一个常见需求。本文将详细介绍这一转换过程中的关键步骤和注意事项，帮助开发者避免常见错误。

模型训练阶段

首先需要训练版面分析模型，使用以下命令：

python3 tools/train.py -c configs/picodet/legacy_model/application/layout_analysis/picodet_lcnet_x1_0_layout.yml --eval

这个命令会基于指定的配置文件训练一个版面分析模型，并在训练过程中进行评估。

模型导出阶段

训练完成后，需要将模型导出为推理格式。这里有一个关键点需要注意：

python3 tools/export_model.py -c configs/picodet/legacy_model/application/layout_analysis/picodet_lcnet_x1_0_layout.yml -o export.benchmark=True weights=output/picodet_lcnet_x1_0_layout/best_model --output_dir=output_inference/

特别注意：必须添加export.benchmark=True参数，否则后续转换为ONNX格式时会出现输入维度不匹配的问题。这是因为默认导出的模型会保留一些训练时的特性，而benchmark模式会优化模型结构使其更适合推理场景。

转换为ONNX格式

使用Paddle2ONNX工具将导出的模型转换为ONNX格式：

paddle2onnx --model_dir output_inference/picodet_lcnet_x1_0_layout --model_filename model.pdmodel --params_filename model.pdiparams --save_file ./onnx_inference_model/hehe_layout_model_dir/model.onnx --opset_version 11 --enable_onnx_checker True

参数说明：

--opset_version 11：指定ONNX算子集版本
--enable_onnx_checker True：启用ONNX模型检查器

在PPStructure中使用ONNX模型

最后，在PPStructure中加载ONNX格式的版面分析模型：

table_engine = PPStructure(
    show_log=False,
    ocr=True,
    layout=True,
    table=True,
    use_onnx=True,
    det_model_dir="./onnx_inference_model/det_onnx/model.onnx",
    rec_model_dir="./onnx_inference_model/rec_onnx/model.onnx",
    cls_model_dir="./onnx_inference_model/rec_onnx/model.onnx",
    table_model_dir="./onnx_inference_model/table_model_dir/model.onnx",
    layout_model_dir="./onnx_inference_model/hehe_layout_model_dir/model.onnx"
)

常见问题解决

如果在使用过程中遇到类似"ValueError: Required inputs (['scale_factor']) are missing from input feed (['image'])"的错误，通常是因为模型导出时没有设置export.benchmark=True参数。这个错误表明ONNX模型期望的输入与实际的输入不匹配。