SD-WebUI-ControlNet扩展中IP Adapter Face ID预处理器的图像裁剪问题分析

2025-05-12 01:37:23作者：邵娇湘

问题背景

在使用Stable Diffusion WebUI的ControlNet扩展时，特别是当配合IP Adapter Face ID预处理器使用时，用户发现了一个图像处理问题。当独立图像(independent image)的尺寸与修复(inpaint)区域图像尺寸不一致，且修复区域较小时，独立图像会被意外裁剪，导致面部识别功能失效。

技术原理

ControlNet扩展中的IP Adapter Face ID预处理器是专门用于面部特征提取和适配的模块。它通过深度学习模型分析输入图像中的面部特征，生成面部嵌入(face embedding)，用于指导图像生成过程。

在img2img(图像到图像)工作流程中，特别是当使用A1111掩码进行局部修复时，系统默认会对输入图像进行裁剪以匹配修复区域。这一优化设计在大多数ControlNet模块中能提高效率，但对于需要全局面部信息的Face ID预处理器却会造成问题。

问题表现

当同时满足以下条件时会出现问题：

使用独立图像作为ControlNet输入
该图像尺寸与修复图像尺寸不同
修复区域相对较小
启用了"Crop Input image with A1111 mask"选项

此时系统会错误地裁剪独立图像，导致预处理器无法检测到完整的面部信息，抛出"No face found in image"异常。

解决方案

临时解决方法

用户发现可以通过修改ControlNet扩展的源代码来解决问题。具体是在controlnet.py文件中，修改条件判断逻辑，将IP Adapter Face ID模块排除在图像裁剪逻辑之外，使其行为类似于reference模块。

修改前代码：

if ('reference' not in unit.module 
    and is_only_masked_inpaint 
    and (is_upscale_script or unit.inpaint_crop_input_image)):

修改后代码：

if ('reference' not in unit.module 
    and 'ip-adapter_face_id' not in unit.module 
    and is_only_masked_inpaint 
    and (is_upscale_script or unit.inpaint_crop_input_image)):