Analyzing Inconsistent Responses with Pydantic Response Schemas in the Google Generative AI Python SDK

2025-07-03 01:57:13 Author: 江焘钦

The Google AI Python SDK enables developers to use Google's state-of-the-art generative AI models (like Gemini and PaLM) to build AI-powered features and applications.

Project URL: https://gitcode.com/gh_mirrors/ge/generative-ai-python

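When using the Google Generative AI Python SDK, developers found that defining the response schema with a Pydantic model produces markedly less consistent responses than defining it directly as a dictionary. This article analyzes the technical cause of this behavior and offers a solution.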

Symptoms

While using the Gemini-1.5-flash model to generate slide content, a developer defined the following Pydantic model:

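from pydantic import BaseModel

# The `str | None` union syntax requires Python 3.10 or later.
class SlideSchema(BaseModel):
    header: str                  # must be a string
    subheader: str | None        # may be None
    body: str | None             # may be None
    image: str | None            # may be None
    teachers_notes: str | None   # may be None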

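With this Pydantic model as the response schema, the success rate was only about 20%. After switching to an explicitly defined dictionary schema, the success rate rose to 100%.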

Technical Analysis

Investigation showed that the root cause is the SDK's incomplete internal conversion of Pydantic models. Specifically:

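Missing required field: when the Pydantic model is converted to the internal schema, no explicit declaration of required fields is emitted
Inconsistent nullable handling: nullable fields are correctly marked as nullable, but the schema as a whole is validated less strictly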

Inspecting the SDK source shows that the Pydantic model is ultimately converted into the following structure:

{
    'items': {
        'properties': {
            'header': {'type': 'string'},
            'subheader': {'type': 'string', 'nullable': True},
            'body': {'type': 'string', 'nullable': True},
            'image': {'type': 'string', 'nullable': True},
            'teachers_notes': {'type': 'string', 'nullable': True}
        },
        'type': 'object'
    },
    'type': 'array'
}
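
One way to confirm the missing declaration is to walk a converted structure and list every object node that has properties but no required entry. The sketch below assumes the schema is available as a plain dict shaped like the one above (shortened here to two fields). The name find_objects_without_required is a hypothetical helper for illustration, not part of the SDK.

```python
def find_objects_without_required(schema, path="$"):
    """Return the paths of nodes that declare 'properties' but no 'required' list."""
    missing = []
    if not isinstance(schema, dict):
        return missing
    props = schema.get("properties")
    if isinstance(props, dict):
        if not schema.get("required"):
            missing.append(path)
        # Nested objects can have the same problem, so recurse into each property.
        for name, sub in props.items():
            missing.extend(find_objects_without_required(sub, f"{path}.{name}"))
    items = schema.get("items")
    if isinstance(items, dict):
        # "[]" marks the element schema of an array node.
        missing.extend(find_objects_without_required(items, f"{path}[]"))
    return missing


# Shortened copy of the converted structure (an assumption for illustration).
converted = {
    "type": "array",
    "items": {
        "type": "object",
        "properties": {
            "header": {"type": "string"},
            "subheader": {"type": "string", "nullable": True},
        },
    },
}

print(find_objects_without_required(converted))  # ['$[]']
```

An empty result would mean every object node already declares its required fields.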

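The manually defined dictionary schema, by contrast, contains more complete validation information, in particular a declaration of the required fields: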

{
    'type': 'ARRAY',
    'items': {
        'type': 'OBJECT',
        'properties': {
            'header': {'type': 'STRING', 'nullable': False},
            'subheader': {'type': 'STRING', 'nullable': True},
            'body': {'type': 'STRING', 'nullable': True},
            'image': {'type': 'STRING', 'nullable': True},
            'teachers_notes': {'type': 'STRING', 'nullable': True}
        },
        'required': ['header', 'subheader', 'body', 'image', 'teachers_notes']
    }
}
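
Since the difference between the two schemas comes down to the required list, one possible workaround is to post-process a dict schema so that every object node lists its properties as required, matching the hand-written form above. This is a minimal sketch that assumes the schema is a plain dict. The name add_required is a hypothetical helper, not an SDK function, and marking every property as required mirrors the example above rather than a general rule.

```python
import copy


def add_required(schema):
    """Return a copy of schema where every object node lists all its properties as required."""
    patched = copy.deepcopy(schema)  # leave the caller's dict untouched
    _patch(patched)
    return patched


def _patch(node):
    if not isinstance(node, dict):
        return
    props = node.get("properties")
    if isinstance(props, dict):
        # Keep an existing 'required' list; otherwise require every property.
        node.setdefault("required", list(props))
        for sub in props.values():
            _patch(sub)
    _patch(node.get("items"))


# Shortened schema without a 'required' list (an assumption for illustration).
schema = {
    "type": "ARRAY",
    "items": {
        "type": "OBJECT",
        "properties": {
            "header": {"type": "STRING"},
            "subheader": {"type": "STRING", "nullable": True},
        },
    },
}

patched = add_required(schema)
print(patched["items"]["required"])  # ['header', 'subheader']
```

The patched dict can then be supplied wherever the hand-written dictionary schema was used. Nullable fields stay nullable, so the model can still return null for them while being required to emit the key.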