Xinference项目中ChatGLM4Tokenizer的padding_side参数问题分析

2025-05-30 07:59:46作者：柯茵沙

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.

项目地址：https://gitcode.com/GitHub_Trending/in/inference

问题背景

在Xinference项目中使用CodeGeeX4模型时，用户遇到了一个关于ChatGLM4Tokenizer的错误。当尝试通过API端点进行聊天补全时，系统返回了错误信息："ChatGLM4Tokenizer._pad() got an unexpected keyword argument 'padding_side'"。

错误分析

这个错误表明在调用ChatGLM4Tokenizer的_pad方法时，传入了一个不被接受的参数padding_side。这通常发生在Hugging Face Transformers库的版本与模型tokenizer实现不兼容的情况下。

技术细节

Tokenizer功能：Tokenizer在自然语言处理中负责将文本转换为模型可理解的数字表示（token IDs）。padding_side参数通常控制填充(padding)的方向（左侧或右侧），这对于批处理输入序列很重要。
版本兼容性问题：较新版本的Transformers库可能对tokenizer的实现进行了修改，而CodeGeeX4模型基于的ChatGLM4Tokenizer可能还没有适配这些变更。
模型架构影响：CodeGeeX4是基于ChatGLM架构的代码生成模型，其tokenizer实现可能有特殊处理，不完全兼容标准Transformers接口。

解决方案

临时解决方案：可以尝试降级Transformers库版本到4.39.0到4.40.2之间，这些版本已知与模型兼容。
长期解决方案：等待模型提供方更新tokenizer实现，使其兼容最新版Transformers库。模型开发者需要调整tokenizer的_pad方法实现，以支持padding_side参数。