Spring AI项目中Chroma向量存储自动配置的租户与数据库问题解析

2025-06-10 14:14:29作者：董宙帆

背景介绍

在Spring AI项目的向量存储集成中，Chroma作为一款开源的向量数据库，提供了高效的向量相似度搜索能力。Spring AI通过自动配置机制简化了Chroma的集成过程，但在实际使用中发现了一个配置上的缺陷。

问题本质

在Spring AI的自动配置实现中，ChromaVectorStoreAutoConfiguration类负责创建Chroma向量存储的Bean。当前实现存在一个关键问题：虽然配置属性类ChromaVectorStoreProperties已经包含了tenantName(租户名)和databaseName(数据库名)的配置项，但在自动配置过程中，这些属性值并没有被实际应用到创建的ChromaVectorStore实例上。

技术细节分析

配置属性类：ChromaVectorStoreProperties类中定义了三个重要属性：
- collectionName(集合名)
- databaseName(数据库名)
- tenantName(租户名)
自动配置类：ChromaVectorStoreAutoConfiguration在创建vectorStore Bean时，仅使用了collectionName属性值，忽略了其他两个重要属性。
导致的结果：即使用户在application.yml中配置了正确的tenantName和databaseName，系统仍然会使用默认值"SpringAiTenant"和"SpringAiDatabase"，这显然不符合预期行为。

影响范围

这个问题会导致以下具体影响：

多租户支持失效：Chroma设计上支持多租户隔离，但自动配置的默认行为使得所有应用都使用同一个租户空间。
数据库隔离失效：在同一租户下，无法按照预期使用不同的数据库进行数据隔离。
URL构造错误：最终生成的API请求URL会包含错误的租户和数据库路径，如示例中的http://localhost:8008/api/v2/tenants/SpringAiTenant/databases/SpringAiDatabase/collections/knowledge。

解决方案

正确的实现应该将三个配置属性都应用到ChromaVectorStore的构造函数中。修复后的代码逻辑应该是：

@Bean
public VectorStore vectorStore(ChromaApi chromaApi, ChromaVectorStoreProperties storeProperties) {
    return new ChromaVectorStore(chromaApi, 
        storeProperties.getCollectionName(),
        storeProperties.getTenantName(),
        storeProperties.getDatabaseName());
}