Higress AI 缓存插件与 TextIn 向量服务对接实践

2025-06-09 00:28:33作者：廉彬冶Miranda

背景介绍

在现代 AI 应用架构中，向量检索和缓存是提升大模型应用性能的关键组件。Higress 作为阿里巴巴开源的云原生网关，其 AI 插件生态提供了与各类 AI 服务的深度集成能力。本文将详细介绍如何将 Higress 的 AI 缓存插件与 TextIn 向量服务进行对接，并分享在实践过程中遇到的技术问题与解决方案。

核心组件解析

1. Higress AI 插件架构

Higress 的 AI 插件体系主要包含两大核心组件：

AI Proxy 插件：负责与各类大模型 API 的对接和协议转换
AI Cache 插件：提供向量检索和缓存能力，支持多种向量数据库和嵌入模型

2. TextIn 向量服务

TextIn 提供的高质量文本嵌入服务，能够将文本转换为高维向量表示。其特点包括：

支持 1792 维的 Matryoshka 降维技术
提供稳定高效的 API 接口
适用于语义搜索、推荐系统等场景

配置实践详解

1. 基础环境搭建

通过 Docker Compose 部署 Higress 网关环境时，需要特别注意：

services:
  envoy:
    image: higress-registry.cn-hangzhou.cr.aliyuncs.com/higress/gateway:v2.0.2
    command: -c /etc/envoy/envoy.yaml --component-log_level wasm:debug
    volumes:
      - ./envoy.yaml:/etc/envoy/envoy.yaml
      - ./ai-cache.wasm:/etc/envoy/main.wasm
      - ./ai-proxy.wasm:/etc/envoy/ai.wasm

2. 关键配置说明

在 envoy.yaml 配置中，需要重点关注以下部分：

http_filters:
  - name: cache
    typed_config:
      value:
        config:
          configuration:
            value: |
              {
                "embedding": {
                  "type": "textin",
                  "serviceName": "textin.dns",
                  "textinAppId": "your_app_id",
                  "textinSecretCode": "your_secret",
                  "textinMatryoshkaDim": 1792
                },
                "vector": {
                  "type": "dashvector",
                  "serviceName": "dashvector.dns",
                  "collectionID": "your_collection",
                  "apiKey": "your_api_key"
                }
              }

3. 集群配置要点

每个外部服务都需要配置对应的集群：

clusters:
  - name: textin.dns
    type: STRICT_DNS
    load_assignment:
      endpoints:
        - lb_endpoints:
            - endpoint:
                address:
                  socket_address:
                    address: api.textin.com
                    port_value: 443
    transport_socket:
      name: envoy.transport_sockets.tls
      typed_config:
        "@type": type.googleapis.com/envoy.extensions.transport_sockets.tls.v3.UpstreamTlsContext
        "sni": "api.textin.com"