AWS Bedrock Runtime Converse API工具特性场景详解

2026-02-04 04:33:39作者：谭伦延

Welcome to the AWS Code Examples Repository. This repo contains code examples used in the AWS documentation, AWS SDK Developer Guides, and more. For more information, see the Readme.md file below.

项目地址：https://gitcode.com/gh_mirrors/aw/aws-doc-sdk-examples

引言：重新定义AI应用开发范式

在人工智能应用开发领域，开发者经常面临一个核心挑战：如何让大语言模型（LLM）不仅能够生成文本，还能与外部工具和服务进行智能交互？AWS Bedrock Runtime Converse API的出现彻底改变了这一局面，它提供了一个统一的对话式接口，支持工具调用、流式响应和多模态交互，让开发者能够构建真正智能的应用系统。

本文将深入解析Converse API的核心特性，通过实际代码示例展示其强大功能，并探讨在不同业务场景下的最佳实践。

Converse API架构概览

AWS Bedrock Converse API采用统一的对话式接口设计，支持多种交互模式：

graph TB
    A[客户端应用] --> B[Converse API]
    B --> C[基础模型 Claude/Cohere/Llama等]
    B --> D[工具调用机制]
    B --> E[流式响应处理]
    B --> F[多模态内容支持]
    
    D --> G[外部API服务]
    D --> H[数据库系统]
    D --> I[业务逻辑服务]
    
    E --> J[实时内容流]
    F --> K[文本/图像/文档处理]

核心请求参数结构

{
  "modelId": "anthropic.claude-3-haiku-20240307-v1:0",
  "messages": [
    {
      "role": "user",
      "content": [{"text": "用户输入内容"}]
    }
  ],
  "system": [{"text": "系统提示词"}],
  "toolConfig": {
    "tools": [
      {
        "toolSpec": {
          "name": "工具名称",
          "description": "工具描述",
          "inputSchema": {
            "json": {
              "type": "object",
              "properties": {
                "参数名": {"type": "参数类型"}
              }
            }
          }
        }
      }
    ]
  },
  "inferenceConfig": {
    "maxTokens": 512,
    "temperature": 0.5,
    "topP": 0.9
  }
}

基础用法：文本对话示例

Python实现基础对话

import boto3
from botocore.exceptions import ClientError

# 创建Bedrock Runtime客户端
client = boto3.client("bedrock-runtime", region_name="us-east-1")

# 设置模型ID
model_id = "anthropic.claude-3-haiku-20240307-v1:0"

# 构建对话消息
conversation = [
    {
        "role": "user",
        "content": [{"text": "解释机器学习中的过拟合现象及其解决方案"}]
    }
]

try:
    # 调用Converse API
    response = client.converse(
        modelId=model_id,
        messages=conversation,
        inferenceConfig={"maxTokens": 512, "temperature": 0.5}
    )

    # 提取响应文本
    response_text = response["output"]["message"]["content"][0]["text"]
    print("模型响应:", response_text)

except ClientError as e:
    print(f"API调用错误: {e}")

JavaScript实现方案

import { BedrockRuntimeClient, ConverseCommand } from "@aws-sdk/client-bedrock-runtime";

const client = new BedrockRuntimeClient({ region: "us-east-1" });

const command = new ConverseCommand({
  modelId: "anthropic.claude-3-haiku-20240307-v1:0",
  messages: [
    {
      role: "user",
      content: [{ text: "请用JavaScript实现一个快速排序算法" }]
    }
  ],
  inferenceConfig: {
    maxTokens: 1000,
    temperature: 0.2
  }
});

try {
  const response = await client.send(command);
  console.log(response.output.message.content[0].text);
} catch (error) {
  console.error("调用失败:", error);
}

高级特性：工具调用实战

工具调用架构设计

Converse API的工具调用功能允许模型动态调用外部工具和服务，实现真正的智能交互：

sequenceDiagram
    participant User
    participant App
    participant Bedrock
    participant Tool

    User->>App: 发送请求
    App->>Bedrock: Converse API调用
    Bedrock->>Bedrock: 分析请求，识别工具需求
    Bedrock->>App: 返回工具调用请求
    App->>Tool: 执行工具调用
    Tool->>App: 返回工具结果
    App->>Bedrock: 发送工具结果
    Bedrock->>Bedrock: 整合结果生成最终响应
    Bedrock->>App: 返回完整响应
    App->>User: 展示最终结果

天气查询工具示例

import boto3
import requests
import json

class WeatherToolDemo:
    def __init__(self):
        self.client = boto3.client("bedrock-runtime", region_name="us-east-1")
        self.model_id = "anthropic.claude-3-sonnet-20240229-v1:0"
        
        # 系统提示词
        self.system_prompt = [{
            "text": """你是一个天气助手，专门使用Weather_Tool查询天气信息。
            根据用户提供的位置名称推断经纬度坐标，使用工具获取天气数据。
            温度使用摄氏度(华氏度)，风速使用公里/小时(英里/小时)。
            只使用提供的工具，不猜测或编造信息。"""
        }]
        
        # 工具配置
        self.tool_config = {
            "tools": [{
                "toolSpec": {
                    "name": "Weather_Tool",
                    "description": "获取指定位置的当前天气信息",
                    "inputSchema": {
                        "json": {
                            "type": "object",
                            "properties": {
                                "latitude": {"type": "number", "description": "纬度"},
                                "longitude": {"type": "number", "description": "经度"}
                            },
                            "required": ["latitude", "longitude"]
                        }
                    }
                }
            }]
        }

    def get_weather_data(self, latitude, longitude):
        """调用天气API获取实际数据"""
        url = f"https://api.open-meteo.com/v1/forecast"
        params = {
            "latitude": latitude,
            "longitude": longitude,
            "current": "temperature_2m,wind_speed_10m,relative_humidity_2m"
        }
        response = requests.get(url, params=params)
        return response.json()

    def run_conversation(self, user_message):
        conversation = [{"role": "user", "content": [{"text": user_message}]}]
        
        try:
            response = self.client.converse(
                modelId=self.model_id,
                messages=conversation,
                system=self.system_prompt,
                toolConfig=self.tool_config
            )
            
            # 处理工具调用
            if response["stopReason"] == "tool_use":
                tool_results = []
                for content in response["output"]["message"]["content"]:
                    if "toolUse" in content:
                        tool_use = content["toolUse"]
                        if tool_use["name"] == "Weather_Tool":
                            # 执行实际工具调用
                            weather_data = self.get_weather_data(
                                tool_use["input"]["latitude"],
                                tool_use["input"]["longitude"]
                            )
                            
                            tool_results.append({
                                "toolResult": {
                                    "toolUseId": tool_use["toolUseId"],
                                    "content": [{"json": weather_data}]
                                }
                            })
                
                # 发送工具结果回模型
                conversation.append({"role": "user", "content": tool_results})
                final_response = self.client.converse(
                    modelId=self.model_id,
                    messages=conversation,
                    system=self.system_prompt,
                    toolConfig=self.tool_config
                )
                
                return final_response["output"]["message"]["content"][0]["text"]
            
            else:
                return response["output"]["message"]["content"][0]["text"]
                
        except Exception as e:
            return f"错误: {str(e)}"

# 使用示例
demo = WeatherToolDemo()
result = demo.run_conversation("查询北京现在的天气情况")
print(result)

流式响应处理

实时流式输出实现

import boto3
import time

def stream_conversation():
    client = boto3.client("bedrock-runtime", region_name="us-east-1")
    model_id = "anthropic.claude-3-haiku-20240307-v1:0"
    
    conversation = [{
        "role": "user", 
        "content": [{"text": "详细解释神经网络的工作原理"}]
    }]
    
    try:
        # 使用流式Converse API
        streaming_response = client.converse_stream(
            modelId=model_id,
            messages=conversation,
            inferenceConfig={"maxTokens": 1000, "temperature": 0.3}
        )
        
        print("模型响应流:")
        full_response = ""
        
        for chunk in streaming_response["stream"]:
            if "contentBlockDelta" in chunk:
                text_delta = chunk["contentBlockDelta"]["delta"]["text"]
                print(text_delta, end="", flush=True)
                full_response += text_delta
                time.sleep(0.01)  # 模拟实时流效果
            
            elif "messageStart" in chunk:
                print("开始生成响应...")
            
            elif "messageStop" in chunk:
                print("\n\n响应生成完成")
                break
        
        return full_response
        
    except Exception as e:
        print(f"流式调用错误: {e}")

# 执行流式对话
stream_conversation()

多模态内容处理

图像和文档理解

import boto3
import base64

def analyze_image_with_text(image_path, question):
    """结合图像和文本进行分析"""
    client = boto3.client("bedrock-runtime", region_name="us-east-1")
    
    # 读取并编码图像
    with open(image_path, "rb") as image_file:
        image_data = base64.b64encode(image_file.read()).decode('utf-8')
    
    # 构建多模态消息
    messages = [
        {
            "role": "user",
            "content": [
                {
                    "image": {
                        "format": "jpeg",
                        "source": {"bytes": image_data}
                    }
                },
                {"text": question}
            ]
        }
    ]
    
    try:
        response = client.converse(
            modelId="anthropic.claude-3-sonnet-20240229-v1:0",
            messages=messages,
            inferenceConfig={"maxTokens": 500}
        )
        
        return response["output"]["message"]["content"][0]["text"]
    
    except Exception as e:
        return f"分析失败: {str(e)}"

# 使用示例
result = analyze_image_with_text("product.jpg", "描述这张图片中的产品特点")
print(result)

企业级应用场景

客户服务自动化系统

class CustomerServiceAgent:
    def __init__(self):
        self.client = boto3.client("bedrock-runtime", region_name="us-east-1")
        self.model_id = "anthropic.claude-3-opus-20240229-v1:0"
        
        self.tool_config = {
            "tools": [
                {
                    "toolSpec": {
                        "name": "QueryKnowledgeBase",
                        "description": "查询产品知识库",
                        "inputSchema": {
                            "json": {
                                "type": "object",
                                "properties": {
                                    "query": {"type": "string"}
                                },
                                "required": ["query"]
                            }
                        }
                    }
                },
                {
                    "toolSpec": {
                        "name": "CreateSupportTicket",
                        "description": "创建技术支持工单",
                        "inputSchema": {
                            "json": {
                                "type": "object",
                                "properties": {
                                    "issue_description": {"type": "string"},
                                    "customer_id": {"type": "string"},
                                    "priority": {"type": "string", "enum": ["low", "medium", "high"]}
                                },
                                "required": ["issue_description", "customer_id"]
                            }
                        }
                    }
                }
            ]
        }
        
        self.system_prompt = [{
            "text": """你是专业的客户服务助手，帮助客户解决产品相关问题。
            首先尝试使用知识库查询工具回答问题，如果无法解决则创建支持工单。
            保持友好专业的语气，准确理解客户问题。"""
        }]

    def handle_customer_query(self, customer_id, query):
        conversation = [{
            "role": "user", 
            "content": [{"text": f"客户{customer_id}咨询: {query}"}]
        }]
        
        max_iterations = 3
        for iteration in range(max_iterations):
            try:
                response = self.client.converse(
                    modelId=self.model_id,
                    messages=conversation,
                    system=self.system_prompt,
                    toolConfig=self.tool_config
                )
                
                if response["stopReason"] == "end_turn":
                    return response["output"]["message"]["content"][0]["text"]
                
                elif response["stopReason"] == "tool_use":
                    tool_results = []
                    for content in response["output"]["message"]["content"]:
                        if "toolUse" in content:
                            tool_use = content["toolUse"]
                            tool_result = self.execute_tool(tool_use)
                            tool_results.append({
                                "toolResult": {
                                    "toolUseId": tool_use["toolUseId"],
                                    "content": [{"json": tool_result}]
                                }
                            })
                    
                    conversation.append({"role": "user", "content": tool_results})
                
            except Exception as e:
                return f"处理过程中出现错误: {str(e)}"
        
        return "抱歉，无法处理您的请求，请联系人工客服"

    def execute_tool(self, tool_use):
        """实际执行工具调用"""
        if tool_use["name"] == "QueryKnowledgeBase":
            # 模拟知识库查询
            return {"answer": "根据知识库记录，该问题的解决方案是..."}
        elif tool_use["name"] == "CreateSupportTicket":
            # 模拟创建工单
            return {"ticket_id": "ST12345", "status": "created"}

性能优化与最佳实践

配置参数调优表

参数	推荐值	说明	适用场景
maxTokens	512-4096	最大生成token数	根据响应长度需求调整
temperature	0.1-0.7	创造性程度	0.1-0.3用于确定性任务，0.4-0.7用于创造性任务
topP	0.7-0.9	核采样参数	控制输出的多样性
stopSequences	自定义	停止序列	用于控制响应长度或格式

错误处理与重试机制

import boto3
from botocore.exceptions import ClientError
import time
import logging

logging.basicConfig(level=logging.INFO)

class RobustConverseClient:
    def __init__(self, max_retries=3, backoff_factor=1):
        self.client = boto3.client("bedrock-runtime")
        self.max_retries = max_retries
        self.backoff_factor = backoff_factor
    
    def converse_with_retry(self, **kwargs):
        for attempt in range(self.max_retries):
            try:
                response = self.client.converse(**kwargs)
                return response
                
            except ClientError as e:
                error_code = e.response['Error']['Code']
                
                if error_code == 'ThrottlingException':
                    wait_time = self.backoff_factor * (2 ** attempt)
                    logging.warning(f"被限流，等待 {wait_time}秒后重试...")
                    time.sleep(wait_time)
                    continue
                    
                elif error_code == 'ModelTimeoutException':
                    logging.warning("模型超时，重试中...")
                    continue
                    
                else:
                    logging.error(f"不可重试错误: {error_code}")
                    raise
        
        raise Exception("达到最大重试次数")

# 使用示例
robust_client = RobustConverseClient()
try:
    response = robust_client.converse_with_retry(
        modelId="anthropic.claude-3-haiku-20240307-v1:0",
        messages=[{"role": "user", "content": [{"text": "测试消息"}]}]
    )
    print("调用成功")
except Exception as e:
    print(f"调用失败: {e}")

安全与合规性考虑

内容过滤与安全策略

def safe_converse_with_filtering(user_input):
    """带内容安全检查的Converse调用"""
    # 内容安全检查
    if contains_sensitive_content(user_input):
        return "抱歉，无法处理该请求"
    
    client = boto3.client("bedrock-runtime")
    
    try:
        response = client.converse(
            modelId="anthropic.claude-3-sonnet-20240229-v1:0",
            messages=[{"role": "user", "content": [{"text": user_input}]}],
            inferenceConfig={
                "maxTokens": 500,
                "temperature": 0.3
            }
        )
        
        # 响应内容检查
        output_text = response["output"]["message"]["content"][0]["text"]
        if contains_sensitive_content(output_text):
            return "响应内容不符合安全策略"
        
        return output_text
        
    except Exception as e:
        return f"处理失败: {str(e)}"

def contains_sensitive_content(text):
    """简单的内容安全检查"""
    sensitive_keywords = ["敏感词1", "敏感词2", "敏感词3"]
    return any(keyword in text.lower() for keyword in sensitive_keywords)