解决GraphRAG项目中LLM调用错误的技术方案

2025-05-08 20:29:15作者：邵娇湘

在使用GraphRAG项目时，开发者可能会遇到LLM（大语言模型）调用失败的问题。本文深入分析该问题的根源，并提供一套完整的技术解决方案。

问题现象分析

当运行GraphRAG项目时，系统抛出如下错误：

Traceback (most recent call last):
  File "/graphrag/llm/base/base_llm.py", line 55, in _invoke
    output = await self._execute_llm(input, **kwargs)

这表明在调用大语言模型时出现了异常。经过深入排查，发现问题根源在于Ollama服务未正确启动。

技术背景

Ollama是一个用于本地运行大型语言模型的工具，需要保持后台服务运行才能正常响应API请求。在Kaggle等环境中，服务可能会因为各种原因未能自动启动。

解决方案

方案一：主动检测并启动服务

在base_llm.py文件中添加服务检测和启动功能：

import psutil
import subprocess
import time

def is_process_running(process_name):
    """检测指定进程是否正在运行"""
    for proc in psutil.process_iter(['pid', 'name']):
        if process_name.lower() in proc.info['name'].lower():
            return True
    return False

def start_ollama():
    """启动Ollama服务"""
    command = "nohup ollama serve &"
    process = subprocess.Popen(
        command, 
        shell=True, 
        stdout=subprocess.PIPE, 
        stderr=subprocess.PIPE
    )
    time.sleep(5)  # 等待服务初始化
    return process.pid

方案二：异常捕获后重启服务

在_invoke方法中添加异常处理逻辑：

async def _invoke(self, input: TIn, **kwargs: Unpack[LLMInput]) -> LLMOutput[TOut]:
    try:
        output = await self._execute_llm(input, **kwargs)
    except Exception as e:
        if "connection" in str(e).lower():
            start_ollama()
            # 重试逻辑
            output = await self._execute_llm(input, **kwargs)
        else:
            raise