Event Loop Problems in Ragas Asynchronous Evaluation: Analysis and Solution

2025-05-26 01:07:04 Author: 宣海椒Queenly

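Background

When evaluating a question-answering system with Ragas, developers may run into a typical asynchronous-programming problem: the "ExceptionInRunner" error. Calling the evaluate function raises "RuntimeError: This event loop is already running", and the evaluation cannot complete.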

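Technical Analysis

The root cause lies in Python's asynchronous programming model. Interactive environments such as Jupyter Notebook already have an event loop running. When the Ragas evaluation function tries to create and run a new event loop of its own, the two conflict. The sequence is:

The main thread has already started an event loop
The Ragas Executor tries to create a new event loop to run its asynchronous tasks
The conflict is detected and a RuntimeError is raised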
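
The conflict described above can be reproduced without Ragas. The following is a minimal sketch using only the standard library; the names inner and outer are made up for illustration, and it is meant to be run as a plain script rather than in a notebook cell. It tries to drive an event loop that is already running and captures the resulting error message.

```python
import asyncio


async def inner() -> str:
    return "done"


async def outer() -> str:
    # Inside this coroutine a loop is already running, similar to
    # code executed in a Jupyter Notebook cell.
    loop = asyncio.get_running_loop()
    coro = inner()
    try:
        # Asking the running loop to run again is what triggers the error.
        loop.run_until_complete(coro)
    except RuntimeError as exc:
        coro.close()  # avoid a "coroutine was never awaited" warning
        return str(exc)
    return "no error"


message = asyncio.run(outer())
print(message)
```

The printed message is the same RuntimeError text reported in the Background section.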

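Solution

The community offers a mature fix for this problem: the nest_asyncio library. It patches Python's asyncio module so that new asynchronous operations can run nested inside an environment that already has a running event loop.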

The steps are as follows:

Install the required library:

pip install nest_asyncio

Apply the patch in your code:

import nest_asyncio
nest_asyncio.apply()
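
If the same code has to run both in a notebook and as a plain script, one possible variation is to apply the patch only when a loop is actually running. This is a sketch under that assumption, not something the original fix requires; loop_is_running and apply_patch_if_needed are hypothetical helper names, and only nest_asyncio.apply() and asyncio.get_running_loop() are real library calls.

```python
import asyncio


def loop_is_running() -> bool:
    """Return True when called from inside a running event loop."""
    try:
        asyncio.get_running_loop()
    except RuntimeError:
        # get_running_loop() raises when no loop is running in this thread.
        return False
    return True


def apply_patch_if_needed() -> bool:
    """Apply nest_asyncio only if a loop is already running (hypothetical helper)."""
    if not loop_is_running():
        return False
    import nest_asyncio  # imported lazily so plain scripts do not need it

    nest_asyncio.apply()
    return True


patched = apply_patch_if_needed()
print("patch applied:", patched)
```

In a plain script no loop is running at import time, so the helper leaves asyncio untouched. Calling nest_asyncio.apply() unconditionally, as shown above, remains the simpler option.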

Complete example:

from datasets import Dataset
import os
import nest_asyncio
from ragas import evaluate
from ragas.metrics import faithfulness, answer_correctness

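# Apply the patch that allows nested event loops
nest_asyncio.apply()

# Set the API key (replace the placeholder with your own key)
os.environ["OPENAI_API_KEY"] = "your-actual-api-key"

# Prepare the evaluation data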
data_samples = {
    'question': ['When was the first super bowl?', 'Who won the most super bowls?'],
    'answer': ['The first superbowl was held on Jan 15, 1967', 'The most super bowls have been won by The New England Patriots'],
    'contexts': [['The First AFL–NFL World Championship Game was an American football game played on January 15, 1967, at the Los Angeles Memorial Coliseum in Los Angeles,'], 
                ['The Green Bay Packers...Green Bay, Wisconsin.','The Packers compete...Football Conference']],
    'ground_truth': ['The first superbowl was held on January 15, 1967', 'The New England Patriots have won the Super Bowl a record six times']
}

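# Build the dataset and run the evaluation
dataset = Dataset.from_dict(data_samples)
score = evaluate(dataset, metrics=[faithfulness, answer_correctness])
# Convert the result to a pandas DataFrame (shown automatically in a notebook)
score.to_pandas()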