pymoo中实现并发安全的并行评估方法

2025-07-01 09:53:59作者：庞眉杨Will

在基于pymoo框架进行优化时，经常会遇到需要将候选解写入文件、调用外部程序执行、再读取输出结果的场景。这种评估方式在并行环境下容易产生文件冲突问题。本文将详细介绍如何在pymoo中安全地实现这种并行评估模式。

问题背景

当使用pymoo进行优化时，评估函数(_evaluate)可能需要：

将候选解写入输入文件
调用外部程序处理该文件
读取输出文件获取结果

在并行环境下，多个评估过程同时进行时，如果使用相同的文件名，会导致文件读写冲突。理想情况下，每个候选解应该使用唯一的文件路径。

解决方案

pymoo提供了两种主要的问题定义方式，对应不同的并行策略：

1. 使用ElementWiseProblem

ElementWiseProblem对每个个体单独评估，适合简单场景。但正如用户发现的那样，这种方式难以在评估函数内部实现并行，因为每个个体的评估是隔离的。

2. 使用Problem类（推荐方案）

Problem类的_evaluate方法接收整个种群作为输入，这使得我们可以在方法内部实现并行处理：

class MyProblem(Problem):
    def __init__(self):
        super().__init__(n_var=10, n_obj=1, n_constr=0)
    
    def _evaluate(self, X, out, *args, **kwargs):
        # X包含整个种群的候选解
        results = []
        
        # 为每个个体生成唯一ID
        ids = [f"temp_{i}" for i in range(len(X))]
        
        # 并行处理所有个体
        with ThreadPoolExecutor() as executor:
            futures = []
            for x, id in zip(X, ids):
                futures.append(executor.submit(self._evaluate_single, x, id))
            
            results = [f.result() for f in futures]
        
        out["F"] = np.array(results)
    
    def _evaluate_single(self, x, id):
        # 使用唯一ID创建文件
        input_file = f"/tmp/{id}.in"
        output_file = f"/tmp/{id}.out"
        
        # 写入输入文件
        with open(input_file, "w") as f:
            f.write(str(x))
        
        # 调用外部程序
        subprocess.run(["./my_program", input_file, output_file], 
                      capture_output=True)
        
        # 读取输出
        with open(output_file, "r") as f:
            result = float(f.read())
        
        return result