scikit-opt中的遗传算法（GA）详解

2026-02-04 04:27:13作者：幸俭卉

引言：优化问题的智能求解新范式

在工程优化、机器学习参数调优、路径规划等众多领域，我们经常面临复杂的非线性优化问题。传统数学方法往往陷入局部最优解，而遗传算法（Genetic Algorithm，GA）作为一种仿生智能优化算法，通过模拟自然选择和遗传机制，能够有效跳出局部最优，寻找全局最优解。

scikit-opt作为Python生态中强大的群体智能算法库，提供了完整且高效的遗传算法实现。本文将深入解析scikit-opt中遗传算法的核心原理、实现细节和实际应用，帮助读者掌握这一强大的优化工具。

遗传算法核心原理

算法基本流程

遗传算法模拟生物进化过程，主要包含以下步骤：

flowchart TD
    A[初始化种群] --> B[染色体解码]
    B --> C[适应度评估]
    C --> D[选择操作]
    D --> E[交叉操作]
    E --> F[变异操作]
    F --> G{满足终止条件?}
    G -->|是| H[输出最优解]
    G -->|否| B

关键概念解析

生物学术语	遗传算法对应	在scikit-opt中的实现
染色体（Chromosome）	解编码	二进制串或实数编码
基因（Gene）	解的分量	染色体中的每一位
种群（Population）	解集合	`Chrom`矩阵
适应度（Fitness）	目标函数值	`FitV`向量
选择（Selection）	优胜劣汰	锦标赛选择、概率选择
交叉（Crossover）	基因重组	单点交叉、两点交叉
变异（Mutation）	基因突变	位翻转变异

scikit-opt遗传算法架构解析

核心类结构

scikit-opt提供了多种遗传算法变体，主要包含以下类：

classDiagram
    class GeneticAlgorithmBase {
        +Chrom: 种群染色体
        +X: 解码后的解
        +Y: 适应度值
        +FitV: 适应度评估值
        +run(): 执行算法
        +ranking(): 适应度排序
        +selection(): 选择操作
        +crossover(): 交叉操作
        +mutation(): 变异操作
    }
    
    class GA {
        +gray2rv(): 格雷码转实数
        +crtbp(): 创建初始种群
    }
    
    class EGA {
        +n_elitist: 精英保留数量
    }
    
    class RCGA {
        +crossover_SBX(): 模拟二进制交叉
    }
    
    class GA_TSP {
        +crossover_pmx(): 部分匹配交叉
    }
    
    GeneticAlgorithmBase <|-- GA
    GA <|-- EGA
    GeneticAlgorithmBase <|-- RCGA
    GeneticAlgorithmBase <|-- GA_TSP

编码与解码机制

scikit-opt支持多种编码方式，其中最常用的是二进制编码和实数编码：

二进制编码（GA类）

# 染色体解码过程
def chrom2x(self, Chrom):
    cumsum_len_segment = self.Lind.cumsum()
    X = np.zeros(shape=(self.size_pop, self.n_dim))
    for i, j in enumerate(cumsum_len_segment):
        if i == 0:
            Chrom_temp = Chrom[:, :cumsum_len_segment[0]]
        else:
            Chrom_temp = Chrom[:, cumsum_len_segment[i-1]:cumsum_len_segment[i]]
        X[:, i] = self.gray2rv(Chrom_temp)  # 格雷码转实数
    
    X = self.lb + (self.ub - self.lb) * X  # 映射到实际取值范围
    return X

实数编码（RCGA类）

# 实数编码直接使用随机浮点数
def crtbp(self):
    self.Chrom = np.random.random([self.size_pop, self.n_dim])
    return self.Chrom

def chrom2x(self, Chrom):
    X = self.lb + (self.ub - self.lb) * self.Chrom
    return X

遗传操作符实现

选择操作（Selection）

scikit-opt提供了多种选择策略：

# 锦标赛选择（默认使用）
def selection_tournament_faster(self, tourn_size=3):
    aspirants_idx = np.random.randint(self.size_pop, 
                                    size=(self.size_pop, tourn_size))
    aspirants_values = self.FitV[aspirants_idx]
    winner = aspirants_values.argmax(axis=1)
    sel_index = [aspirants_idx[i, j] for i, j in enumerate(winner)]
    self.Chrom = self.Chrom[sel_index, :]
    return self.Chrom

# 概率选择
def selection_probability_1(self):
    FitV = self.FitV
    FitV = FitV - FitV.min() + 1e-10
    sel_prob = FitV / FitV.sum()
    sel_index = np.random.choice(range(self.size_pop), 
                               size=self.size_pop, p=sel_prob)
    self.Chrom = self.Chrom[sel_index, :]
    return self.Chrom

交叉操作（Crossover）

# 两点交叉（二进制编码）
def crossover_2point_bit(self):
    half_size_pop = int(self.size_pop / 2)
    Chrom1, Chrom2 = self.Chrom[:half_size_pop], self.Chrom[half_size_pop:]
    mask = np.zeros(shape=(half_size_pop, self.len_chrom), dtype=int)
    
    for i in range(half_size_pop):
        n1, n2 = np.random.randint(0, self.len_chrom, 2)
        if n1 > n2: n1, n2 = n2, n1
        mask[i, n1:n2] = 1
    
    mask2 = (Chrom1 ^ Chrom2) & mask
    Chrom1 ^= mask2
    Chrom2 ^= mask2
    return self.Chrom

# 部分匹配交叉（TSP问题）
def crossover_pmx(self):
    for i in range(0, self.size_pop, 2):
        Chrom1, Chrom2 = self.Chrom[i], self.Chrom[i+1]
        cxpoint1, cxpoint2 = np.random.randint(0, self.len_chrom-1, 2)
        if cxpoint1 >= cxpoint2:
            cxpoint1, cxpoint2 = cxpoint2, cxpoint1 + 1
        
        # 执行部分匹配交叉
        pos1_recorder = {value: idx for idx, value in enumerate(Chrom1)}
        pos2_recorder = {value: idx for idx, value in enumerate(Chrom2)}
        
        for j in range(cxpoint1, cxpoint2):
            value1, value2 = Chrom1[j], Chrom2[j]
            pos1, pos2 = pos1_recorder[value2], pos2_recorder[value1]
            Chrom1[j], Chrom1[pos1] = Chrom1[pos1], Chrom1[j]
            Chrom2[j], Chrom2[pos2] = Chrom2[pos2], Chrom2[j]
            pos1_recorder[value1], pos1_recorder[value2] = pos1, j
            pos2_recorder[value1], pos2_recorder[value2] = j, pos2

变异操作（Mutation）

# 基本位变异
def mutation(self):
    mask = (np.random.rand(self.size_pop, self.len_chrom) < self.prob_mut)
    self.Chrom ^= mask  # 异或操作实现位翻转
    return self.Chrom

# TSP问题的逆转变异
def mutation_reverse(self):
    for i in range(self.size_pop):
        if np.random.rand() < self.prob_mut:
            self.Chrom[i] = reverse(self.Chrom[i])
    return self.Chrom

def reverse(individual):
    n1, n2 = np.random.randint(0, individual.shape[0]-1, 2)
    if n1 >= n2: n1, n2 = n2, n1 + 1
    individual[n1:n2] = individual[n1:n2][::-1]
    return individual

实际应用案例

案例1：函数优化问题

import numpy as np
from sko.GA import GA

# 定义Schaffer测试函数（多局部极值点）
def schaffer(p):
    x1, x2 = p
    part1 = np.square(x1) - np.square(x2)
    part2 = np.square(x1) + np.square(x2)
    return 0.5 + (np.square(np.sin(part1)) - 0.5) / np.square(1 + 0.001 * part2)

# 创建遗传算法实例
ga = GA(func=schaffer, n_dim=2, size_pop=50, max_iter=800, 
        prob_mut=0.001, lb=[-1, -1], ub=[1, 1], precision=1e-7)

# 执行优化
best_x, best_y = ga.run()
print(f'最优解: {best_x}, 最优值: {best_y}')

案例2：旅行商问题（TSP）

import numpy as np
from scipy import spatial
from sko.GA import GA_TSP

# 生成随机城市坐标
num_points = 20
points_coordinate = np.random.rand(num_points, 2)
distance_matrix = spatial.distance.cdist(points_coordinate, 
                                       points_coordinate, metric='euclidean')

# 定义目标函数（总路径长度）
def cal_total_distance(routine):
    return sum([distance_matrix[routine[i % num_points], 
                routine[(i + 1) % num_points]] for i in range(num_points)])

# 创建TSP遗传算法实例
ga_tsp = GA_TSP(func=cal_total_distance, n_dim=num_points, 
                size_pop=100, max_iter=500, prob_mut=0.2)

# 求解最优路径
best_route, best_distance = ga_tsp.run()
print(f'最优路径: {best_route}, 最短距离: {best_distance}')

案例3：带约束的优化问题

from sko.GA import GA

# 定义目标函数和约束
def objective_func(x):
    return x[0]**2 + x[1]**2 + x[2]**2

# 等式约束：x1 + x2 = 1
constraint_eq = [lambda x: 1 - x[0] - x[1]]

# 不等式约束：x0*x1 >= 1, x0*x1 <= 5
constraint_ueq = [
    lambda x: 1 - x[0] * x[1],
    lambda x: x[0] * x[1] - 5
]

# 创建带约束的遗传算法实例
ga_constrained = GA(func=objective_func, n_dim=3, size_pop=50, max_iter=500,
                   lb=[0, 0, 0], ub=[5, 5, 5],
                   constraint_eq=constraint_eq, constraint_ueq=constraint_ueq)

best_x, best_y = ga_constrained.run()

高级特性与性能优化

用户自定义操作符（UDF）

scikit-opt支持用户自定义遗传操作符，极大增强了算法的灵活性：

from sko.GA import GA
from sko.operators import ranking, crossover, mutation

# 自定义选择操作符
def custom_selection(algorithm, elite_size=5):
    # 精英保留策略
    elite_idx = algorithm.FitV.argsort()[-elite_size:]
    selected_idx = list(elite_idx)
    
    # 锦标赛选择剩余个体
    for _ in range(algorithm.size_pop - elite_size):
        aspirants = np.random.choice(algorithm.size_pop, size=3)
        selected_idx.append(max(aspirants, key=lambda i: algorithm.FitV[i]))
    
    algorithm.Chrom = algorithm.Chrom[selected_idx, :]
    return algorithm.Chrom

# 注册自定义操作符
ga = GA(func=schaffer, n_dim=2, size_pop=50, max_iter=200)
ga.register(operator_name='selection', operator=custom_selection, elite_size=5)
ga.register(operator_name='ranking', operator=ranking.ranking)
ga.register(operator_name='crossover', operator=crossover.crossover_2point)
ga.register(operator_name='mutation', operator=mutation.mutation)

best_x, best_y = ga.run()

并行计算加速

scikit-opt支持多进程并行计算，显著提升大规模问题的求解速度：

# 使用多进程加速适应度计算
ga_parallel = GA(func=complex_function, n_dim=10, size_pop=100, 
                 max_iter=1000, n_processes=4)  # 使用4个进程

# GPU加速（需要PyTorch）
ga_gpu = GA(func=complex_function, n_dim=10, size_pop=100, max_iter=1000)
ga_gpu = ga_gpu.to('cuda')  # 转移到GPU计算

算法参数调优指南

参数	推荐范围	影响说明	调整策略
`size_pop`	20-200	种群大小	问题复杂度高时增大
`max_iter`	100-5000	最大迭代次数	根据收敛情况调整
`prob_mut`	0.001-0.1	变异概率	避免早熟收敛时增大
`lb/ub`	问题相关	变量边界	根据实际问题设定
`precision`	1e-3-1e-7	求解精度	精度要求高时减小

算法性能分析与对比

收敛性分析

通过分析历代最优解的变化，可以评估算法的收敛性能：

import matplotlib.pyplot as plt
import pandas as pd

# 绘制收敛曲线
Y_history = pd.DataFrame(ga.all_history_Y)
plt.figure(figsize=(10, 6))
plt.subplot(2, 1, 1)
plt.plot(Y_history.index, Y_history.values, '.', alpha=0.5, color='red')
plt.title('历代所有个体适应度值')

plt.subplot(2, 1, 2)
plt.plot(Y_history.min(axis=1).cummin(), linewidth=2)
plt.title('历代最优适应度值变化')
plt.xlabel('迭代次数')
plt.ylabel('适应度值')
plt.tight_layout()
plt.show()

与其他优化算法对比

算法类型	优点	缺点	适用场景
遗传算法（GA）	全局搜索能力强，并行性好	收敛速度较慢，参数敏感	多峰函数、组合优化
粒子群优化（PSO）	收敛速度快，参数少	易陷入局部最优	连续优化问题
差分进化（DE）	鲁棒性强，精度高	计算开销大	高精度优化
模拟退火（SA）	理论保证全局收敛	收敛速度慢	小规模问题