Convolutional KANs 教程

2026-01-16 09:38:00作者：明树来

This project extends the idea of the innovative architecture of Kolmogorov-Arnold Networks (KAN) to the Convolutional Layers, changing the classic linear transformation of the convolution to learnable non linear activations in each pixel.

项目地址：https://gitcode.com/gh_mirrors/co/Convolutional-KANs

1. 项目介绍

Convolutional KANs 是一个基于 GitHub 用户 AntonioTepsich 开发的项目，它扩展了 Kolmogorov-Arnold 网络（KAN）的思想到卷积层中。该项目旨在通过学习非线性激活函数替代传统的卷积操作中的线性转换，从而减少参数数量并提高模型的表达能力。KAN 卷积网络利用 B-Spline 进行平滑表示，允许在不增加额外计算复杂性的前提下处理更复杂的视觉信息。

2. 项目快速启动

要开始使用 Convolutional-KANs 库，首先确保已安装所有依赖项：

git clone git@github.com:AntonioTepsich/Convolutional-KANs.git
cd Convolutional-KANs
pip install -r requirements.txt

接下来，可以导入 KANConv 类来创建自定义的 KAN 卷积层：

import torch
from torch import nn
from torch.nn.functional import F
from kan_convolutional.KANConv import KAN_Convolutional_Layer

class KANC_Network(nn.Module):
    def __init__(self, device='cpu'):
        super(KANC_Network, self).__init__()
        self.conv1 = KAN_Convolutional_Layer(
            n_convs=5,
            kernel_size=(3, 3),
            device=device
        )
        # 添加更多层...
        
    def forward(self, x):
        out = self.conv1(x)
        # 添加其他操作...
        return out

model = KANC_Network().to('cuda' if torch.cuda.is_available() else 'cpu')

3. 应用案例与最佳实践

3.1 使用随机搜索进行超参数调优

在实际应用中，可以使用随机搜索策略优化模型的性能：

from sklearn.model_selection import RandomizedSearchCV
from keras.wrappers.scikit_learn import KerasClassifier

def create_model():
    model = KANC_Network()
    return model

keras_clf = KerasClassifier(build_fn=create_model, epochs=10, batch_size=32, verbose=0)
param_distribs = {
    "n_convs": list(range(3, 7)),
    "kernel_size": [(3, 3), (5, 5)],
}
clf = RandomizedSearchCV(keras_clf, param_distributions=param_distribs, cv=3)
clf.fit(X_train, y_train)