基于AlexNet的花卉图像分类实战教程

2026-02-04 04:18:11作者：庞眉杨Will

项目背景

本教程基于深度学习图像处理项目中的AlexNet分类实现，主要演示如何使用TensorFlow框架构建AlexNet模型，并完成花卉图像的分类任务。AlexNet作为深度卷积神经网络的重要里程碑，在2012年ImageNet竞赛中取得了突破性成果，至今仍是学习计算机视觉的基础模型。

环境准备

在开始之前，请确保已安装以下Python库：

TensorFlow 2.x
Matplotlib
NumPy

数据准备

项目使用花卉数据集，包含训练集和验证集。数据目录结构如下：

data_set/
    flower_data/
        train/  # 训练集
        val/    # 验证集

代码解析

1. 数据预处理

使用ImageDataGenerator进行数据增强和标准化处理：

train_image_generator = ImageDataGenerator(
    rescale=1./255,  # 归一化
    horizontal_flip=True  # 水平翻转增强
)

validation_image_generator = ImageDataGenerator(rescale=1./255)

数据生成器配置：

train_data_gen = train_image_generator.flow_from_directory(
    directory=train_dir,
    batch_size=32,
    shuffle=True,
    target_size=(224, 224),  # AlexNet标准输入尺寸
    class_mode='categorical'  # 多分类
)

2. 类别索引处理

将类别索引保存为JSON文件，便于后续预测时使用：

class_indices = train_data_gen.class_indices
inverse_dict = dict((val, key) for key, val in class_indices.items())
with open('class_indices.json', 'w') as json_file:
    json.dump(inverse_dict, json_file, indent=4)

3. 模型构建

项目提供了两种AlexNet实现方式：

AlexNet_v1: 使用Sequential API构建
AlexNet_v2: 使用Subclassing API构建

本教程使用AlexNet_v1:

model = AlexNet_v1(im_height=224, im_width=224, num_classes=5)
model.summary()  # 打印模型结构

4. 模型训练

配置训练参数和回调函数：

model.compile(
    optimizer=tf.keras.optimizers.Adam(learning_rate=0.0005),
    loss='categorical_crossentropy',
    metrics=['accuracy']
)

callbacks = [
    ModelCheckpoint(
        filepath='./save_weights/myAlex.h5',
        save_best_only=True,  # 只保存最佳模型
        monitor='val_loss'    # 根据验证损失监控
    )
]

开始训练：

history = model.fit(
    x=train_data_gen,
    steps_per_epoch=total_train // batch_size,
    epochs=10,
    validation_data=val_data_gen,
    validation_steps=total_val // batch_size,
    callbacks=callbacks
)

5. 训练可视化

绘制训练过程中的损失和准确率曲线：

plt.figure()
plt.plot(history.history['loss'], label='train_loss')
plt.plot(history.history['val_loss'], label='val_loss')
plt.legend()
plt.xlabel('Epochs')
plt.ylabel('Loss')
plt.show()