实时动作识别项目教程

2026-01-19 10:58:02作者：傅爽业Veleda

Apply ML to the skeletons from OpenPose; 9 actions; multiple people. (WARNING: I'm sorry that this is only good for course demo, not for real world applications !!! Those ary very difficult !!!)

项目地址：https://gitcode.com/gh_mirrors/re/Realtime-Action-Recognition

1. 项目的目录结构及介绍

Realtime-Action-Recognition/
├── data/
│   ├── demo_data/
│   ├── skeleton_data/
│   └── video_data/
├── models/
│   ├── action_recognition_model/
│   └── pose_estimation_model/
├── scripts/
│   ├── preprocessing_scripts/
│   └── training_scripts/
├── config/
│   ├── config.yaml
│   └── parameters.json
├── main.py
├── README.md
└── LICENSE

data/: 存储项目所需的数据文件，包括演示数据、骨骼数据和视频数据。
models/: 存储动作识别和姿态估计的模型文件。
scripts/: 包含数据预处理和模型训练的脚本。
config/: 项目的配置文件，包括配置参数和模型参数。
main.py: 项目的启动文件。
README.md: 项目说明文档。
LICENSE: 项目许可证。

2. 项目的启动文件介绍

main.py 是项目的启动文件，负责初始化配置、加载模型和启动实时动作识别系统。以下是 main.py 的主要功能：

读取配置文件 config/config.yaml。
加载动作识别模型和姿态估计模型。
启动摄像头或读取视频文件进行实时动作识别。

import configparser
import cv2
from models.action_recognition_model import ActionRecognitionModel
from models.pose_estimation_model import PoseEstimationModel

def main():
    # 读取配置文件
    config = configparser.ConfigParser()
    config.read('config/config.yaml')

    # 加载模型
    action_model = ActionRecognitionModel(config)
    pose_model = PoseEstimationModel(config)

    # 启动摄像头或读取视频文件
    cap = cv2.VideoCapture(0)
    while True:
        ret, frame = cap.read()
        if not ret:
            break

        # 进行姿态估计和动作识别
        pose_results = pose_model.predict(frame)
        action_results = action_model.predict(pose_results)

        # 显示结果
        cv2.imshow('Action Recognition', frame)
        if cv2.waitKey(1) & 0xFF == ord('q'):
            break

    cap.release()
    cv2.destroyAllWindows()

if __name__ == '__main__':
    main()

3. 项目的配置文件介绍

config/config.yaml 是项目的配置文件，包含了模型路径、参数设置和其他配置信息。以下是配置文件的主要内容：

model_paths:
  action_recognition: 'models/action_recognition_model/model.h5'
  pose_estimation: 'models/pose_estimation_model/model.h5'

parameters:
  detection_threshold: 0.5
  tracking_buffer_size: 10
  action_recognition_threshold: 0.8

video_settings:
  input_device: 0  # 摄像头设备ID或视频文件路径
  frame_width: 640
  frame_height: 480
  fps: 30

model_paths: 指定动作识别和姿态估计模型的路径。
parameters: 包含检测阈值、跟踪缓冲区大小和动作识别阈值等参数。
video_settings: 视频输入设备的设置，包括输入设备ID、帧宽、帧高和帧率。

通过修改 config.yaml 文件，可以调整项目的运行参数和模型路径，以适应不同的应用场景和需求。

Realtime-Action-Recognition

Apply ML to the skeletons from OpenPose; 9 actions; multiple people. (WARNING: I'm sorry that this is only good for course demo, not for real world applications !!! Those ary very difficult !!!)

项目地址：https://gitcode.com/gh_mirrors/re/Realtime-Action-Recognition

登录后查看全文