【亲测免费】开源项目 `manga-image-translator` 使用教程

2026-01-16 09:18:54作者：韦蓉瑛

1. 项目的目录结构及介绍

manga-image-translator/
├── README.md
├── README_CN.md
├── LICENSE
├── requirements.txt
├── setup.py
├── manga_translator/
│   ├── __init__.py
│   ├── main.py
│   ├── config.py
│   ├── utils.py
│   ├── models/
│   │   ├── __init__.py
│   │   ├── text_detector.py
│   │   ├── ocr.py
│   │   ├── inpainter.py
│   │   ├── upscaler.py
│   ├── data/
│   │   ├── samples/
│   │   ├── models/
│   ├── scripts/
│   │   ├── translate.py
│   │   ├── train.py
│   ├── tests/
│   │   ├── __init__.py
│   │   ├── test_main.py
├── docs/
│   ├── installation.md
│   ├── usage.md
│   ├── api.md
│   ├── contributing.md
├── examples/
│   ├── sample_image.jpg
│   ├── translated_image.jpg

目录结构说明

README.md 和 README_CN.md: 项目介绍和使用说明。
LICENSE: 项目许可证文件。
requirements.txt: 项目依赖文件。
setup.py: 项目安装脚本。
manga_translator/: 项目主代码目录。
- main.py: 项目启动文件。
- config.py: 项目配置文件。
- utils.py: 项目工具函数。
- models/: 模型相关代码。
- data/: 数据文件，包括样本和预训练模型。
- scripts/: 脚本文件，包括翻译和训练脚本。
- tests/: 测试代码。
docs/: 项目文档。
examples/: 示例图片。

2. 项目的启动文件介绍

`main.py`

main.py 是项目的启动文件，负责初始化配置、加载模型和启动翻译服务。以下是主要功能模块：

初始化配置: 从 config.py 中读取配置参数。
加载模型: 根据配置加载文本检测、OCR、修复和放大模型。
启动服务: 提供命令行接口和Web接口，支持批量翻译和单张图片翻译。

示例代码

from manga_translator import main

if __name__ == "__main__":
    main.run()

3. 项目的配置文件介绍

`config.py`

config.py 是项目的配置文件，包含所有可配置的参数。以下是主要配置项：

模型路径: 指定预训练模型的路径。
GPU设置: 是否使用GPU进行计算。
文本检测器: 选择使用的文本检测模型。
OCR模型: 选择使用的OCR模型。
修复模型: 选择使用的图像修复模型。
放大模型: 选择使用的图像放大模型。

示例配置

# config.py

CONFIG = {
    "model_dir": "/models",
    "use_gpu": True,
    "detector": "ctd",
    "ocr": "48px",
    "inpainter": "lama_large",
    "upscaler": "waifu2x",
    "upscale_ratio": 2.0
}