Kavita项目中的CBZ文件压缩优化指南

2025-05-30 22:09:50作者：冯梦姬Eddie

Kavita is a fast, feature rich, cross platform reading server. Built with the goal of being a full solution for all your reading needs. Setup your own server and share your reading collection with your friends and family.

项目地址：https://gitcode.com/gh_mirrors/ka/Kavita

问题背景

在使用Kavita漫画阅读器时，用户可能会遇到打开CBZ格式漫画文件时CPU占用率高、加载时间过长的问题。经过技术分析，这主要与CBZ文件的压缩方式有关。

技术分析

CBZ文件本质上是一种特殊的ZIP压缩包，但很多用户会使用7-Zip等工具创建这类文件，导致以下三种不利因素：

错误的压缩格式：文件虽然使用.cbz扩展名，但实际采用7zip的LZMA2:24压缩算法而非标准ZIP格式
高强度压缩：LZMA2算法虽然压缩率高，但解压需要大量CPU资源
固实压缩(Solid Compression)：这种压缩方式将所有文件视为一个连续数据块，导致必须解压整个文件才能访问其中任何内容

解决方案

最优压缩方案

对于漫画阅读场景，推荐采用"存储(Store)"模式压缩，即不对图像文件进行压缩处理。这是因为：

图像文件本身已经是压缩格式(如JPEG、PNG等)
二次压缩效果有限，通常只能节省2-5%空间
解压速度可提升数十倍

批量转换脚本

对于已有的大量CBZ文件，可以使用Python脚本进行批量转换：

#!/usr/bin/env python

import os
import sys
import subprocess
import shutil

def repair(cbz_path):
    print(cbz_path)
    tmp_dir = cbz_path + '.tmp'
    bak_path = cbz_path + '.bak'

    subprocess.check_call(['7za', 'x', cbz_path, '-o' + tmp_dir])
    os.rename(cbz_path, bak_path)

    files = map(lambda p: os.path.join(tmp_dir, p), os.listdir(tmp_dir))
    subprocess.check_call(['zip', '-j', '-0', cbz_path] + list(files))

    shutil.rmtree(tmp_dir)
    os.remove(bak_path)

def main(scan_dir):
    for root, dirs, files in os.walk(scan_dir):
        for f in files:
            if os.path.splitext(f)[1] == '.cbz':
                repair(os.path.join(root, f))

if __name__ == '__main__':
    main(sys.argv[1])