Analysis of a Memory Leak in NPOI Caused by an Undisposed ByteArrayOutputStream

2025-06-05 04:33:23 Author: 余洋婵Anita

Background

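In NPOI 2.7.1-rc1, the ToByteArray method of the IOUtils utility class carries a potential memory leak risk. The method reads a specified number of bytes from an input stream, but its implementation never releases the ByteArrayOutputStream it allocates.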

Analysis of the Problematic Code

The current implementation of ToByteArray creates a ByteArrayOutputStream to buffer the data read from the input stream:

public static byte[] ToByteArray(Stream stream, int length)
{
    ByteArrayOutputStream baos = new ByteArrayOutputStream(length == Int32.MaxValue ? 4096 : length);
    
    byte[] buffer = new byte[4096];
    int totalBytes = 0, readBytes;
    do
    {
        readBytes = stream.Read(buffer, 0, Math.Min(buffer.Length, length - totalBytes));
        totalBytes += Math.Max(readBytes, 0);
        if (readBytes > 0)
        {
            baos.Write(buffer, 0, readBytes);
        }
    } while (totalBytes < length && readBytes > 0);
    
    if (length != Int32.MaxValue && totalBytes < length)
    {
        throw new IOException("unexpected EOF");
    }
    
    return baos.ToByteArray();
}

Severity

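Resource leak: the ByteArrayOutputStream object is never explicitly disposed after the method finishes, on either the normal return path or the "unexpected EOF" exception path.
Memory usage: when large files are processed or the method is called frequently, undisposed instances can accumulate and increase memory pressure.
Performance impact: long-running applications may run short of memory, which degrades overall performance.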

Solution

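The correct approach is to call Dispose on the ByteArrayOutputStream once it is no longer needed. Because ByteArrayOutputStream implements the IDisposable interface, the best practice is to wrap it in a using statement, which guarantees disposal even when an exception is thrown: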

public static byte[] ToByteArray(Stream stream, int length)
{
    using (ByteArrayOutputStream baos = new ByteArrayOutputStream(length == Int32.MaxValue ? 4096 : length))
    {
        byte[] buffer = new byte[4096];
        int totalBytes = 0, readBytes;
        do
        {
            readBytes = stream.Read(buffer, 0, Math.Min(buffer.Length, length - totalBytes));
            totalBytes += Math.Max(readBytes, 0);
            if (readBytes > 0)
            {
                baos.Write(buffer, 0, readBytes);
            }
        } while (totalBytes < length && readBytes > 0);
        
        if (length != Int32.MaxValue && totalBytes < length)
        {
            throw new IOException("unexpected EOF");
        }
        
        return baos.ToByteArray();
    }
}
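
The same pattern can also be written with a C# 8 using declaration, which avoids the extra nesting level. The following is only a minimal sketch under stated assumptions, not NPOI's actual code: ReadHelper and ReadBytes are hypothetical names, and System.IO.MemoryStream stands in for ByteArrayOutputStream so that the example compiles without referencing NPOI.

```csharp
using System;
using System.IO;

public static class ReadHelper
{
    // Hypothetical stand-in for IOUtils.ToByteArray. MemoryStream replaces
    // NPOI's ByteArrayOutputStream (an assumption made for this sketch).
    public static byte[] ReadBytes(Stream stream, int length)
    {
        // Using declaration: Dispose runs when the method exits, including
        // when the IOException below is thrown.
        using var buffered = new MemoryStream(length == int.MaxValue ? 4096 : length);

        byte[] buffer = new byte[4096];
        int totalBytes = 0, readBytes;
        do
        {
            // Never request more than the number of bytes still outstanding.
            readBytes = stream.Read(buffer, 0, Math.Min(buffer.Length, length - totalBytes));
            if (readBytes > 0)
            {
                buffered.Write(buffer, 0, readBytes);
                totalBytes += readBytes;
            }
        } while (totalBytes < length && readBytes > 0);

        // A concrete length was requested but the stream ended early.
        if (length != int.MaxValue && totalBytes < length)
        {
            throw new IOException("unexpected EOF");
        }

        // ToArray returns a copy, so the result stays valid after Dispose.
        return buffered.ToArray();
    }
}
```

For example, calling ReadHelper.ReadBytes(new MemoryStream(new byte[] { 1, 2, 3, 4, 5 }), 3) returns the first three bytes, while requesting 10 bytes from the same five-byte stream throws an IOException. In both cases the buffer stream is disposed before control leaves the method.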