sanitize-html项目中自定义CSS变量(--resize-width)的处理技巧

2025-06-16 05:05:20作者：彭桢灵Jeremy

Clean up user-submitted HTML, preserving whitelisted elements and whitelisted attributes on a per-element basis. Built on htmlparser2 for speed and tolerance

项目地址：https://gitcode.com/gh_mirrors/sa/sanitize-html

sanitize-html作为一款广泛使用的HTML净化工具，在处理自定义CSS变量时可能会遇到一些特殊情况。本文将通过一个典型场景，深入分析如何正确处理CSS自定义属性（如--resize-width）的保留问题。

问题现象分析

当开发者尝试保留包含CSS自定义属性的style标签时，例如：

<span style="--resize-width: 379px;">...</span>

使用sanitize-html处理后，发现style属性被完全移除，即使已经在allowedStyles中明确配置了相关规则。

原因探究

这种现象主要由两个因素导致：

CSS自定义属性的特殊性：以双横线(--)开头的CSS变量在语法上不同于常规CSS属性，需要特殊处理。
正则表达式匹配问题：原始配置中的正则表达式/^[-a-zA-Z0-9_]+:\s*(.*);$/虽然理论上可以匹配自定义属性，但在实际处理过程中可能存在匹配失败的情况。

解决方案对比

方案一：优化正则表达式

理论上可以通过调整正则表达式来匹配CSS变量：

allowedStyles: {
  "*": {
    "--resize-width": [/^\d+(px|em|%)$/]
  }
}

但实际测试表明，在某些情况下这种直接匹配方式可能仍然无效。

方案二：使用transformTags钩子

更可靠的解决方案是绕过style属性的直接解析，使用transformTags进行手动处理：

{
  parseStyleAttributes: false,
  transformTags: {
    '*': function(tagName, attribs) {
      if (attribs.style) {
        const allowedStyles = ["--resize-width"];
        let cleanedStyles = [];
        
        allowedStyles.forEach(function(element) {
          const regex = new RegExp(element + ":[\\s|\\S]*?;");
          const matches = attribs.style.match(regex);
          if (matches) { 
            cleanedStyles.push(matches[0]);
          }
        });
        
        attribs.style = cleanedStyles.join(' ');
      }
      return {
        tagName: tagName,
        attribs: attribs
      };
    }
  }
}