深入解析osquery中YARA规则查询的语法差异问题

2025-05-09 02:13:06作者：幸俭卉

项目地址：https://gitcode.com/gh_mirrors/osq/osquery

问题背景

在安全监控和系统审计工具osquery中，YARA规则是一个强大的功能模块，它允许用户通过自定义规则来扫描系统中的文件内容。然而，在实际使用过程中，用户可能会遇到一个看似奇怪的现象：同样的YARA查询语句在osquery交互式shell中能够正常工作，但在命令行直接执行时却会报错。

现象重现

当用户在osquery交互式shell中执行以下查询时：

select * from yara where path = '/tmp/test.txt' and sigrule = 'rule find_hello_world {strings: $hello_world = "test" condition: $hello_world}';

能够正常返回匹配结果。

但当尝试通过命令行直接执行相同查询时：

./osqueryd --verbose -S --json "select * from yara where path = '/tmp/test.txt' and sigrule = 'rule find_hello_world {strings: $hello_world = "test" condition: $hello_world}';"

却会收到YARA编译错误和"Query must specify sig_group, sigfile, or sigrule for scan"的警告信息。

技术分析

这个问题的根源在于shell环境下的字符串解析机制差异：

交互式shell环境：osquery shell会直接接收并处理完整的SQL语句，其中的引号嵌套关系能够被正确解析。
命令行环境：当通过命令行参数传递查询语句时，整个查询语句会先经过shell的解析处理。在这个过程中：
- 外层的双引号会被shell解释
- 内层的双引号会被提前闭合
- 导致最终传递给osquery的YARA规则字符串不完整

解决方案

要解决这个问题，需要正确处理命令行环境下的引号嵌套：

转义内部引号：对YARA规则中的双引号进行转义
转义变量符号：对YARA规则中的$符号进行转义
使用变量存储查询：将查询语句存储在变量中再传递

推荐的使用方式如下：

QUERY="select * from yara where path = '/tmp/test.txt' and sigrule = 'rule find_hello_world {strings: \$hello_world = \"test\" condition: \$hello_world}';"
osqueryd -S --json "$QUERY"