feat: skip bytecode evaluation for some rules without string matches #1927

secDre4mer · 2023-06-13T16:11:53Z

Optimize a common case where YARA conditions are formed like e.g. "... and 1 of them and ...", in other words, requiring a string match to ever be true.
By noting these cases and recording in a bitmap if a string match occurred, the condition evaluation for these rules can be skipped entirely in most cases.

Optimize a common case where YARA conditions are formed like e.g. "... and 1 of them and ...", in other words, requiring a string match to ever be true. By noting these cases and recording in a bitmap if a string match occurred, the condition evaluation for these rules can be skipped entirely in most cases.

secDre4mer · 2023-06-15T08:45:11Z

In case it's relevant, some background information that led to this PR:
We use ~20000 YARA rules in a ruleset and noticed that in such a large ruleset, condition evaluation takes up a serious amount of CPU time (~50-70% of total YARA scan time).
This PR tries to reduce the overhead created by conditions (in our case, ~90% of all rules fall into the "only evaluate if a string matches" category introduced by the PR). Initial timings look good, with condition evaluation time dropping by ~65% in our case.

Some additional optimizations (like: tracking which strings actually have to match for the condition to possibly be true, tracking the number of string matches, ...) could be implemented if considered worthwhile.

plusvic · 2023-06-16T07:25:40Z

libyara/scanner.c

@@ -498,6 +508,17 @@ YR_API int yr_scanner_scan_mem_blocks(
    if (result != ERROR_SUCCESS)
      goto _exit;

+    for (i = 0; i < scanner->rules->num_rules; i++)


My only concern is this loop. The loop gets executed with every scanned file, and it may take a while when you have hundreds of thousands of rules as we have in VirusTotal. I think this could be donde more efficiently if we store the original bitmask in scanner->rules, and with every scanned file rule_evaluate_condition_flags is initialized by making a copy of the original bitmask using memcpy. This should be much more efficient as memcpy will copy multiple bits per CPU instruction, instead of having multiple instructions per bit, like in this case.

PR #1927 didn't take into account that as a side-effect of skipping the evaluation of global rules, the corresponding namespace was not flagged as unsatisfied (i.e: a global rule that evaluates to false prevents all other rules in the namespace from evaluating to true). The issue was not caught by any test case because the test case for global rules were using rules that didn't required any strings to match, and therefore were unaffected by the changes introduced in #1927. This change makes sure that any global rule that is skipped is considered false, and therefore its namespace is flagged as unsatisfied. Closes #2022.

plusvic reviewed Jun 16, 2023

View reviewed changes

secDre4mer added 2 commits June 16, 2023 10:16

feat: only initialize bitmap once at rules load time

12eb727

fix: handle calloc fail

dfa5ae0

plusvic approved these changes Jun 30, 2023

View reviewed changes

plusvic merged commit 1c309a8 into VirusTotal:master Jun 30, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: skip bytecode evaluation for some rules without string matches #1927

feat: skip bytecode evaluation for some rules without string matches #1927

secDre4mer commented Jun 13, 2023

secDre4mer commented Jun 15, 2023

plusvic Jun 16, 2023

secDre4mer Jun 16, 2023

feat: skip bytecode evaluation for some rules without string matches #1927

feat: skip bytecode evaluation for some rules without string matches #1927

Conversation

secDre4mer commented Jun 13, 2023

secDre4mer commented Jun 15, 2023

plusvic Jun 16, 2023

Choose a reason for hiding this comment

secDre4mer Jun 16, 2023

Choose a reason for hiding this comment