Fuzzing with american fuzzy lop #364

ilammy · 2019-02-03T09:28:31Z

This one has been on my mind for quite a while and finally I've managed to get it going. Let's throw a security fuzzer into the breach and see what it finds for us. (Oh boy, some crashes does it find! I've got seven of them reported in seven minutes. Based on a single input file with no other hints.)

Here we add american fuzzy lop because I like how little tweaking and configuration it requires. Basically, you feed it some example input data and it uses -- high technology from the 1960s -- artificial intelligence (!) and machine learning (!!) to work through a big data (!!!) queue of tests, trying to invent ones that crash your application.

You can read the user manual (of a sort) in the README. In order to run the fuzzer you need to install it first (`apt-get install afl` or similar) and then run `make fuzz FUZZ_BIN=something` to build and run the tools.

Implementation-wise, we make a custom build of Themis (by recursively calling make because that's the easiest way) which is instrumented by a special compiler. This allows the fuzzer to monitor the behavior of Themis and the tools and see how the input influences the control flow in the program.

Only two tools are implemented for starters:

- Round-trip through a Secure Cell in sealing mode.

  This makes sure that user input cannot be used to produce a secure container which crashes the application during processing.

- Decrypting a container with Secure Cell in sealing mode.

  This makes sure that data corruption cannot cause the application to crash when receiving messages encrypted with Secure Cell from untrusted sources.

It should be easy to add more tools in the future. For example, Secure Message could use a fuzzer for key files. Other components use the same containers as Secure Cell so fuzzing the encrypted data may not be so fruitful, but it may still be worth a shot.

I admit that error handling and memory management in the tools are a bit sloppy, but that should be fine unless they crash in unexpected places. It's just too verbose to do everything right.

Finally, we add a make fuzz step to the CI build in order to keep up with API changes. We don't run the fuzzing automatically but let's at least ensure that the tools can be compiled. (We can also check that they can handle the input data but it's not that important.)

The makefile unconditionally includes "tools/afl/fuzzy.mk" so it has to be available during Themis builds. The Rust wrapper's "libthemis-src" embeds Themis source code but tries to minimize its footprint by including only the bare necessities. This new file is one of them. (And here I started wondering whether this setup is a good one... Maybe we should simply symlink the whole Themis repo directory and deal with it by adding a special script for publishing crates.)

vixentael · 2019-02-03T16:13:08Z

Wow, that's really cool!
I'll play with it too :)

I believe that in the future we might want to add fuzzer tests to CI as well (depending on the number of false positives).

vixentael · 2019-02-03T17:13:47Z

Did you find a way to investigate the crash results?

I found that afl has a crash triage tool for pointing out the crash source, but it's written for GDB (I have LLDB), and I haven't found an easy way to point LLDB to the crash.

gene-eu-zz · 2019-02-03T18:08:22Z

In fact, we manually fuzzed Themis up to some version, but the results and methodology were rather obscure (it turned up some bugs that we fixed, but for the last 2 years we have barely done it because of limited changes in Themis core). But if we can come up with a plausible automatic scheme and add it to CI, my heart will beat calmer.

ilammy · 2019-02-03T20:14:44Z

> I believe in future we might want to add fuzzer tests in CI as well (depends on false positives number).

Automation raises a whole bunch of new questions. For example, how are we going to collect the results? We'll need to salvage the test inputs with interesting behavior, get them out of the build box somehow, etc. Then there's resource allocation where fuzzing may find something interesting, but this would require a couple of hours of torture tests or something. There's little value in shallow tests that can be run in five minutes but do not yield any interesting results in years.

I agree that this is an interesting topic to pursue though.
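One way the time-budget and result-collection questions could be approached is sketched below in Python. It is only an illustration under assumptions and not part of this pull request: the function names are made up, and it relies on afl-fuzz keeping crashing and hanging inputs in the `crashes` and `hangs` subdirectories of its output directory. The fuzzer command line itself is left to the caller.

```python
import shutil
import signal
import subprocess
from pathlib import Path


def run_time_boxed(command, seconds):
    """Run command and interrupt it with SIGINT once the time budget is spent.

    Returns the exit code of the process.
    """
    process = subprocess.Popen(command)
    try:
        return process.wait(timeout=seconds)
    except subprocess.TimeoutExpired:
        # Equivalent to pressing Ctrl-C in the terminal running the fuzzer.
        process.send_signal(signal.SIGINT)
        return process.wait()


def salvage(output_dir, artifact_dir):
    """Copy saved crash and hang inputs into artifact_dir.

    Returns the relative names of the copied files, in sorted order.
    """
    copied = []
    for kind in ("crashes", "hangs"):
        source = Path(output_dir) / kind
        if not source.is_dir():
            continue
        target = Path(artifact_dir) / kind
        target.mkdir(parents=True, exist_ok=True)
        for case in sorted(source.iterdir()):
            # Assumption: README.txt is the fuzzer's own note, not a test case.
            if case.is_file() and case.name != "README.txt":
                shutil.copy2(case, target / case.name)
                copied.append(kind + "/" + case.name)
    return copied
```

A CI job could call `run_time_boxed` with an `afl-fuzz -i <inputs> -o <findings> -- <tool>` command line and a budget of an hour or two, then pass the findings directory to `salvage` together with whatever location the build system uploads as artifacts. All of those paths are placeholders.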

> Do you find way to investigate crash results?

I've reproduced the crashes manually and saved the input data for further investigation. I haven't looked into the cause deeply yet, other than running it under a debugger and confirming that the crashes are caused by some OpenSSL functions.

I have seen a bunch of helper tools that deal with reporting automatically, but I have not looked too deeply into those yet either. For now the developers will have to manually pick up the tool binary from the build directory, pipe the failing test case into it, and observe the results. I imagine it can be automated.
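The manual procedure above (pipe each failing test case into the tool binary and observe the result) could be automated roughly like this. It is a minimal sketch, assuming the tool reads its input from stdin and that a crash shows up as the process being killed by a signal on a POSIX system. The names `replay` and `replay_all` are hypothetical.

```python
import signal
import subprocess
from pathlib import Path


def replay(command, case_path, timeout=10.0):
    """Feed one test case to the tool on stdin and describe how it exited."""
    with open(case_path, "rb") as case:
        try:
            result = subprocess.run(
                command,
                stdin=case,
                stdout=subprocess.DEVNULL,
                stderr=subprocess.DEVNULL,
                timeout=timeout,
            )
        except subprocess.TimeoutExpired:
            return "timeout"
    if result.returncode < 0:
        # On POSIX a negative return code means "killed by this signal".
        try:
            return "signal " + signal.Signals(-result.returncode).name
        except ValueError:
            return "signal %d" % -result.returncode
    return "exit %d" % result.returncode


def replay_all(command, crash_dir):
    """Replay every regular file in crash_dir and map file name to outcome."""
    cases = sorted(p for p in Path(crash_dir).iterdir() if p.is_file())
    return {case.name: replay(command, case) for case in cases}
```

Calling `replay_all` with the path to a built tool and a directory of saved inputs gives a quick table of which inputs still crash, which is handy for checking a fix. Any outcome that starts with `signal` is worth opening in a debugger.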

ilammy added the "core" (Themis Core written in C, its packages) and "infrastructure" (Automated building and packaging) labels Feb 3, 2019

ilammy requested review from vixentael, ignatk, Lagovas and shadinua February 3, 2019 09:28

vixentael approved these changes Feb 3, 2019

Lagovas approved these changes Feb 4, 2019

vixentael merged commit 1905a77 into cossacklabs:master Feb 4, 2019

ilammy deleted the afl branch February 5, 2019 11:49

ilammy mentioned this pull request Feb 6, 2019

Fix overflows in Secure Cell #367

Merged
