Initial implementation of tests with sanitizers #21

tgross35 · 2023-10-06T05:34:39Z

It was proposed to start running sanitizers as part of occasional CI in the same way we run Miri; this is a first pass at doing that.

Originally my plan was to fork the repo but since there is quite a bit of common behavior, I think it could make sense to keep it together.

RalfJung · 2023-10-06T05:56:50Z

My concern is that I have basically zero experience with sanitizers, so I'd be rather helpless when something goes wrong with them.

Would it be possible to isolate these tests more from each other? Like, have a miri subfolder and a sanitizer subfolder, and have the rust-src.diff and rust-version file inside those subfolders, so that I can bump and patch the Miri version without affecting the sanitizers?

RalfJung · 2023-10-06T05:57:29Z

ci-sanitizers-test.sh

+esac
+
+
+# run the tests (some also without validation, to exercise those code paths in Miri)


The comments here still refer to Miri? Also running on other targets will not work so easily with sanitizers.

RalfJung · 2023-10-06T05:58:58Z

Or alternatively, would it make sense to look into having this as part of rustc CI?

tgross35 · 2023-10-06T06:18:29Z

I wasn’t expecting a review quite so fast 😂 I just started a PR so I could see CI results, so a lot of the comments aren’t updated.

Do you mean as a periodic task in rust-lang/rust? I suppose that would be an option too, but I don’t know how that would interact with everything else if these tests fail. I don’t really know how likely failures are, this will be interesting.

If you’d prefer it in a separate repo that’s fine too of course. It just seemed that with a lot of overlap, having one place to make any setup changes is easier than two.

Regarding failures, I honestly don’t know what to expect with this and I don’t know who maintains the implementation. This came out of a suggestion to dogfood the sanitizers feature since it may head to stabilization soon. So I’ll fix the directories, and just consider this borrowing your CI for a bit until someone else can chime in :)

RalfJung · 2023-10-06T06:25:44Z

> Do you mean as a periodic task in rust-lang/rust? I suppose that would be an option too, but I don’t know how that would interact with everything else if these tests fail. I don’t really know how likely failures are, this will be interesting.

Periodic, or even with each PR, not sure how far we want to go.

> If you’d prefer it in a separate repo that’s fine too of course. It just seemed that with a lot of overlap, having one place to make any setup changes is easier than two.

How big is the overlap in the end? All of the scripts that run the actual tests are separate, right? The overlap is in the crate setup that lets us invoke cargo test in the first place?

I'd say it depends on how much the parts that are different can be isolated, so that I don't have to worry about the sanitizer part when I need to patch the Miri part. If that can be done, then I'm fine with sharing the parts that can be shared.

Also someone should feel in charge of keeping this working so I can ping them in case of trouble. Would you be that someone?

RalfJung · 2023-10-06T06:27:34Z

We should probably rename the project if we land this.^^ miri-test-libstd was anyway not a great name since it also tests libcore and more...

tgross35 · 2023-10-06T06:47:41Z

> Periodic, or even with each PR, not sure how far we want to go.

You can only compile with one sanitizer at a time, and we supposedly support like 7.
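
Since each build can enable only one sanitizer, covering several of them means one full build-and-test pass per sanitizer. A minimal sketch of such a loop, assuming nightly's `-Zsanitizer=<name>` flag; the `san_rustflags` helper and the list of names are hypothetical and illustrative, not what this PR's CI script uses, and the loop only prints the command it would run:

```shell
#!/usr/bin/env bash
set -euo pipefail

# Hypothetical helper: the RUSTFLAGS value for a single sanitizer.
san_rustflags() {
    printf '%s' "-Zsanitizer=$1"
}

# Illustrative subset; adjust to whatever the target actually supports.
SANITIZERS=(address leak memory thread)

for san in "${SANITIZERS[@]}"; do
    # Dry run: print the command instead of invoking cargo.
    echo "RUSTFLAGS=$(san_rustflags "$san") cargo +nightly test"
done
```

In a CI matrix, each name would typically become its own job so the passes run in parallel rather than in sequence.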

> If you’d prefer it in a separate repo that’s fine too of course. It just seemed that with a lot of overlap, having one place to make any setup changes is easier than two.

> How big is the overlap in the end? All of the scripts that run the actual tests are separate, right? The overlap is in the crate setup that lets us invoke cargo test in the first place?

> I'd say it depends on how much the parts that are different can be isolated, so that I don't have to worry about the sanitizer part when I need to patch the Miri part. If that can be done, then I'm fine with sharing the parts that can be shared.

Yeah, that's about it. It really makes no difference whether it is here or elsewhere, it just seemed maybe nice to keep the similar structure together (but it's also not running yet so that could change).

> Also someone should feel in charge of keeping this working so I can ping them in case of trouble. Would you be that someone?

I wouldn't mind being that someone, but I'm also not a team member. I'm sure there's at least one person on the compiler team who wouldn't mind being a fallback; I'll ask around once I (hopefully) get this working.

> We should probably rename the project if we land this.^^ miri-test-libstd was anyway not a great name since it also tests libcore and more...

If it winds up that we have different repos, I'm voting to name this one SANity check :)

It will be pretty interesting to see how the results of this all compare to Miri. I suspect Miri catches a lot more, but it's interesting that some of the sanitizers can see through to the C side as well (have to figure that bit out yet...)

tgross35 · 2023-10-06T09:50:21Z

Finally just told it to mark everything as a pass so we'd see actual results. Initial group of failures looks pretty repetitive:

1. Some weird uninit thing (A, B)
2. ~~test_str_truncate_split_codepoint~~ test_try_reserve must have something unusual under the hood; it says we're requesting the max allocation size (A, B)
3. A data race in memcpy? (A, B)
4. An allocation error similar to 2 (A)
5. SIMD looks like it has a different uninit error (A), but it's the first crate to pass ASAN and leaksan!
6. Some segfault in stdarch? (A)
7. I can't even get cfi to compile because it seems to want conflicting flags

I'm sure a lot of those could be false positives, just need to do a bit of chasing.

Everything passed safestack

saethlin · 2023-10-06T15:36:43Z

Hm. My biggest concern at the moment is that the backtraces aren't useful, I'm not confident in debugging from CI without good backtraces. Are we somehow compiling without debuginfo?

tgross35 · 2023-10-06T18:05:56Z

I guess llvm-symbolizer needed to be installed for better output; the first result is now https://github.com/rust-lang/miri-test-libstd/actions/runs/6434877064/job/17474985110?pr=21#step:4:69. Still not that easy to pinpoint, but better. If I'm reading that right, it seems like maybe it's yelling at something inside the panic handler?

tgross35 · 2023-10-06T18:19:09Z

At least the try_reserve test can be explained: https://github.com/rust-lang/rust/blob/64fa0c34d7cb1a2d522414ab2c87024e465bd613/library/alloc/tests/vec.rs#L1641

saethlin · 2023-10-06T18:24:40Z

Yep, in my experience it's pretty common that you need to permit allocator failure due to tests like this.

The MSan backtraces look like this:

  0.000006       #29 0x55e06d4e357c in std::rt::lang_start::hff29f05a9594b9e8 /rustc/e0d7ed1f453fb54578cc96dfea859b0e7be15016/library/std/src/rt.rs:165:17
  0.000007       #30 0x55e06d22d22f in main (/home/runner/work/miri-test-libstd/miri-test-libstd/target/x86_64-unknown-linux-gnu/debug/deps/alloc_run_test-fa2f54cc8400c238+0x119d22f) (BuildId: 80acc57dbc04badae46dbaa41120bfe55e537384)
  0.000006       #31 0x7f0071229d8f  (/lib/x86_64-linux-gnu/libc.so.6+0x29d8f) (BuildId: 229b7dc509053fe4df5e29e8629911f0c3bc66dd)
  0.000006       #32 0x7f0071229e3f in __libc_start_main (/lib/x86_64-linux-gnu/libc.so.6+0x29e3f) (BuildId: 229b7dc509053fe4df5e29e8629911f0c3bc66dd)
  0.000006       #33 0x55e06c0cf014 in _start (/home/runner/work/miri-test-libstd/miri-test-libstd/target/x86_64-unknown-linux-gnu/debug/deps/alloc_run_test-fa2f54cc8400c238+0x3f014) (BuildId: 80acc57dbc04badae46dbaa41120bfe55e537384)

The paths suggest we've somehow linked together artifacts from the local build and the precompiled standard library from rustup. MSan false positives are pretty common when you only instrument part of the program, which sure looks like what is happening here.

It wouldn't surprise me if straightening out whatever is causing this fixes all the other errors.
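
One way to spot this kind of mixing in a CI log is to look for frames whose source path starts with `/rustc/<commit hash>/`, the prefix seen in frame #29 above. A minimal sketch, assuming that prefix is a reliable marker of the precompiled standard library; `count_precompiled_frames` is a hypothetical helper, not part of this PR:

```shell
#!/usr/bin/env bash
set -euo pipefail

# Hypothetical check: count backtrace lines (read from stdin) whose source
# path starts with /rustc/<40-hex-digit commit>/.
count_precompiled_frames() {
    # grep -c exits non-zero when nothing matches; keep the "0" it prints.
    grep -cE '/rustc/[0-9a-f]{40}/' || true
}

# Two frames taken from the backtrace above (BuildId suffix dropped).
sample='#29 0x55e06d4e357c in std::rt::lang_start::hff29f05a9594b9e8 /rustc/e0d7ed1f453fb54578cc96dfea859b0e7be15016/library/std/src/rt.rs:165:17
#30 0x55e06d22d22f in main (/home/runner/work/miri-test-libstd/miri-test-libstd/target/x86_64-unknown-linux-gnu/debug/deps/alloc_run_test-fa2f54cc8400c238+0x119d22f)'

printf '%s\n' "$sample" | count_precompiled_frames   # prints 1
```

A non-zero count in a sanitizer run would suggest that at least part of the standard library was not rebuilt with instrumentation.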

saethlin · 2023-10-06T20:25:00Z

I think we're better off setting ASAN_OPTIONS to something like

export ASAN_OPTIONS="detect_leaks=0:detect_stack_use_after_return=true:allocator_may_return_null=1:detect_invalid_pointer_pairs=2"

At the very least letting the allocator return null. I honestly don't trust no_sanitize; partly-instrumented programs tend to throw false positives and we'll have to add another patch for every test that needs a very big allocation.
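
ASAN_OPTIONS is a single colon-separated string, which gets hard to review as flags are added or dropped. A minimal sketch that builds the same value from an array, one option per line; the `asan_opts` array and `join_opts` helper are hypothetical names, not part of the CI script:

```shell
#!/usr/bin/env bash
set -euo pipefail

# The options from the export line above, one per line for easier diffs.
asan_opts=(
    detect_leaks=0
    detect_stack_use_after_return=true
    allocator_may_return_null=1
    detect_invalid_pointer_pairs=2
)

# Join all arguments with ':'; "$*" joins on the first character of IFS.
join_opts() {
    local IFS=':'
    printf '%s' "$*"
}

ASAN_OPTIONS="$(join_opts "${asan_opts[@]}")"
export ASAN_OPTIONS
echo "$ASAN_OPTIONS"
```

Dropping or commenting out a single array entry (for example `detect_leaks=0`) then changes the exported value without touching the rest of the string.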

tgross35 · 2023-10-06T21:25:49Z

I updated the flags but didn't add detect_leaks - are there a lot of false positives with that one?

Do you know of a way to fix cross-build linking? I am sort of wondering if maybe it is better to let x.py take a sanitizer option and build from source before running these tests. That would probably mean a separate repo from this one.

We could almost do it as part of rust-lang/rust CI like Ralf suggested, since there's about an hour of free time between x86 finishing and a full run. But we would want to reuse the built artifacts while parallelizing these sanitizer tests, and I don't know of a good way to do that.

saethlin · 2023-10-06T21:39:14Z

That's the pile of flags I use to look for UB. Leaks are just annoying, and I've seen tests that are supposed to leak. Happy to start with a stricter approach for the standard library tests. (I'm not aware of false positives)

tgross35 · 2023-10-06T21:42:15Z

I guess that if we can't rely on no_sanitize for an intentionally leaky test then it sure seems pretty useless :)

RalfJung · 2024-10-01T09:00:12Z

@tgross35 I am going to close this PR due to inactivity. Feel free to reopen when you want to get back to this. :)

Initial implementation of tests with sanitizers

e66c6e3

tgross35 force-pushed the sanitizers branch from 78b6d0a to e66c6e3 Compare October 6, 2023 05:35

RalfJung reviewed Oct 6, 2023

tgross35 added 21 commits October 6, 2023 03:15

Install lib source, update scripts

fcf463d

Command typo

1a7aec5

Unbound var

554f05b

Fix more bash errors

d8c1612

Enable tests for all crates

cf1f881

Fix workflow

5417399

Update test matrix

89838b6

fix workflow

1241d06

fix workflow

88be511

Update workflow

86b2c45

Disable cfi for now

8c7cb72

Update workflow

b477e37

Update CI once again

a93f53c

Update wf

7b57cc2

Update wf

d3753c8

Update wf

0cdc3bf

Update wf

6b283c0

Update wf

3951c08

Update wf

e258e7e

Update wf to not exist so fast

e903448

update wf

e9eb56c

tgross35 added 3 commits October 6, 2023 05:16

update wf

8711457

Remove filtering of tests

8f89583

Fix misquote in ci

75cbfc3

tgross35 mentioned this pull request Oct 6, 2023

Tracking issue for sanitizer support rust-lang/rust#39699

Open

5 tasks

Add llvm-symbolizer to CI

a217313

tgross35 added 6 commits October 6, 2023 14:41

Add patches to rust source

a799c45

Update pathces and cfg

f10922a

Fix path

12fad0a

Fix syntax

aa9b07c

Fix stdarch test, clean CI script and unused warnings

a2ea0ff

Cleanup attributes in alloc

b44c6ee

tgross35 added 2 commits October 6, 2023 17:08

Update ASAN config

299c022

Fix alloc test

fafc1d7

Adjust san flags

705d153

RalfJung closed this Oct 1, 2024
