Create a resolver benchmarking suite. #9935

ehuss · 2021-09-22T03:10:45Z

This issue is for tracking and exploring creating a suite of benchmarks for the resolver so that we can better evaluate changes.

There are several different things worth benchmarking. Some rough ideas:

Creating a Cargo.lock from scratch.
Re-running the resolver when Cargo.lock already exists. (This is probably the most important one.)
Pathological edge cases, including those that result in errors. I don't know what would make good candidates; maybe @Eh2406 has some ideas?
Covering resolve_ws_with_opts, which runs the resolver and does a bunch of other work.

It may be useful to test against some real-world projects and some synthesized ones. The benchmarking suite can probably use a snapshot of the crates.io index (cloning https://github.com/rust-lang/crates.io-index at a specific commit). It probably shouldn't be too hard to just capture the Cargo.toml files from some real-world projects so we can create some lightweight tests. Some real-world projects that I have used, with their approximate number of deps:

empty project: 0
toml: 14
cargo: 130
rust: 518
tikv: 552
firefox: 577
diem: 653
servo: 658
paritytech/substrate: 896

I'm not sure which benchmarking libraries would be good to use. Criterion seems nice, but maybe others have other suggestions.
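To make the shape of such a benchmark concrete, here is a minimal sketch that uses only the standard library. It is not Criterion and does not call any Cargo API. The `sample` and `median` helpers and the closure workload are hypothetical stand-ins; a real benchmark would run the resolver (for example with and without an existing Cargo.lock) inside the closure.

```rust
use std::time::{Duration, Instant};

/// Run `work` `warmup` times untimed, then `samples` times timed.
/// Returns the timed durations sorted in ascending order.
fn sample<F: FnMut()>(warmup: usize, samples: usize, mut work: F) -> Vec<Duration> {
    for _ in 0..warmup {
        work();
    }
    let mut times: Vec<Duration> = (0..samples)
        .map(|_| {
            let start = Instant::now();
            work();
            start.elapsed()
        })
        .collect();
    times.sort();
    times
}

/// Median of an already sorted, non-empty slice
/// (the upper middle element when the length is even).
fn median(sorted: &[Duration]) -> Duration {
    sorted[sorted.len() / 2]
}

fn main() {
    // Hypothetical workload standing in for a resolver run.
    let times = sample(2, 11, || {
        let v: Vec<u64> = (0..10_000).collect();
        std::hint::black_box(v.iter().sum::<u64>());
    });
    println!(
        "min {:?}, median {:?}, max {:?}",
        times[0],
        median(&times),
        times[times.len() - 1]
    );
}
```

A dedicated library such as Criterion adds statistical analysis and comparison against earlier runs, which this sketch does not attempt.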

We may also want to create benchmarks for overall overhead (time to run cargo build with a project that is in a "fresh" state where no builds are necessary). This would cover several parts:

Process startup.
Config loading.
Workspace loading.
Resolver.
New feature resolver.
Generating units.
Scanning fingerprints.

I often do this with hyperfine on the projects listed above. It might be nice to make this easier to do, so something to keep in mind when making the benchmarks above.
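As a rough illustration of the kind of measurement hyperfine performs, the sketch below times repeated `cargo build` invocations with the standard library only. It is not hyperfine and not part of Cargo. The `time_command` and `summarize` helpers are hypothetical names, and the sketch assumes the current directory contains a project that builds successfully.

```rust
use std::io;
use std::path::Path;
use std::process::{Command, Stdio};
use std::time::{Duration, Instant};

/// Time `runs` invocations of `program args...` in `dir`, discarding output.
/// Fails if the command cannot be spawned or exits unsuccessfully.
fn time_command(
    program: &str,
    args: &[&str],
    dir: &Path,
    runs: usize,
) -> io::Result<Vec<Duration>> {
    let mut times = Vec::with_capacity(runs);
    for _ in 0..runs {
        let start = Instant::now();
        let status = Command::new(program)
            .args(args)
            .current_dir(dir)
            .stdout(Stdio::null())
            .stderr(Stdio::null())
            .status()?;
        let elapsed = start.elapsed();
        if !status.success() {
            return Err(io::Error::new(
                io::ErrorKind::Other,
                format!("{program} exited with {status}"),
            ));
        }
        times.push(elapsed);
    }
    Ok(times)
}

/// (min, mean, max) of a non-empty set of timings.
fn summarize(times: &[Duration]) -> (Duration, Duration, Duration) {
    let min = *times.iter().min().expect("at least one timing");
    let max = *times.iter().max().expect("at least one timing");
    let total: Duration = times.iter().sum();
    (min, total / times.len() as u32, max)
}

fn main() -> io::Result<()> {
    // Assumption: the current directory holds a Cargo project.
    let dir = std::env::current_dir()?;
    // One discarded run so that later runs start from a "fresh" state.
    time_command("cargo", &["build"], &dir, 1)?;
    let times = time_command("cargo", &["build"], &dir, 10)?;
    let (min, mean, max) = summarize(&times);
    println!("cargo build (fresh): min {min:?}, mean {mean:?}, max {max:?}");
    Ok(())
}
```

Because this measures the whole process, the result includes every stage listed above, from process startup through fingerprint scanning, without separating them.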

@Eh2406 and @alexcrichton, if you have any other ideas or thoughts about what would be good to do, please include them here.

I expect this to be implemented in incremental steps. That is, we don't need a perfect benchmarking suite that covers everything all at once. Just automating a few real-world tests would be a good first step. I also expect these to be manually run by Cargo developers in an ad-hoc fashion at first. I don't think we have the facilities to do anything automated.


ehuss added A-testing-cargo-itself Area: cargo's tests E-medium Experience: Medium labels Sep 22, 2021

epage mentioned this issue Sep 22, 2021

Performance test against toml-rs toml-rs/toml#132

Closed

3 tasks

Eh2406 mentioned this issue Sep 23, 2021

Drop the im-rc dependency #9878

Closed

ehuss self-assigned this Sep 29, 2021

ehuss mentioned this issue Oct 2, 2021

Add the start of a basic benchmarking suite. #9955

Merged

bors closed this as completed in c8b38af Oct 12, 2021
