Integrate criterion statistics and plots into CI #5354

konstin · 2023-06-25T11:24:49Z

For each PR, our CI compares running criterion benchmarks on main vs. running them on the PR and posts a comment with the timing changes. Unfortunately, the github actions runner performance is rather noisy, so e.g. in this change that doesn't touch the linter at all we still see large seeming changes to linter performance.

criterion, the benchmarking framework we use, integrates statistical methods and e.g. computes p-values that tells you whether a change is statistically significant (vs. the just noise) and it plots each run to allow for manual inspection. It would be great if we could extend CI to show more information, e.g. whether criterion considers the difference significant. I don't know if that is possible with the way github handles artifacts, but exporting the criterion plots and linking them on the PR comment would also be helpful.

charliermarsh · 2023-11-10T05:32:46Z

I think this is less relevant now that we're on CodSpeed, but can always revisit if we migrate away.

konstin added the performance Potential performance improvement label Jun 25, 2023

charliermarsh closed this as not planned Won't fix, can't repro, duplicate, stale Nov 10, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Integrate criterion statistics and plots into CI #5354

Integrate criterion statistics and plots into CI #5354

konstin commented Jun 25, 2023

charliermarsh commented Nov 10, 2023

Integrate criterion statistics and plots into CI #5354

Integrate criterion statistics and plots into CI #5354

Comments

konstin commented Jun 25, 2023

charliermarsh commented Nov 10, 2023