Add performance regression tests in CI #4701

kaukabrizvi · 2024-08-10T05:21:00Z

Description of changes:

This PR integrates the regression tests from #4667 into the s2n-tls CI pipeline. The regression_ci.yml workflow would now run on any pull request to main, comparing the performance of all harnesses in tests/regression against the mainline performance. The test will fail if the performance regression exceeds the predefined threshold (currently set to a constant, which will be updated after merging #4698). The GitHub Actions job (example flow) uploads the performance results for the PR, the mainline, and their differences as artifacts, accessible to the developer regardless of the test outcome. This aids in debugging performance issues that may cause test failures.

Call-outs:

Valgrind Installation: The tests require Valgrind, but the version available via apt install valgrind on the runner is outdated and lacks the necessary cachegrind functionality. To address this, Valgrind 3.23 is installed from source to ensure compatibility with the regression tests.
Mainline vs. PR Comparison: In tests/regression/src/lib.rs, I updated the comparison logic to correctly handle mainline vs. PR branches. Previously, the check was based on whether both commits were in the same log, which isn't applicable to CI integration. Now, the logic checks if one of the versions is the mainline, treating it as the older version. If both or neither are mainline, the check determines which commit is older. This ensures the diff profile accurately represents PR - Mainline performance.

This workflow was tested on my personal branch by creating pull requests to my personal branch and verifying the expected outcomes for pass/failure in the performance CI. It should also run on this PR, allowing reviewers to observe the logs in the checks below as 'Performance Regression Test / regression-test (pull_request)'

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

jmayclin · 2024-08-12T17:42:00Z

.github/workflows/regression_ci.yml

+          git fetch origin main
+          git checkout main
+
+      # Regenrate bindings for main branch


Suggested change

# Regenrate bindings for main branch

# Regenerate bindings for main branch

tests/regression/src/scratch

.github/workflows/regression_ci.yml

tests/regression/src/lib.rs

kaukabrizvi · 2024-08-12T21:05:27Z

Since I changed the file storage scheme to target/regression_artifacts and that change isn't currently in mainline. This PR fails the regression CI check because when we switch to mainline, files are stored in the old format which the PR branch doesn't recognize. Which is why you get the faliure in CI since the diff test only recognizes one folder stored in the expected location (the PR branch performance).

jmayclin · 2024-08-13T01:06:17Z

tests/regression/src/lib.rs

@@ -160,18 +186,28 @@ mod tests {
                test_name: test_name.to_string(),
                commit_hash: git::extract_commit_hash(&raw_files[0]),
            };
-
+            println!("{}", profile1.test_name);


Nit: I think you forgot to remove this?

.github/workflows/regression_ci.yml

maddeleine

It seems like the workflow that you're adding in this PR is failing?

tests/regression/src/lib.rs

.github/workflows/regression_ci.yml

maddeleine · 2024-08-13T20:16:42Z

tests/regression/src/lib.rs

-                (profile1, profile2)
-            } else if git::is_older_commit(&profile2.commit_hash, &profile1.commit_hash) {
-                (profile2, profile1)
+            if git::is_mainline(&profile1.commit_hash) ^ git::is_mainline(&profile2.commit_hash) {


I'm not really following the logic to return the correct tuple here. What is the result you're trying to return in this function? Why do you need an xor?

What is the result you're trying to return in this function?

You can think of this as a sort on the commits:

Whichever commit is mainline appears first

If is_mainline() is equivalent for both commits, the older commit appears first

This is necessary for the diff functionality to know which version is the standard and which version we want to test for comparison. Between mainline and PR branch, mainline should be the standard for comparison, otherwise the older commit between two PR's which are on the same branch should be the standard for comparison to the new commit.

Why do you need an xor?

The xor returns true if exactly one of the commits is mainline, then we can return whichever commit is mainline first. If both or neither are mainline the condition evaluates to false, then we must check for which commit is older, so we move to the else condition in that case which checks for which commit is older.

I will add comments to this function to make this more clear in the code.

It just seems like we don't need to be guessing at which commit is our "baseline". Doesn't the caller of these tests always know which commit is the baseline versus which one is the altered code? Like, you should always know I want to know the performance change that occurs from "this commit" to "this commit".

Initially, the approach was to have the caller identify "baseline" and "altered" as an environment variable upon invocation, similar to your suggestion. However after some discussion, we decided to solely rely on the commit id to auto detect new/old and in CI, branch/mainline. This means that with solely the commit id's available to identify files we have to figure out which commit ID is "baseline" vs "altered", which this logic attempts to solve. We could add back an environment variable for the caller to set "baseline" and "altered" to make the tool more flexible in its usage but if we base profiles on solely their commit id, the current approach is needed.

Hmmm that's a bummer, I kind of disagree with that approach because this logic looks pretty brittle as it is now. I think it would help for me to know what exactly is our goal here, do we expect this to automatically detect the right commit both when running this locally and running this in CI? Like, is the logic here only for running in CI or is it also trying to work for running the tests locally?

The logic here attempts to cover both, running locally and in CI. Locally, we would want the older of two commits as "baseline" and in CI, we would want main to be "baseline". I agree with you though that the logic is brittle, especially for local testing since I can imagine scenarios where the caller wants to compare two commits that are not on the same branch or set the newer commit as "baseline". To make the usage a bit more extensible, I think we should add an identifier to let the caller decide, so that the artifacts produced are baseline/commit_hash.test_name.raw and altered/commit_hash.test_name.raw. I would vote to do that in a separate PR though.

Yeah then I think you should just make this logic to work in CI. So it would just be like:

if (profile1.is_mainline()) { (profile1, profile2) } else { (profile2, profile1) }

I don't think there's much point in commiting complex logic if you're going to rip it out in a different PR.

Given the direction that integv2 went, I think it's important to keep things at least theoretically runnable locally. Also makes putting demos together much easier, etc.

I mean, if I am running this locally, I would probably create a branch with my changes and want to compare that branch to mainline. So even then, I really would only care about the difference between mainline and my branch.

That being said, the consequences of getting the order wrong are very minor. But I do think this code should be cleaned up in the future.

kaukabrizvi · 2024-08-13T21:10:18Z

@maddeleine

It seems like the workflow that you're adding in this PR is failing?

Since I changed the file storage scheme to target/regression_artifacts and that change isn't currently in mainline. This PR fails the regression CI check because when we switch to mainline, files are stored in the old format which the PR branch doesn't recognize when diffing.

I made a change to disable running this check when changes are made to the regression crate. However, the paths-ignore doesn't apply here because all paths that are changing must be present in the paths-ignore. We are also making changes to a file outside of tests/regression so this PR doesn't qualify. I don't think we should add .github/workflows/regression_ci.yml to paths-ignore because the only way to test changes to the .yml file are for it to run as a check in CI. So, it will fail for this PR which is alright because it is not currently a mandatory check, but for any future PR's which make a change solely to tests/regression, this test will not run to avoid issues like the one described above.

kaukabrizvi and others added 21 commits August 8, 2024 21:34

Added GA workflow for regression testing

969c31f

Update branch to personal for testing

ac4195d

Added comments

9f47298

Renamed GA file

c23002e

Checkout personal for testing

6493239

Mainline check in query function

7a55923

Added create directory for diff files

46a691b

Fixed annotated output file path

0d5ed11

Added always to upload on every run

d8fb254

Changed stat directory

c927f87

Debug statements in workflow

7a906fe

Fixed directory location

8661fa1

Deal with moving multiple files at once

1bce5ad

Get full commit id

b9a88a7

Update output directory to perf_outputs

0e9fb3e

Simplified output file paths to avoid redundancy

6e68c0b

Split mainline search into two steps

ddca12a

Merge branch 'aws:main' into regression-ci

47ab67c

Remove cargo build, detail comments

1be1700

Increase regression threshold

f427602

Change 'personal' to 'main'

cb336c5

kaukabrizvi marked this pull request as ready for review August 11, 2024 19:25

jmayclin reviewed Aug 12, 2024

View reviewed changes

kaukabrizvi added 6 commits August 12, 2024 19:33

Address PR feedback

f23f4cf

Fix get commit hash for new file storage scheme

d91952c

Change checkout to switch in comments

149c300

Change switch to checkout for PR checkout

bf45366

Changed switch to checkout in mainline checkout

650416c

Change checkout back to switch for mainline

1f0272e

Fix formatting

124d01c

jmayclin requested a review from maddeleine August 13, 2024 00:40

jmayclin reviewed Aug 13, 2024

View reviewed changes

kaukabrizvi added 2 commits August 13, 2024 02:04

Ignore test when changes are made to regression crate

e852dcf

Fix file path for path-ignore

5f5344d

kaukabrizvi requested a review from jmayclin August 13, 2024 19:52

maddeleine reviewed Aug 13, 2024

View reviewed changes

kaukabrizvi added 2 commits August 13, 2024 21:40

Clarify is_mainline and valgrind installation

d15d6e0

Simplify valgrind installation comment

f596a6f

kaukabrizvi requested a review from maddeleine August 13, 2024 22:10

maddeleine approved these changes Aug 16, 2024

View reviewed changes

jmayclin approved these changes Aug 16, 2024

View reviewed changes

kaukabrizvi and others added 4 commits August 16, 2024 08:52

Merge branch 'main' into regression-ci

fad287d

Fix merge conflicts

6c40862

Update actions to latest version

79ae666

Merge branch 'main' into regression-ci

222ad90

maddeleine merged commit 87f4a05 into aws:main Aug 19, 2024
35 of 36 checks passed

BrewTestBot mentioned this pull request Aug 20, 2024

s2n 1.5.1 Homebrew/homebrew-core#181832

Merged

kaukabrizvi deleted the regression-ci branch August 21, 2024 20:52

kaukabrizvi mentioned this pull request Aug 22, 2024

Simplify git logic in regression tests #4725

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add performance regression tests in CI #4701

Add performance regression tests in CI #4701

kaukabrizvi commented Aug 10, 2024 •

edited

Loading

jmayclin Aug 12, 2024

kaukabrizvi commented Aug 12, 2024 •

edited

Loading

jmayclin Aug 13, 2024

maddeleine left a comment

maddeleine Aug 13, 2024

kaukabrizvi Aug 13, 2024 •

edited

Loading

maddeleine Aug 14, 2024

kaukabrizvi Aug 15, 2024 •

edited

Loading

maddeleine Aug 15, 2024 •

edited

Loading

kaukabrizvi Aug 15, 2024

maddeleine Aug 15, 2024

jmayclin Aug 15, 2024

maddeleine Aug 16, 2024 •

edited

Loading

kaukabrizvi commented Aug 13, 2024

	# Regenrate bindings for main branch
	# Regenerate bindings for main branch

Add performance regression tests in CI #4701

Add performance regression tests in CI #4701

Conversation

kaukabrizvi commented Aug 10, 2024 • edited Loading

Description of changes:

Call-outs:

jmayclin Aug 12, 2024

Choose a reason for hiding this comment

kaukabrizvi commented Aug 12, 2024 • edited Loading

jmayclin Aug 13, 2024

Choose a reason for hiding this comment

maddeleine left a comment

Choose a reason for hiding this comment

maddeleine Aug 13, 2024

Choose a reason for hiding this comment

kaukabrizvi Aug 13, 2024 • edited Loading

Choose a reason for hiding this comment

maddeleine Aug 14, 2024

Choose a reason for hiding this comment

kaukabrizvi Aug 15, 2024 • edited Loading

Choose a reason for hiding this comment

maddeleine Aug 15, 2024 • edited Loading

Choose a reason for hiding this comment

kaukabrizvi Aug 15, 2024

Choose a reason for hiding this comment

maddeleine Aug 15, 2024

Choose a reason for hiding this comment

jmayclin Aug 15, 2024

Choose a reason for hiding this comment

maddeleine Aug 16, 2024 • edited Loading

Choose a reason for hiding this comment

kaukabrizvi commented Aug 13, 2024

kaukabrizvi commented Aug 10, 2024 •

edited

Loading

kaukabrizvi commented Aug 12, 2024 •

edited

Loading

kaukabrizvi Aug 13, 2024 •

edited

Loading

kaukabrizvi Aug 15, 2024 •

edited

Loading

maddeleine Aug 15, 2024 •

edited

Loading

maddeleine Aug 16, 2024 •

edited

Loading