tests/e2e: more debug info on dumps #357

willfindlay · 2022-08-23T13:31:11Z

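This series adds more debug info to e2e test dumps: periodic metrics snapshots, `kubectl describe` output for the Tetragon pod, and a summary of pods in the cluster. See the individual commits for details.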

tests/e2e/helpers/dumpinfo.go

kkourt

LGTM, just some minor comments

tests/e2e/helpers/dumpinfo.go

There were a few leaked goroutines in the e2e framework because the context was never canceled at the end of the tests, which broke some of our assumptions. Refactor the runner to wrap the testenv and cancel the context at the end of the test.
Signed-off-by: William Findlay <[email protected]>
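The leak described above can be illustrated with a minimal sketch. It is not the actual runner from this PR: `runWithCancel` and `workersExitAfterRun` are hypothetical names, and the real change wraps the testenv rather than a plain function.

```go
package main

import (
	"context"
	"fmt"
	"sync"
)

// runWithCancel is a hypothetical stand-in for the refactored runner: it
// derives a cancellable context, hands it to the test body, and cancels it
// when the body returns so that helpers tied to ctx can shut down.
func runWithCancel(parent context.Context, body func(ctx context.Context)) {
	ctx, cancel := context.WithCancel(parent)
	defer cancel()
	body(ctx)
}

// workersExitAfterRun starts n background goroutines that block until their
// context is done, then waits for all of them to exit after the run.
func workersExitAfterRun(n int) bool {
	var wg sync.WaitGroup
	runWithCancel(context.Background(), func(ctx context.Context) {
		for i := 0; i < n; i++ {
			wg.Add(1)
			go func() {
				defer wg.Done()
				<-ctx.Done() // without cancel(), this would block forever
			}()
		}
	})
	wg.Wait() // returns only because runWithCancel canceled the context
	return true
}

func main() {
	fmt.Println("workers exited:", workersExitAfterRun(3))
}
```

If the `defer cancel()` line were removed, `wg.Wait()` would never return, which is the kind of leak the commit describes.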

In cases where the agent crashes during a test, we are unable to retrieve metrics at the end, since the metrics server is offline by then. This means we lose valuable debugging information for deducing what went wrong. To rectify this, introduce new logic that dumps Tetragon metrics at a regular interval during the test. With these changes in place, we have at least a recent snapshot of metrics regardless of whether the metrics server can be contacted at the end of a test.
Signed-off-by: William Findlay <[email protected]>
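A periodic dump of this kind could look roughly like the sketch below. Everything here is an assumption for illustration: `snapshotKeeper`, `dumpPeriodically`, and `simulateCrash` are hypothetical names, and `fetch` stands in for whatever call actually scrapes the Tetragon metrics endpoint.

```go
package main

import (
	"context"
	"errors"
	"fmt"
	"sync"
	"time"
)

// snapshotKeeper remembers the most recent successful metrics dump.
// The mutex matters when the dumper runs in its own goroutine.
type snapshotKeeper struct {
	mu   sync.Mutex
	last string
}

func (k *snapshotKeeper) store(s string) {
	k.mu.Lock()
	defer k.mu.Unlock()
	k.last = s
}

func (k *snapshotKeeper) latest() string {
	k.mu.Lock()
	defer k.mu.Unlock()
	return k.last
}

// dumpPeriodically calls fetch on every tick until ctx is done and keeps
// only successful results, so a crash never overwrites the last good dump.
func dumpPeriodically(ctx context.Context, interval time.Duration,
	fetch func() (string, error), k *snapshotKeeper) {
	ticker := time.NewTicker(interval)
	defer ticker.Stop()
	for {
		select {
		case <-ctx.Done():
			return
		case <-ticker.C:
			if s, err := fetch(); err == nil {
				k.store(s)
			}
		}
	}
}

// simulateCrash fakes a metrics server that answers three times and then
// goes offline, and returns the snapshot that survives.
func simulateCrash() string {
	ctx, cancel := context.WithCancel(context.Background())
	defer cancel()
	k := &snapshotKeeper{}
	calls := 0
	fetch := func() (string, error) {
		calls++
		if calls > 3 {
			cancel() // end the demo once the "crash" has been observed
			return "", errors.New("metrics server offline")
		}
		return fmt.Sprintf("snapshot %d", calls), nil
	}
	dumpPeriodically(ctx, 5*time.Millisecond, fetch, k)
	return k.latest()
}

func main() {
	fmt.Println(simulateCrash()) // prints "snapshot 3"
}
```

In a real test the dumper would run in a goroutine tied to the test context, and the interval trades snapshot freshness against scrape overhead.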

Create a new RunCommand helper and factor it out of the main body of the bpftool dump subroutine. This will enable us to reuse the same basic logic to run commands elsewhere with a nice dump of stdout and stderr.
Signed-off-by: William Findlay <[email protected]>
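As a rough sketch of what such a helper might do, the block below runs a command and renders stdout and stderr in one dump. The names `runCommand` and `formatDump`, their signatures, and the output layout are assumptions, not the `RunCommand` helper added in this PR.

```go
package main

import (
	"bytes"
	"context"
	"fmt"
	"os/exec"
	"strings"
	"time"
)

// formatDump renders one command result in a fixed layout (hypothetical).
func formatDump(cmdline, stdout, stderr string, err error) string {
	status := "ok"
	if err != nil {
		status = "error: " + err.Error()
	}
	return fmt.Sprintf("$ %s\nstatus: %s\n--- stdout ---\n%s\n--- stderr ---\n%s\n",
		cmdline, status, stdout, stderr)
}

// runCommand runs name with args, captures both output streams separately,
// and returns the formatted dump. A failed command still produces a dump.
func runCommand(ctx context.Context, name string, args ...string) string {
	var stdout, stderr bytes.Buffer
	cmd := exec.CommandContext(ctx, name, args...)
	cmd.Stdout = &stdout
	cmd.Stderr = &stderr
	err := cmd.Run()
	cmdline := strings.Join(append([]string{name}, args...), " ")
	return formatDump(cmdline, stdout.String(), stderr.String(), err)
}

func main() {
	ctx, cancel := context.WithTimeout(context.Background(), 10*time.Second)
	defer cancel()
	// Any binary on PATH works here; "go version" is only an example.
	fmt.Print(runCommand(ctx, "go", "version"))
}
```

Keeping the formatting separate from the execution makes it easy to reuse the same layout for bpftool, kubectl, or any other command.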

In case the Tetragon pod crashes during a test, it's useful to grab the output of kubectl describe to help understand why. So let's add this to the test dump on failure.
Signed-off-by: William Findlay <[email protected]>

It's useful to get a summary of pods in our cluster when a test fails so we can figure out if the reason for the failure might be incidental -- for example in cases where our workload fails to run correctly.
Signed-off-by: William Findlay <[email protected]>
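The two kubectl-based dumps described in the last two commits could be driven by invocations like the ones sketched here. The exact commands used in the PR are not shown in this conversation, so the argument lists, the `failureDumpCommands` name, and the namespace and label selector in `main` are all assumptions.

```go
package main

import (
	"fmt"
	"strings"
)

// failureDumpCommands returns hypothetical kubectl invocations for a
// failure dump: one describing the pods matched by selector in namespace,
// and one summarizing every pod in the cluster.
func failureDumpCommands(namespace, selector string) [][]string {
	return [][]string{
		{"kubectl", "describe", "pods", "--namespace", namespace, "--selector", selector},
		{"kubectl", "get", "pods", "--all-namespaces", "--output", "wide"},
	}
}

func main() {
	// Namespace and label are placeholders; adjust them to the deployment.
	for _, argv := range failureDumpCommands("kube-system", "app.kubernetes.io/name=tetragon") {
		fmt.Println(strings.Join(argv, " "))
	}
}
```

Each argument list can be passed to a command runner on test failure, with the output written next to the other dump artifacts.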

willfindlay · 2022-08-23T14:39:45Z

@kkourt I addressed your comments. While doing so, I also realized that we were never actually cancelling our context in the tests, so I had to refactor the runner a bit (it's in the new first commit).

willfindlay requested a review from kkourt August 23, 2022 13:31

willfindlay requested a review from a team as a code owner August 23, 2022 13:31

kevsecurity reviewed Aug 23, 2022

tests/e2e/helpers/dumpinfo.go (resolved)

willfindlay force-pushed the pr/willfindlay/even-better-dumps branch 2 times, most recently from 6240623 to b04dd26 Compare August 23, 2022 13:48

kevsecurity approved these changes Aug 23, 2022

kkourt approved these changes Aug 23, 2022

tests/e2e/helpers/dumpinfo.go (resolved)

tests/e2e/helpers/dumpinfo.go (outdated, resolved)

willfindlay marked this pull request as draft August 23, 2022 14:31

willfindlay added 5 commits August 23, 2022 10:35

tests/e2e: dump output of kubectl describe for failed tests

6e68e2d

willfindlay force-pushed the pr/willfindlay/even-better-dumps branch from b04dd26 to 6db73d0 Compare August 23, 2022 14:39

willfindlay marked this pull request as ready for review August 23, 2022 14:49

kkourt merged commit d8bb3c5 into main Aug 23, 2022

kkourt deleted the pr/willfindlay/even-better-dumps branch August 23, 2022 17:24
