-
Notifications
You must be signed in to change notification settings - Fork 96
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Automate remaining graceful recovery tests #2140
Automate remaining graceful recovery tests #2140
Conversation
Codecov ReportAll modified and coverable lines are covered by tests ✅
Additional details and impacted files@@ Coverage Diff @@
## main #2140 +/- ##
==========================================
+ Coverage 87.61% 95.00% +7.38%
==========================================
Files 96 1 -95
Lines 6695 220 -6475
Branches 50 50
==========================================
- Hits 5866 209 -5657
+ Misses 773 11 -762
+ Partials 56 0 -56 ☔ View full report in Codecov by Sentry. |
c22fe8c
to
330d50b
Compare
Because of the draining and deleting of the node, in addition to the restarting of the kind container. If this test fails/errors mid-process it will cause following tests to fail. If this is not okay I see two options
Does anyone have any thoughts on this, and if this is even something to worry about? |
|
Here are some sample errors I got from following tests when I drain the node, delete the node, but fail on restarting the docker container.
So it seems to fail on the setup section because we drain the node and delete it, but if we error there and don't complete it by restarting the container, the previous tests are left without a node. I'm not too sure of what we can do about this to mitigate with our AfterAll in the graceful-recovery test because if we failed on resetting inside the test, I think we would fail to cleanup/reset in the AfterAll. |
@bjee19 We currently run the telemetry test on its own in the pipeline, maybe we just do something similar with these couple of tests that require draining. It uses different labels than the rest. |
@sjberman So I added the graceful-recovery test to run on its own in the pipeline. The most recent run is a intentional failure. When this test fails, it no longer runs the rest of the functional tests, is that by design? I think if so then this solves the issue above, as any failure in the graceful recovery test that messes up with the kubernetes node or container will get caught in the graceful recovery test and won't propagate downwards to following tests because it will exit. |
@bjee19 Yeah, I think works fine. |
f3f82b9
to
ebc4b04
Compare
b867bef
to
9370b25
Compare
Automate remaining graceful recovery tests which involve restarting the Node which NGF is running on.
Problem: Need to automate the remaining graceful recovery tests.
Solution: Automate the remaining graceful recovery tests.
Testing: Test works correctly locally and on the pipeline.
Closes #1901
Checklist
Before creating a PR, run through this checklist and mark each as complete.
Release notes
If this PR introduces a change that affects users and needs to be mentioned in the release notes,
please add a brief note that summarizes the change.