Bug 1794493: add ctrcfg e2e test #1414
|
awesome 💪 |
|
uhm, not sure yet if failures in op tests are fully related /retest |
|
uhm I think e2e-gcp-op failures are now related |
force-pushed from c636455 to 4d3a597
|
updated to have the verification the rollback worked, as well as a bit more debugging logs to test the gcp-op problems |
force-pushed from 4d3a597 to 276f5e9
|
/skip |
|
/retest |
|
@haircommander: This pull request references Bugzilla bug 1794493, which is valid. The bug has been updated to refer to the pull request using the external bug tracker.
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
|
/retest |
|
e2e-gcp-op still related to this now that we're back with that |
|
yeah I've observed similar things when testing on my own. somehow the LogSizeMax value is being set to 0, and cri-o complains. Still working on it... |
force-pushed from a9437f3 to b53d27b
/retest |
mtrmac
left a comment
The test LGTM (to the limited extent I understand MCO).
WRT explicitly setting all the CRI-O options:
- Neither this nor #1405 provides any rationale. Is there a consensus on this being the right thing?
- If it is necessary to explicitly set all of those options, is there a process in place to keep updating the templates as new options are added, or defaults change?
agreed, I am still working on figuring out all of the failures. Once this is the final version (still WIP), I'll update it with explanations, but as I see it: The second problem is because we are using an outdated version of the CRI-O config, and there are now some values that, when populated, behave better (version file, for instance). So the vendor bump is to fix that. Finally, there's some weirdness with LogSizeMax. Somehow, it ends up as 0 after an update is applied. I still haven't gotten to the bottom of that, and once I do I'll add a test for each of the options, update the PR description, and remove the WIP
For now, it is necessary and there's no process. However, there's light at the end of the tunnel. What we really should do is use drop-in config files and define a structure that is a subset of the options that OpenShift needs (only the ones we care about configuring). When a ctrcfg comes in, we update that file with its values (keeping most options out of the config, so most of them default to the system crio.conf) |
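To make the drop-in idea concrete, here is a minimal sketch, not the MCO or CRI-O implementation. The `runtimeOverrides` struct, its fields, and `renderDropIn` are hypothetical names invented for illustration, and the TOML keys and the `[crio.runtime]` table are assumptions about how such a fragment might look. Only options that were explicitly set are written, so everything else falls back to the system crio.conf.

```go
package main

import (
	"fmt"
	"strings"
)

// runtimeOverrides is a hypothetical subset of CRI-O options that a ctrcfg
// might set. A nil field means "leave the system default alone".
type runtimeOverrides struct {
	PidsLimit  *int64
	LogLevel   *string
	LogSizeMax *int64
}

// renderDropIn emits a TOML fragment containing only the options that were
// explicitly set. It returns an empty string when nothing is overridden.
func renderDropIn(o runtimeOverrides) string {
	var b strings.Builder
	if o.PidsLimit != nil {
		fmt.Fprintf(&b, "pids_limit = %d\n", *o.PidsLimit)
	}
	if o.LogLevel != nil {
		fmt.Fprintf(&b, "log_level = %q\n", *o.LogLevel)
	}
	if o.LogSizeMax != nil {
		fmt.Fprintf(&b, "log_size_max = %d\n", *o.LogSizeMax)
	}
	if b.Len() == 0 {
		return ""
	}
	// Assumed table name; adjust to wherever the options actually live.
	return "[crio.runtime]\n" + b.String()
}

func main() {
	pids := int64(2048)
	// Only pids_limit is overridden; the other options are omitted entirely.
	fmt.Print(renderDropIn(runtimeOverrides{PidsLimit: &pids}))
}
```

Using pointer fields is one way to distinguish "unset" from "set to zero", which matters for an option like LogSizeMax where 0 is not a harmless default.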
That would be great, and is a good reason not to invest too much in the current hardcode-all-default design — if the drop in solution can happen soon enough. If there’s a risk that this will persist over a CRI-O major release update (1.y to 1.y+1), it seems desirable to harden the MCO implementation, maybe to at build time call https://github.com/cri-o/cri-o/blob/eb14e0feb01971bf88b2591b744b64dd60272fe5/pkg/config/config.go#L485 to generate the templates. Or maybe do that in |
|
|
@haircommander: This pull request references Bugzilla bug 1794493, which is valid. |
|
there seems to still be some kind of issue/race right? 🤔 |
I don't know if it's an "issue". one could call the sleeps a bit hacky, but I don't know of a better way to make sure the operation has made it to the openshift api-server without them. |
oh no sorry, I meant to say the e2e-gcp-op test is still failing with something that I think is related to this change uhm |
force-pushed from 554420c to 96339d9
This reverts commit 69025e8, and also updates crio.yaml templates with correct values and fixes whitespace
Signed-off-by: Urvashi Mohnani <[email protected]>
Signed-off-by: Peter Hunt <[email protected]>
Signed-off-by: Peter Hunt <[email protected]>
before, we weren't properly checking whether LogSizeMax was an empty value. As such, we incorrectly merged the config changes, which bricked the runtime when LogSizeMax wasn't defined. Fix this by checking LogSizeMax uniformly via its Value() method
Signed-off-by: Peter Hunt <[email protected]>
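The kind of check this commit message describes can be sketched as follows. This is an illustration under stated assumptions, not the code from the commit: `quantity` is a hypothetical stand-in for whatever type exposes `Value()`, and `crioRuntime` and `mergeLogSizeMax` are invented names. The point is that an unset override (whose `Value()` is 0) must not overwrite the existing setting.

```go
package main

import "fmt"

// quantity is a hypothetical stand-in for a size type that exposes Value();
// it is not the type used in the PR.
type quantity struct{ bytes int64 }

// Value reports the size in bytes; the zero value of quantity reports 0.
func (q quantity) Value() int64 { return q.bytes }

// crioRuntime is a hypothetical slice of the runtime config being merged into.
type crioRuntime struct{ LogSizeMax int64 }

// mergeLogSizeMax copies the override only when it carries a non-zero value,
// so an unset field leaves the existing setting untouched.
func mergeLogSizeMax(dst *crioRuntime, override quantity) {
	if override.Value() != 0 {
		dst.LogSizeMax = override.Value()
	}
}

func main() {
	cfg := crioRuntime{LogSizeMax: -1} // arbitrary pre-existing setting

	mergeLogSizeMax(&cfg, quantity{}) // override not defined: nothing changes
	fmt.Println(cfg.LogSizeMax)

	mergeLogSizeMax(&cfg, quantity{bytes: 8192}) // explicit override is applied
	fmt.Println(cfg.LogSizeMax)
}
```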
this change includes:
- add test/e2e/ctrcfg_test that includes three basic tests for deploying a ctrcfg and seeing it correctly changed the runtime configuration
- move/refactor some code from test/e2e/mcd_test to test/e2e/utils_test to prevent duplication between the two files, while also being clear about what tests use what (utils currently houses functions used in both mcd and ctrcfg tests)
Signed-off-by: Peter Hunt <[email protected]>
force-pushed from 96339d9 to ff58ebb
|
/retest |
|
🎉 ok now it's ready (assuming the aws failure is unrelated, which I think it is?) |
|
/retest |
|
/retest 🤞 |
|
@haircommander: The following test failed, say
🎉 yep, it looks like it's ok to merge - we'll fix any flake afterwards |
|
/skip we need a backport for this for 4.3 as well right? @umohnani8 @haircommander |
|
[APPROVALNOTIFIER] This PR is APPROVED
This pull-request has been approved by: haircommander, runcom |
yep, backport here #1447 |
|
/bugzilla refresh |
- What I did
Added a ctrcfg e2e test, to hopefully begin to prevent regressions
To get this test to pass, some changes were also needed:
- How to verify it
go test -v -run="TestContainerRuntimeConfigPidsLimit" ./test/e2e/
- Description for the changelog
added simple e2e test for ContainerRuntimeConfigs
revendor cri-o to have an updated config structure
Note: this is WIP because I haven't seen the test pass yet (as I was working without the required patch). Instead of waiting for a cluster to provision manually, I'll now start letting prow do it. I also want to refactor the changes a bit