-
Notifications
You must be signed in to change notification settings - Fork 461
pkg/operator: report available when failing to progress #450
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: runcom The full list of commands accepted by this bot can be found here. The pull request process is described here DetailsNeeds approval from an approver in each of these files:
Approvers can indicate their approval by writing |
a583f1e to
aee0fbe
Compare
|
/hold cancel rebased |
aee0fbe to
1be24af
Compare
|
unit test flake #417 /retest |
|
new unit test flake #451 /retest |
|
/retest |
Signed-off-by: Antonio Murdaca <runcom@linux.com>
1be24af to
5158fe2
Compare
|
unit flake tracked here #449 /retest |
/retest |
|
I still think we shouldn't report available=true if we failed progressing as we may have touched some objects during syncFuncs (daemonsets and controllers) that may misbehave if they're at different versions... |
|
/hold because of my previous comment, seeking confirmation but my thoughts are:
|
|
@runcom I read the CVO docs as Available meaning it's at a specific version and functioning properly (for that version) somewhat divorced from the progressing & failing statuses. So that you could have an fail to progress but say a rollback that still left it available at a previous version or a failed progress that doesn't make anything degrade, etc.... However if the progressing failed, some elements were updated, some weren't and this will cause improper functioning then Available = False. |
|
This PR doesn't make sense for the MCO, when an upgrade for the MCO happens, this is gonna be roughly the flow:
Feel free to reopen or comment if my flow isn't right. |
- What I did
This PR
builds on top of #442 and itis the result of the conversation here: #442 (comment)The CVO docs tell us that we may still report Available=True when we fail to progress to a newer version and this patch accounts for that case. The unit test change is the fix to my assumption that if we fail progressing, we do not need to report available cause state may have changed already and the MCO's components aren't good anymore.
The patch still accounts for reporting Available=False when we fail to bootstrap.
@abhinavdahiya @smarterclayton ptal
- How to verify it
- Description for the changelog