Skip to content

Conversation

@ncdc
Copy link
Contributor

@ncdc ncdc commented Aug 7, 2023

No description provided.

@openshift-ci-robot openshift-ci-robot added jira/valid-reference Indicates that this PR references a valid Jira ticket of any type. jira/invalid-bug Indicates that a referenced Jira bug is invalid for the branch this PR is targeting. labels Aug 7, 2023
@openshift-ci-robot
Copy link
Contributor

@ncdc: This pull request references Jira Issue OCPBUGS-17418, which is invalid:

  • expected the bug to target the "4.14.0" version, but no target version was set

Comment /jira refresh to re-evaluate validity if changes to the Jira bug are made, or edit the title of this pull request to link to a different bug.

The bug has been updated to refer to the pull request using the external bug tracker.

Details

In response to this:

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@openshift-ci openshift-ci bot requested review from petr-muller and wking August 7, 2023 14:16
@petr-muller
Copy link
Member

/jira refresh

@openshift-ci-robot openshift-ci-robot added jira/valid-bug Indicates that a referenced Jira bug is valid for the branch this PR is targeting. and removed jira/invalid-bug Indicates that a referenced Jira bug is invalid for the branch this PR is targeting. labels Aug 7, 2023
@openshift-ci-robot
Copy link
Contributor

@petr-muller: This pull request references Jira Issue OCPBUGS-17418, which is valid. The bug has been moved to the POST state.

3 validation(s) were run on this bug
  • bug is open, matching expected state (open)
  • bug target version (4.14.0) matches configured target version for branch (4.14.0)
  • bug is in the state New, which is one of the valid states (NEW, ASSIGNED, POST)

Requesting review from QA contact:
/cc @jiajliu

Details

In response to this:

/jira refresh

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@openshift-ci openshift-ci bot requested a review from jiajliu August 7, 2023 14:26
Copy link
Member

@petr-muller petr-muller left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

/hold

LGTM, holding because Lala also said he's gonna take a look

if !ok {
klog.Errorf("Unexpected type %T", obj)
return
}
Copy link
Member

@petr-muller petr-muller Aug 7, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This looks good to me.

I have looked into our other DeleteFunc handlers and it seems like none of them need a similar treatment because they do not use the object from the informer but just queue something else for processing.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

And just added 2763543 to make that more obvious.

@openshift-ci openshift-ci bot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Aug 7, 2023
@openshift-ci openshift-ci bot added lgtm Indicates that a PR is ready to be merged. approved Indicates a PR has been approved by an approver from all required OWNERS files. and removed lgtm Indicates that a PR is ready to be merged. labels Aug 7, 2023
The exposure has been around since 4.8 added admin acks [1], and we're
not sure if watch-interruption has increased recently,
relevant-ConfigMap deletion has increased, and/or our panic-monitoring
has improved.

[1]: https://github.com/openshift/cluster-version-operator/pull/647/files#diff-45df4901f88b6867d7fbf7e50690f376812864a0b85ec80dda5f77e6e19097b9R396-R402

Signed-off-by: Andy Goldstein <[email protected]>
@wking
Copy link
Member

wking commented Aug 7, 2023

Run the suite that panicked, to ensure we excercise the adjusted ConfigMap delete handler (even if we probably miss the well-timed watch-interruption that triggered this panic):

/payload-job periodic-ci-openshift-release-master-ci-4.14-e2e-aws-sdn-serial

@openshift-ci
Copy link
Contributor

openshift-ci bot commented Aug 7, 2023

@wking: trigger 1 job(s) for the /payload-(job|aggregate) command

  • periodic-ci-openshift-release-master-ci-4.14-e2e-aws-sdn-serial

See details on https://pr-payload-tests.ci.openshift.org/runs/ci/6e37dea0-3538-11ee-9524-c74145fe9934-0

Copy link
Member

@LalatenduMohanty LalatenduMohanty left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

New commit message suggestion from @wking which looks good to me.

This is to hint to any future developers that there is no existing operand handling in the functions, and they need to add all of that themselves, including (hopefully they realize) DeletedFinalStateUnknown handling.  That's a fair bit of hope to get DeletedFinalStateUnknown in place, but calling that specific aspect out in a comment or something would risk them reading the comment and assuming we were giving them an exhaustive list of guards to consider, when they may need additional guards to match distant-future informer changes (they hopefully read [1]).  And the amount of churn expected for our other delete handlers is low, so investing even more time in trying to protect those future devs seems uneccessary.

[1]: https://pkg.go.dev/k8s.io/client-go/tools/cache#ResourceEventHandler

This is to hint to any future developers that there is no existing
event object handling in the functions, and they need to add all of that
themselves, including (hopefully they realize) DeletedFinalStateUnknown
handling.  That's a fair bit of hope to get DeletedFinalStateUnknown in
place, but calling that specific aspect out in a comment or something
would risk them reading the comment and assuming we were giving them an
exhaustive list of guards to consider, when they may need additional
guards to match distant-future informer changes (they hopefully read
[1]).  And the amount of churn expected for our other delete handlers is
low, so investing even more time in trying to protect those future devs
seems uneccessary.

[1]:
https://pkg.go.dev/k8s.io/client-go/tools/cache#ResourceEventHandler

Signed-off-by: Andy Goldstein <[email protected]>
Copy link
Member

@LalatenduMohanty LalatenduMohanty left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

/lgtm

@openshift-ci
Copy link
Contributor

openshift-ci bot commented Aug 7, 2023

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: LalatenduMohanty, ncdc, petr-muller

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details Needs approval from an approver in each of these files:
  • OWNERS [LalatenduMohanty,petr-muller]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-ci openshift-ci bot added the lgtm Indicates that a PR is ready to be merged. label Aug 7, 2023
@wking
Copy link
Member

wking commented Aug 7, 2023

Serial rehearsal hiccuped while reporting Telemetry, but that's unrelated to this change, which suggests that the new code doesn't break under common ConfigMap-deletion conditions. We don't know for sure whether there was a watch hiccup overlapping with a deletion, but 🤷, we can keep an eye out for that in post-merge periodics.

/hold cancel

@openshift-ci openshift-ci bot removed the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Aug 7, 2023
@openshift-merge-robot openshift-merge-robot merged commit 86e24d6 into openshift:master Aug 7, 2023
@openshift-ci-robot
Copy link
Contributor

@ncdc: Jira Issue OCPBUGS-17418: All pull requests linked via external trackers have merged:

Jira Issue OCPBUGS-17418 has been moved to the MODIFIED state.

Details

In response to this:

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

approved Indicates a PR has been approved by an approver from all required OWNERS files. jira/valid-bug Indicates that a referenced Jira bug is valid for the branch this PR is targeting. jira/valid-reference Indicates that this PR references a valid Jira ticket of any type. lgtm Indicates that a PR is ready to be merged.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants