E2E: waiting for services to be deleted before proceeding#2157
Conversation
| return nil | ||
| }, deleteOperationTimeout, retryableOperationSleepBetweenRetries).Should(Succeed()) | ||
| }, retryableOperationTimeout, retryableOperationSleepBetweenRetries).Should(Succeed()) | ||
| Logf("waiting for the external LB service to be deleted: %s", elbService.Name) |
There was a problem hiding this comment.
did this to be consistent but need to check if it adds a lot of time to e2e duration since we don't strictly need to wait on this one (it's the last service to be reconciled)
There was a problem hiding this comment.
Interesting, so we have actually not been validating ELB delete (ILBs and ELBs are ultimately the same Azure LoadBalancer type, so not super critical, but I agree it's best-practice to not make an exception here)
|
/assign @jackfrancis |
|
@jackfrancis I got this flake: which suggests a new flake after #2123 merged /retest |
|
@CecileRobertMichon indeed. I'll open a PR to disable those drain tests. Recall they were never actually running w/ validation turned on, so we need to go through the process of tuning them and actually implementing them in a useable way. |
CecileRobertMichon
left a comment
There was a problem hiding this comment.
/approve
|
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: CecileRobertMichon The full list of commands accepted by this bot can be found here. The pull request process is described here DetailsNeeds approval from an approver in each of these files:
Approvers can indicate their approval by writing |
What type of PR is this?
/kind failing-test
What this PR does / why we need it: following up on this slack thread, the Delete() client doesn't actually wait for a service to be deleted, it simply triggers the delete. cloud-provider-azure has a Finalizer on the Service resource as it needs to reconcile load balancers before the service can be deleted, which means it can take several minutes (sometimes even more) for the service to actually be gone. However, cloud-provider only reconciles one service at a time. This means before the ELB service starts reconciling, the ILB service first needs to be done deleting. By not waiting for the ILB to fully delete, we run into occasional flakes as the timeout for creating the ELB service also includes deleting the previous service.
Which issue(s) this PR fixes (optional, in
fixes #<issue number>(, fixes #<issue_number>, ...)format, will close the issue(s) when PR gets merged):Fixes #
Special notes for your reviewer:
Please confirm that if this PR changes any image versions, then that's the sole change this PR makes.
TODOs:
Release note: