test/e2e: More refactoring #765

cgwalters · 2019-05-16T14:02:21Z

Rather than poll all of the daemons, add a helper that waits
for a pool to complete a config.

One of our tests walks over the MCDs, change it to just assert
on all of the nodes.

The SSH test can also just wait for a pool and then rsh to
each node.

runcom · 2019-05-16T14:05:02Z

test/e2e/mcd_test.go

isn't this racy if we jump here before the pool starts rolling? I believe that's why I went node by node confirming current is there and of the new pool

ok, you added a check below but this may flake then?

Just above that we do waitForRenderedConfig right?

For the record though I didn't test this much locally, created a PR to have CI do it.

oh yeah, I had an old master on my laptop and didn't notice that, cool yeah, no flake no race 🎉

cgwalters · 2019-05-16T16:20:26Z

Hum, so that run failed; added some more debug logs to help me figure it out. Offhand...I think there is a race here because the tests were relying on checking for config=X and status=Updated, but we don't update those atomically. Really...it feels like we should have Spec.Configuration and Status.Configuration just like other objects right?

kikisdeliveryservice · 2019-05-16T17:26:46Z

would checking if state is Done before checking that they match clarify what's happening (i.e. switch the 2 asserts)?

assert.Equal(t, constants.MachineConfigDaemonStateDone, node.Annotations[constants.MachineConfigDaemonStateAnnotationKey])
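The suggested ordering can be sketched as a small per-node check that looks at the daemon state first and only then compares configs, so a failure says whether the node was still working or had settled on the wrong config. This is a hedged illustration: `checkNode` is a hypothetical helper, and the literal annotation strings are assumptions about what the `constants` package defines.

```go
package main

import "fmt"

// Annotation keys and the Done value are assumed to mirror the MCD
// constants package; treat the literal strings as assumptions.
const (
	stateKey         = "machineconfiguration.openshift.io/state"
	currentConfigKey = "machineconfiguration.openshift.io/currentConfig"
	stateDone        = "Done"
)

// checkNode returns the first failed expectation for one node's annotations,
// checking the daemon state before comparing the current config.
func checkNode(name string, annotations map[string]string, wantConfig string) error {
	if got := annotations[stateKey]; got != stateDone {
		return fmt.Errorf("node %s: state is %q, want %q", name, got, stateDone)
	}
	if got := annotations[currentConfigKey]; got != wantConfig {
		return fmt.Errorf("node %s: currentConfig is %q, want %q", name, got, wantConfig)
	}
	return nil
}

func main() {
	// A node that is still applying a config fails on state, not on config.
	node := map[string]string{
		stateKey:         "Working",
		currentConfigKey: "rendered-old",
	}
	fmt.Println(checkNode("worker-0", node, "rendered-new"))
}
```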

cgwalters · 2019-05-16T20:18:19Z

OK this test passes "locally" now - I added a time.Sleep(13*time.Second) as a temporary hack. Pretty sure the race is #765 (comment) - going to take a quick look at a PR for that.

cgwalters · 2019-05-16T21:19:51Z

Hooray, e2e-aws-op passed!

#773 should help us avoid the race.

kikisdeliveryservice · 2019-05-16T22:03:58Z

I've seen these weird e2e-aws-upgrade failures before, checking them out elsewhere. Let's try again:
/test e2e-aws-upgrade

cgwalters · 2019-05-17T18:35:50Z

Rebased 🏄‍♂️ and CI is good, can I get a lgtm?

runcom · 2019-05-17T18:45:17Z

/lgtm

openshift-ci-robot · 2019-05-17T18:45:37Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: cgwalters, runcom

Needs approval from an approver in each of these files:

~~OWNERS~~ [cgwalters,runcom]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

See openshift#765 (comment)

MachineConfigPool needs a `Spec.Configuration` and `Status.Configuration` [just like other objects][1] so that we can properly detect state.

Currently there's a race because the render controller may set `Status.Configuration` while the pool's `Status` still has `Updated`, so one can't reliably check whether the pool is at a given config.

With this, ownership is clear: the render controller sets the spec, and the node controller updates the status.

[1] https://github.com/kubernetes/community/blob/master/contributors/devel/sig-architecture/api-conventions.md#spec-and-status
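A rough sketch of the spec/status split described above, using simplified hypothetical types (`poolSpec`, `poolStatus`, `machineConfigPool`) rather than the real API structs. It is only meant to show why a separate desired and observed config lets a test tell "rollout not started yet" apart from "rollout finished".

```go
package main

import "fmt"

// poolSpec holds the desired config; in the proposed design only the render
// controller writes it.
type poolSpec struct {
	Configuration string
}

// poolStatus holds the observed state; in the proposed design only the node
// controller writes it.
type poolStatus struct {
	Configuration string
	Updated       bool
}

// machineConfigPool is a simplified, hypothetical stand-in for the real type.
type machineConfigPool struct {
	Name   string
	Spec   poolSpec
	Status poolStatus
}

// poolAtConfig reports whether the pool both targets and has reached target.
func poolAtConfig(p machineConfigPool, target string) bool {
	return p.Spec.Configuration == target &&
		p.Status.Configuration == target &&
		p.Status.Updated
}

func main() {
	// A new config was just rendered: spec moved, status still describes the
	// previous rollout and still says Updated.
	p := machineConfigPool{
		Name:   "worker",
		Spec:   poolSpec{Configuration: "rendered-new"},
		Status: poolStatus{Configuration: "rendered-old", Updated: true},
	}
	fmt.Println("at new config before rollout:", poolAtConfig(p, "rendered-new"))

	// The node controller finishes the rollout and records it in status.
	p.Status = poolStatus{Configuration: "rendered-new", Updated: true}
	fmt.Println("at new config after rollout:", poolAtConfig(p, "rendered-new"))
}
```

With a single config field, the first state above would be indistinguishable from a completed rollout, which is the race the tests were hitting.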

openshift-ci-robot added labels size/L (denotes a PR that changes 100-499 lines, ignoring generated files) and approved (indicates a PR has been approved by an approver from all required OWNERS files) May 16, 2019

openshift-ci-robot requested review from LorbusChris and kikisdeliveryservice May 16, 2019 14:02

runcom reviewed May 16, 2019

cgwalters force-pushed the test-cleanup branch from f4c406a to 938fe5f May 16, 2019 16:11

cgwalters force-pushed the test-cleanup branch from 938fe5f to 13c2096 May 16, 2019 20:17

cgwalters mentioned this pull request May 16, 2019

Add a Spec.Configuration to MachineConfigPool #773

Merged

test/e2e: More refactoring

9970f24

cgwalters force-pushed the test-cleanup branch from 13c2096 to 9970f24 May 17, 2019 12:40

openshift-ci-robot assigned runcom May 17, 2019

openshift-ci-robot added the lgtm label (indicates that a PR is ready to be merged) May 17, 2019

openshift-merge-robot merged commit e861ccb into openshift:master May 17, 2019

cgwalters mentioned this pull request Jun 14, 2019

[release-4.1] Bug 1706082: Add Spec.Configuration to MachineConfigPool, render controller writes it #856

Merged


5 participants
