HAProxy Router: Invert health-check idled check #10596

DirectXMan12 · 2016-08-23T15:33:56Z

The check for no endpoints got inverted at some point during the PR,
causing health checks to be enabled only when a service was idled,
instead of the other way around. This fixes that.

Fixes bug 1366180
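
For readers following along, here is a minimal sketch of the kind of inversion described above. It is not the router's actual code: `serviceUnit`, `isIdled`, `healthChecksBuggy`, and `healthChecksFixed` are hypothetical names, and it assumes an idled service can be modeled simply as one with no endpoints.

```go
package main

import "fmt"

// serviceUnit is a hypothetical stand-in for the router's view of a service.
type serviceUnit struct {
	Name      string
	Endpoints []string // backend addresses; assumed empty when the service is idled
}

// isIdled reports whether the service currently has no endpoints.
func isIdled(s serviceUnit) bool {
	return len(s.Endpoints) == 0
}

// healthChecksBuggy reproduces the inverted condition: checks are enabled
// only when the service is idled.
func healthChecksBuggy(s serviceUnit) bool {
	return isIdled(s)
}

// healthChecksFixed is the intended condition: checks are enabled only when
// the service has endpoints to check.
func healthChecksFixed(s serviceUnit) bool {
	return !isIdled(s)
}

func main() {
	idled := serviceUnit{Name: "idled-svc"}
	active := serviceUnit{Name: "active-svc", Endpoints: []string{"10.1.0.5:8080"}}

	for _, s := range []serviceUnit{idled, active} {
		fmt.Printf("%s: buggy=%v fixed=%v\n", s.Name, healthChecksBuggy(s), healthChecksFixed(s))
	}
}
```

The two predicates disagree on every input, which is why the regression affected idled and non-idled services alike.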

smarterclayton · 2016-08-23T15:37:04Z

Since this regressed the product, please add an integration or e2e test that validates this is correct so it does not break in the future.

DirectXMan12 · 2016-08-23T15:37:40Z

cc @knobunc

DirectXMan12 · 2016-08-23T15:42:14Z

[test]

DirectXMan12 · 2016-08-23T17:29:59Z

Alright, I had an extended test which could potentially have been a bit flaky, but I've changed it so that it should not be (the default health check interval is 5 seconds, and I've upped the "consistently" time period to 20 seconds). However, I do not think that extended test was running: I just looked through a Jenkins result, and only saw the networking and conformance extended tests being run.

@smarterclayton do we not run most of the extended tests? If that's the case, should I add a new section to the e2e test, or what?

smarterclayton · 2016-08-23T17:35:01Z

You should add [conformance] to your test. If you have fast, core function tests, always add conformance preemptively. If your tests aren't fast, make them fast.

knobunc · 2016-08-23T18:01:59Z

@smarterclayton Define fast? This is testing a negative. So the test has to sleep to make sure something doesn't happen within a reasonable amount of time.

DirectXMan12 · 2016-08-23T18:02:07Z

@smarterclayton these tests are somewhat necessarily slow (they run in about 43s each for the "idling only" tests on my machine). We have to wait for endpoints to come up (which may take a few seconds), and then wait at least 10s (currently 20s to be safe) to make sure we've gone through what would be at least one health check interval (5s) if the health checks were running, checking the whole time that no pods come up. I'm assuming anything reported as slow by the test runner is too slow to be marked as conformance, right?

knobunc · 2016-08-23T18:02:32Z

The change LGTM.

smarterclayton · 2016-08-23T18:04:52Z

43 seconds is not too slow. The test runner doesn't know what slow is really. Slow would be 2 minutes probably.

DirectXMan12 · 2016-08-23T18:16:15Z

@smarterclayton ack, I'll stick conformance on the appropriate ones.

DirectXMan12 · 2016-08-23T18:24:30Z

I've marked the entire idling test suite (~5-6m total) as "[Conformance]". If that's too much, I can just mark a couple of tests (which would be ~1m30s) that cover this case and basic unidling instead.

DirectXMan12 · 2016-08-23T18:24:37Z

[test]

smarterclayton · 2016-08-23T18:35:40Z

your budget is 2 minutes

DirectXMan12 · 2016-08-23T18:41:34Z

ack, I'll just make that two then

DirectXMan12 · 2016-08-23T18:45:16Z

alright, the basic idling with DC and basic unidling with TCP are now marked as conformance

DirectXMan12 · 2016-08-23T19:01:55Z

[test]

openshift-bot · 2016-08-23T19:02:23Z

Evaluated for origin test up to b08af4a

smarterclayton · 2016-08-23T19:04:36Z

LGTM [merge]

openshift-bot · 2016-08-23T19:07:23Z

continuous-integration/openshift-jenkins/merge SUCCESS (https://ci.openshift.redhat.com/jenkins/job/test_pr_origin/8383/) (Image: devenv-rhel7_4914)

openshift-bot · 2016-08-23T19:07:23Z

Evaluated for origin merge up to b08af4a

openshift-bot · 2016-08-23T20:32:23Z

continuous-integration/openshift-jenkins/test FAILURE (https://ci.openshift.redhat.com/jenkins/job/test_pr_origin/8373/)

DirectXMan12 · 2016-08-23T20:37:42Z

Looks like a flake on the DC deployment conformance tests:

should run a deployment to completion and then scale to zero
Started deployment #4
Error from server: The get operation against ReplicationController could not be completed at this time, please try again.

smarterclayton · 2016-08-23T21:13:51Z

Please link the appropriate flake issue.

smarterclayton added the priority/P0 label Aug 23, 2016

smarterclayton added this to the 1.3.0 milestone Aug 23, 2016

DirectXMan12 force-pushed the bug/reencrypt-unidling-fixed branch from 7d3dd11 to e8454b6 Compare August 23, 2016 17:27

DirectXMan12 force-pushed the bug/reencrypt-unidling-fixed branch from e8454b6 to be3ba8c Compare August 23, 2016 18:21

DirectXMan12 force-pushed the bug/reencrypt-unidling-fixed branch from be3ba8c to b08af4a Compare August 23, 2016 18:44

HAProxy Router: Invert health-check idled check

b08af4a

openshift-bot merged commit 6fd783f into openshift:master Aug 24, 2016

DirectXMan12 deleted the bug/reencrypt-unidling-fixed branch August 24, 2016 14:12
