@sallyom
Contributor

@sallyom sallyom commented Nov 26, 2018

@crawford
This PR

  • adds info about checking for nodes and waiting for workers once the install completes
  • returns an error if the bootstrap process exceeds the timeout (rather than returning nil)

@droslean noticed that the successful logComplete message was still shown after an install exited because the timeout was exceeded, since that error was not being returned. Is there any reason it should not return the error there?
EDIT: @wking suggested reverting #615 instead, now that we're comfortable with the robustness of #606 (see below).

@openshift-ci-robot openshift-ci-robot added the size/S Denotes a PR that changes 10-29 lines, ignoring generated files. label Nov 26, 2018
@openshift-ci-robot
Contributor

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: sallyom
To fully approve this pull request, please assign additional approvers.
We suggest the following additional approver: abhinavdahiya

If they are not already assigned, you can assign the PR to them by writing /assign @abhinavdahiya in a comment when ready.

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@sallyom sallyom force-pushed the clarify-install-complete-msg branch 2 times, most recently from 36c8487 to bdc829e Compare November 26, 2018 19:20
Member

Why ignore the other kinds of errors? I'd rather just revert #615 now that we're comfortable with the robustness of #606.

Contributor Author

Sounds good to me. I'll remove this from this PR and keep the log message edits in.

Contributor Author

removed

Member

Is this the recommended approach? It seems like oc get nodes -w might be more efficient (no polling). But it wouldn't be worker-specific. Maybe you could address that with --selector? Or maybe there are likely to be few enough masters (compared to workers) in most (all?) clusters that you don't need to bother filtering the masters out here?

Contributor Author

I was going to ask (you) what the most efficient way was :) but do you agree that suggesting that one wait for workers to be created is appropriate?
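
The watch-versus-poll question in this thread can be illustrated with a small Go sketch that consumes a stream of node events and filters them by role label, rather than re-listing nodes on a timer. It is only an illustration under stated assumptions: `nodeEvent` and `waitForWorkers` are hypothetical, the channel stands in for a real watch, and the label keys are assumed values that should be checked against the labels on an actual cluster.

```go
package main

import "fmt"

// nodeEvent is a hypothetical, simplified stand-in for a node watch event.
type nodeEvent struct {
	Name   string
	Labels map[string]string
	Ready  bool
}

// workerLabel is an assumed role label key, used here only for illustration.
const workerLabel = "node-role.kubernetes.io/worker"

// waitForWorkers consumes events until `want` distinct worker nodes are
// Ready, or until the event stream is closed. Nodes without the worker
// label (for example masters) are ignored.
func waitForWorkers(events <-chan nodeEvent, want int) (int, error) {
	if want == 0 {
		// A cluster with zero workers is valid; there is nothing to wait for.
		return 0, nil
	}
	ready := map[string]bool{}
	for ev := range events {
		if _, ok := ev.Labels[workerLabel]; !ok {
			continue
		}
		if ev.Ready {
			ready[ev.Name] = true
		} else {
			delete(ready, ev.Name)
		}
		if len(ready) >= want {
			return len(ready), nil
		}
	}
	return len(ready), fmt.Errorf("event stream closed with %d of %d workers ready", len(ready), want)
}

func main() {
	events := make(chan nodeEvent, 3)
	// The master label key below is likewise an assumption.
	events <- nodeEvent{Name: "master-0", Labels: map[string]string{"node-role.kubernetes.io/master": ""}, Ready: true}
	events <- nodeEvent{Name: "worker-0", Labels: map[string]string{workerLabel: ""}, Ready: false}
	events <- nodeEvent{Name: "worker-0", Labels: map[string]string{workerLabel: ""}, Ready: true}
	close(events)

	n, err := waitForWorkers(events, 1)
	fmt.Println(n, err)
}
```

Treating a requested count of zero as immediate success keeps the sketch valid for clusters that are intentionally brought up without workers.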

@sallyom sallyom force-pushed the clarify-install-complete-msg branch from bdc829e to f62e75f Compare November 27, 2018 15:08
@openshift-ci-robot openshift-ci-robot added size/XS Denotes a PR that changes 0-9 lines, ignoring generated files. and removed size/S Denotes a PR that changes 10-29 lines, ignoring generated files. labels Nov 27, 2018
@openshift-ci-robot
Contributor

@sallyom: The following tests failed, say /retest to rerun them all:

| Test name | Commit | Details | Rerun command |
| --- | --- | --- | --- |
| ci/prow/e2e-aws | f62e75f | link | /test e2e-aws |
| ci/prow/e2e-libvirt | f62e75f | link | /test e2e-libvirt |

Please help us cut down on flakes by linking to an open issue when you hit one in your PR.

@abhinavdahiya
Contributor

IMO the installer can bring up a cluster with zero workers, so I'm not sure this is always going to be correct.

@sallyom
Contributor Author

sallyom commented Nov 27, 2018

@abhinavdahiya yes, you're right there. I'm closing this. The real reason this was opened was to error out, instead of completing successfully, if the bootstrap process exceeded the timeout, and @wking suggested that reverting #615 is a better approach. Closing... although the wording of "Run 'export KUBECONFIG=%s' to manage your cluster." bugs me every time I see it, and I'll find another PR to change it to "to manage the installed cluster".

@sallyom sallyom closed this Nov 27, 2018
@wking
Member

wking commented Nov 27, 2018

I've filed #741 reverting #615.
