*: use HTTPS when serving Ignition configs by crawford · Pull Request #3294 · coreos/tectonic-installer

crawford · 2018-06-14T22:44:03Z

No description provided.

coreosbot · 2018-06-14T22:44:04Z

Can one of the admins verify this patch?

crawford · 2018-06-14T22:44:13Z

Continuation of https://github.com/coreos-inc/tectonic-operators/pull/393.

crawford · 2018-06-15T00:18:24Z

retest this please

yifan-gu · 2018-06-15T01:44:03Z

retest this please.

enxebre · 2018-06-15T09:20:56Z

steps/assets/base/tectonic.tf

  kube_ca_key_pem              = "${local.kube_ca_key_pem}"
  kubelet_cert_pem             = "${local.kubelet_cert_pem}"
  kubelet_key_pem              = "${local.kubelet_key_pem}"
+  tnc_cert_pem                 = "${local.tnc_cert_pem}"


where are these locals being defined?

They were added in 37f29a5.

enxebre · 2018-06-15T09:21:59Z

installer/pkg/config-generator/ignition.go

-		}
+		u = func() *url.URL {
+			return &url.URL{
+				Scheme: "https",


this won't work for the bootstrap node request resolved to s3 as the tls aws certs does not know about this domain

Christ, we need to kill off this S3-pivot insanity.

crawford · 2018-06-15T21:31:04Z

The masters are still trying to use HTTPS to fetch the config. I have no idea why. This works on my local system.

yifan-gu · 2018-06-15T22:38:25Z

installer/pkg/config-generator/ignition.go

+	// XXX: The bootstrap node on AWS uses a CNAME to redirect TNC-bound
+	// traffic to S3. Because of this, HTTPS cannot be used.
+	scheme := "https"
+	if c.Platform == "AWS" && role == "master" {


https://github.com/coreos/tectonic-installer/blob/master/installer/pkg/config/cluster.go#L25

Also calling I think it's advised to c.Platform.String()

yifan-gu · 2018-06-15T22:51:33Z

modules/bootkube/resources/bootkube.sh

+    i=$((i+1))
+    [ $i -eq 10 ] && echo "etcdctl failed too many times." && exit 1
+
+    echo "etcdctl failed. Retrying in 5 seconds..."


Doesn't etcdctl endpoint health have its own retry logic?
cc @gyuho

I observed it failing to retry.

The retry logic would depend on the error types. Since endpoint health is just a simple get request, it's safe to retry here.

For some reason, bcrypt and blowfish were left around the last time glide was run.

This version implements the v2.2.0 config spec, which is needed in order to customize the certificate authorities. There were a number of code changes that needed to be made as well due to the change in types.

Instead of comparing against strings, this should use the platform constants.

This injects the root certificate authority into the stub configs and switches the TNC URL to HTTPS.

The new TNCO now uses HTTPS after bootstrap.

When this secret was created by tectonic.service, there was a race condition triggered when TNCO created TNC too quickly. tectonic.sh would get stuck waiting for all pods to get to the Running state because TNC would never leave the ContainerCreating state. TNC needed the TLS assets in order to complete, but those aren't created by tectonic.sh until after all of the pods are running. This moves the secret creation earlier in the process to avoid the race.

The call to etcdctl sometimes fails with a TCP error. Retry this operation a few times before giving up.

crawford · 2018-06-18T18:38:05Z

retest this please

yifan-gu · 2018-06-18T19:01:03Z

retest this please

yifan-gu · 2018-06-18T19:43:14Z

retest this please

enxebre · 2018-06-19T08:27:48Z

retest this please

Catching up with c71394d (vendor: bump Ignition to v0.26.0, 2018-06-14, coreos/tectonic-installer#3294). It would be nice if we could prune the vendored Go for older ignition config versions, but they're currently chained for translation: $ git grep coreos/ignition/config/v2_1 | grep '/v2_2/.*.go:' vendor/github.com/coreos/ignition/config/v2_2/config.go: "github.com/coreos/ignition/config/v2_1" vendor/github.com/coreos/ignition/config/v2_2/translate.go: v2_1 "github.com/coreos/ignition/config/v2_1/types" I've filed [1] to ask for that. [1]: coreos/ignition#602

crawford added platform/aws run-smoke-tests labels Jun 14, 2018

enxebre reviewed Jun 15, 2018

View reviewed changes

yifan-gu reviewed Jun 15, 2018

View reviewed changes

crawford added 7 commits June 18, 2018 09:27

vendor: re-run glide

b24fa61

For some reason, bcrypt and blowfish were left around the last time glide was run.

vendor: bump Ignition to v0.26.0

c71394d

This version implements the v2.2.0 config spec, which is needed in order to customize the certificate authorities. There were a number of code changes that needed to be made as well due to the change in types.

installer/pkg/config-generator: fix platforms

3f233ee

Instead of comparing against strings, this should use the platform constants.

installer/pkg/config-generator: switch to HTTPS

fe51256

This injects the root certificate authority into the stub configs and switches the TNC URL to HTTPS.

config.tf: bump operator images

0482a9f

The new TNCO now uses HTTPS after bootstrap.

modules/bootkube: make service more robust

b2c439b

The call to etcdctl sometimes fails with a TCP error. Retry this operation a few times before giving up.

enxebre approved these changes Jun 19, 2018

View reviewed changes

enxebre merged commit 17fbef2 into coreos:master Jun 19, 2018

crawford deleted the https branch June 19, 2018 17:08

wking mentioned this pull request Jul 23, 2018

installer/pkg/config/fixtures: Bump to version 2.2.0 openshift/installer#64

Merged

Conversation

crawford commented Jun 14, 2018

Uh oh!

coreosbot commented Jun 14, 2018

Uh oh!

crawford commented Jun 14, 2018

Uh oh!

crawford commented Jun 15, 2018

Uh oh!

yifan-gu commented Jun 15, 2018

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

crawford commented Jun 15, 2018

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

crawford commented Jun 18, 2018

Uh oh!

yifan-gu commented Jun 18, 2018

Uh oh!

yifan-gu commented Jun 18, 2018

Uh oh!

enxebre commented Jun 19, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants