
Conversation

@2uasimojo 2uasimojo commented Aug 3, 2021

This commit adds a mechanism for calculating a hash representing the following fields in ClusterPool.Spec:

  • Platform
  • BaseDomain
  • ImageSetRef
  • InstallConfigSecretTemplateRef

When creating a new ClusterDeployment for the pool, this hash is stored in a new annotation on the ClusterDeployment.

The clusterpool controller checks all unassigned (installing or ready) ClusterDeployments and sets a new condition on the ClusterPool. This condition is:

  • "True" when all CDs are up to date with the pool (including the edge case where there are no CDs);
  • "False" when one or more CDs are stale;
  • "Unknown" if the annotation is missing from one or more CDs.
  • (Also "Unknown" if we haven't had a chance to figure it out yet.)

Of note:

  • When "False" or "Unknown", the condition's Message contains a list of the offending CDs (in the spirit of ClusterSync's "Failed" condition).
  • "False" takes precedence over "Unknown" (including: we don't try to combine the CD lists in the Message).

HIVE-1058
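
For illustration, a minimal sketch of how a controller might classify CDs and pick the condition status per the rules above (the identifiers, annotation key, and messages are placeholders, not the actual hive code):

package sketch

import (
	"fmt"
	"strings"

	corev1 "k8s.io/api/core/v1"

	hivev1 "github.com/openshift/hive/apis/hive/v1"
)

// poolSpecHashAnnotation is an assumed key; the description above only says
// the hash is stored "in a new annotation on the ClusterDeployment".
const poolSpecHashAnnotation = "hive.openshift.io/cluster-pool-spec-hash"

// cdsCurrentStatus computes the status and message for the new condition:
// "False" beats "Unknown", and an empty pool counts as "True".
func cdsCurrentStatus(unassigned []*hivev1.ClusterDeployment, poolHash string) (corev1.ConditionStatus, string) {
	var stale, unknown []string
	for _, cd := range unassigned {
		switch hash, ok := cd.Annotations[poolSpecHashAnnotation]; {
		case !ok:
			unknown = append(unknown, cd.Name)
		case hash != poolHash:
			stale = append(stale, cd.Name)
		}
	}
	switch {
	case len(stale) > 0:
		return corev1.ConditionFalse, fmt.Sprintf("stale ClusterDeployments: %s", strings.Join(stale, ", "))
	case len(unknown) > 0:
		return corev1.ConditionUnknown, fmt.Sprintf("ClusterDeployments missing the pool spec hash annotation: %s", strings.Join(unknown, ", "))
	default:
		return corev1.ConditionTrue, "All unassigned ClusterDeployments match the pool configuration"
	}
}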

@openshift-ci openshift-ci bot requested review from suhanime and twiest August 3, 2021 21:50
@openshift-ci openshift-ci bot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Aug 3, 2021
@2uasimojo (Member Author)

This is currently stacked on #1474.

/hold

until that merges. Meanwhile, review only the second commit.

@openshift-ci openshift-ci bot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Aug 3, 2021
@2uasimojo (Member Author)

/test e2e

/test e2e-pool

@2uasimojo (Member Author)

/test e2e-pool

@2uasimojo (Member Author)

/retest

@2uasimojo (Member Author)

These test failures are bogus*. No idea what's going on here. I even tried deleting the builds in the CI env. Anyway, I imagine we'll kick something over when this gets rebased after #1474 merges.

*except for e2e-pool, duh -- #1482 hasn't merged yet 🙄

github.com/aws/aws-sdk-go v1.38.41
github.com/blang/semver/v4 v4.0.0
github.com/davecgh/go-spew v1.1.1
github.com/davegardnerisme/deephash v0.0.0-20210406090112-6d072427d830

Contributor

maybe use what we do elsewhere in the code:

hasher := md5.New()
hasher.Write([]byte(fmt.Sprintf("%v", hiveControllersConfigMap.Data)))
for _, h := range additionalControllerConfigHashes {
	hasher.Write([]byte(h))
}
return hex.EncodeToString(hasher.Sum(nil))

@2uasimojo (Member Author)

I'll look into that. Is %v guaranteed to produce the same thing for (different instances of) identical objects? I know some languages don't/didn't guarantee map ordering for example.

@2uasimojo (Member Author)

So yeah, I can't prove that %v breaks with map key order, but I can prove it doesn't play nice with pointers, whereas deephash does The Right Thing™: https://play.golang.org/p/1SQUEeJfOND
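
A tiny illustration of that point (the struct here is made up, mirroring the playground link, and deephash's Hash(interface{}) []byte helper is assumed from the library docs):

package sketch

import (
	"bytes"
	"fmt"

	"github.com/davegardnerisme/deephash"
)

type fakeSpec struct {
	BaseDomain *string
}

func demo() {
	a, b := "example.com", "example.com"
	s1, s2 := fakeSpec{BaseDomain: &a}, fakeSpec{BaseDomain: &b}

	// %v renders the pointer's address, so two structurally identical values
	// format differently (and would hash differently) on every run.
	fmt.Println(fmt.Sprintf("%v", s1) == fmt.Sprintf("%v", s2)) // false

	// deephash follows the pointer and hashes the pointed-to value, so
	// identical content produces identical hashes.
	fmt.Println(bytes.Equal(deephash.Hash(s1), deephash.Hash(s2))) // true
}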

// gives a quick indication as to whether the ClusterDeployment's configuration matches that of
// the pool.
// +optional
PoolVersion string `json:"poolVersion,omitempty"`

Contributor

Version, to me, carries a lot of baggage about the format; an opaque digest doesn't fit it well. Can we use some other name here?
🤔

@2uasimojo (Member Author)

I was following the naming of ResourceVersion, which is a close corollary to this field: an opaque string that you're only supposed to compare for (in)equality to see if something has changed. Open to other suggestions.

@2uasimojo (Member Author)

I named the annotation cluster-pool-spec-hash. There are still vestiges of poolVersion in comments, variable names, method names. Hope that's acceptable. (I really didn't want to be naming variables clusterPoolSpecHash...)

codecov bot commented Aug 4, 2021

Codecov Report

Merging #1484 (57147b2) into master (855a640) will decrease coverage by 0.11%.
The diff coverage is n/a.

❗ Current head 57147b2 differs from pull request most recent head 3dc3c3c. Consider uploading reports for the commit 3dc3c3c to get more accurate results
Impacted file tree graph

@@            Coverage Diff             @@
##           master    #1484      +/-   ##
==========================================
- Coverage   41.35%   41.23%   -0.12%     
==========================================
  Files         335      334       -1     
  Lines       30455    30265     -190     
==========================================
- Hits        12594    12481     -113     
+ Misses      16782    16722      -60     
+ Partials     1079     1062      -17     
Impacted Files Coverage Δ
pkg/controller/utils/utils.go 79.14% <0.00%> (-0.33%) ⬇️
.../clusterdeployment/clusterdeployment_controller.go 59.92% <0.00%> (-0.09%) ⬇️
...controller/clusterclaim/clusterclaim_controller.go 60.59% <0.00%> (-0.07%) ⬇️
pkg/controller/utils/errorscrub.go 100.00% <0.00%> (ø)
.../openshift/hive/apis/hive/v1/clusterclaim_types.go 100.00% <0.00%> (ø)
pkg/controller/clusterpool/collections.go
...roller/argocdregister/argocdregister_controller.go 51.98% <0.00%> (+0.50%) ⬆️
...g/controller/clusterpool/clusterpool_controller.go 59.48% <0.00%> (+5.89%) ⬆️

@2uasimojo 2uasimojo force-pushed the HIVE-1058 branch 2 times, most recently from 22036cc to f2fc7c0 Compare August 6, 2021 23:03
@2uasimojo (Member Author)

Updated.

This is still stacked on #1474, which is now stacked on #1489, so I'll keep the hold. Review just the last commit.

@2uasimojo (Member Author)

/test e2e

}

func calculatePoolVersion(clp *hivev1.ClusterPool) string {
ba := []byte{}

Contributor

since we store the hashes in the annotation, what is the expected max length of the hash? the append here makes it seem like it grows with more fields we include?

@2uasimojo (Member Author)

Correct. Each individual hash is 64 bits == 16 hex chars, so right now the value is 64c. And yes, it would grow if we extended it to include more fields. Fortunately for us, there's effectively no limit to the length of an annotation value :)

(One thing that's sort of neat about concatenating the hashes of the individual fields is that you can tell, with a little bit of work, which field(s) changed. Not that I think we should advertise this as a feature or anything, but it may come in handy some day.)

Note that if we ever did decide to add new fields, installing that version of the code would spuriously invalidate the previous CDs no matter what formatting choices we make, now or then. So if we decide we want to include a dozen fields and we feel like a 192c hash is just too much and want to do something like use a hash-of-hashes instead, we can switch over at that time and it wouldn't be any worse.

Contributor

Fortunately for us, there's effectively no limit to the length of an annotation value :)

Even though there is no hard limit, I would expect our hash to be capped at a constant length like most hashes. A hashed value that keeps increasing in length just looks odd and might create confusion in the future, imo.

@2uasimojo (Member Author)

Changed to hash the hashes, which fixes the length at 16c.

(ed. Notwithstanding what I said above, anyone confused about the actual value of the string shouldn't be looking at the string. It's supposed to be an opaque value -- again like kube's resourceVersion.)
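
Roughly, the hash-of-hashes version would look like this (field list per the PR description; deephash.Hash's []byte return value is assumed from the library docs, and the real calculatePoolVersion may differ in detail):

package sketch

import (
	"fmt"

	"github.com/davegardnerisme/deephash"

	hivev1 "github.com/openshift/hive/apis/hive/v1"
)

// calculatePoolVersion hashes each tracked field (64 bits apiece), then hashes
// the concatenation so the result stays a fixed 16 hex characters no matter
// how many fields get added later.
func calculatePoolVersion(clp *hivev1.ClusterPool) string {
	ba := []byte{}
	ba = append(ba, deephash.Hash(clp.Spec.Platform)...)
	ba = append(ba, deephash.Hash(clp.Spec.BaseDomain)...)
	ba = append(ba, deephash.Hash(clp.Spec.ImageSetRef)...)
	ba = append(ba, deephash.Hash(clp.Spec.InstallConfigSecretTemplateRef)...)
	// Without this final step the value would be 4 x 16 = 64 hex chars and
	// would grow with every new field.
	return fmt.Sprintf("%x", deephash.Hash(ba))
}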

ClusterPoolCapacityAvailableCondition ClusterPoolConditionType = "CapacityAvailable"
// ClusterPoolClusterDeploymentsCurrentCondition indicates whether all unassigned (installing or ready)
// ClusterDeployments in the pool match the current configuration of the ClusterPool.
ClusterPoolClusterDeploymentsCurrentCondition ClusterPoolConditionType = "ClusterDeploymentsCurrent"

Contributor

hmm.. 2 things that came to my mind:

  • Having the condition name include ClusterDeployments seems to tie a user-facing thing to the internal implementation of how clusters are created; if something else provides clusters someday, it might be difficult to move away from this name.
  • The Current suffix does not immediately give the idea that this condition lets us see whether the clusters we have are using the latest desired state. At least imo.
    🤔

@2uasimojo (Member Author), Aug 11, 2021

"ClusterPools manage ClusterDeployments" is not an internal implementation detail at all. It's literally the first sentence in the doc. Maybe I'm misunderstanding your comment.

Current suffix does not immediately give the idea

"Current" in the sense of "Up to date" -- see definition #4. I'm open to suggestions here, noting that this variable name is already pretty long and mealy. ClusterPoolOwnedUnclaimedClusterDeploymentsInSyncWithPoolSpecCondition may be more precise, but...

Contributor

personally my preference is not to include ClusterDeployments in the condition name.. so the best I could come up with was AllClustersCurrent

@2uasimojo (Member Author)

Saying that the clusters are current implies a lot more than we're actually verifying. And the word "All" confuses the fact that we're only validating unclaimed CDs.

But I've changed it per your suggestion.
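
Presumably the declaration then ends up looking something like the following (the exact final identifier isn't shown in this thread, so treat it as an assumption):

// ClusterPoolAllClustersCurrentCondition indicates whether all unassigned
// (installing or ready) ClusterDeployments in the pool match the current
// configuration of the ClusterPool.
ClusterPoolAllClustersCurrentCondition ClusterPoolConditionType = "AllClustersCurrent"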

@2uasimojo (Member Author) left a comment

Thanks for the review @abhinavdahiya!

@2uasimojo (Member Author)

This is currently stacked on #1474.

/hold

until that merges.

#1474 is merged. Rebased.

/hold cancel

@openshift-ci openshift-ci bot removed the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Aug 11, 2021
@2uasimojo (Member Author)

/test e2e

2 similar comments
@2uasimojo (Member Author)

/test e2e

@2uasimojo (Member Author)

/test e2e

@2uasimojo 2uasimojo force-pushed the HIVE-1058 branch 2 times, most recently from 6b77d52 to 3820c30 Compare August 12, 2021 16:46
@2uasimojo (Member Author)

Updated (forgot to make vendor 🙄).

I don't know wtf is wrong with e2e 😠

@2uasimojo (Member Author)

gdi, if I had a nickel for every time I updated the vendor type file instead of the base one...

@abhinavdahiya (Contributor)

/lgtm

@openshift-ci openshift-ci bot added the lgtm Indicates that a PR is ready to be merged. label Aug 12, 2021

openshift-ci bot commented Aug 12, 2021

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: 2uasimojo, abhinavdahiya

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:
  • OWNERS [2uasimojo,abhinavdahiya]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-ci openshift-ci bot merged commit 745052a into openshift:master Aug 12, 2021
@2uasimojo 2uasimojo deleted the HIVE-1058 branch August 12, 2021 22:25
2uasimojo added a commit to 2uasimojo/hive that referenced this pull request Aug 24, 2021
3dc3c3c / openshift#1484 added setting and discovery of a version marker for
ClusterPool and ClusterDeployment so we could tell whether they match.
This commit goes a step further and uses that information to replace
stale CDs such that the pool will eventually be consistent.

The algorithm we use is as follows:
If the pool is in steady state -- i.e. we're at capacity and all
unclaimed CDs are finished installing -- delete *one* stale CD. This
triggers another reconcile wherein the deleted CD is replaced. We're no
longer in steady state until (at least) that CD finishes installing,
whereupon we're eligible to repeat.

Reasoning:
- We don't want to immediately trash all stale CDs, as this could result
in significant downtime following a pool edit.
- We don't want to exceed capacity (size or maxSize) to do this
rotation, so we go "down" instead of "up".
- It would have been nice to just wait for the *replacement* CDs to
finish installing; but that would have entailed tracking those somehow
as being distinct from CDs added through other code paths. That could
get tricky, especially across multiple pool edits.

TODO:
- As written, we'll even wait for *stale* CDs to finish installing
before we start deleting. We should figure out a way to discount those
when checking whether we're eligible for a deletion.
- Consider exposing a knob allowing the consumer to tune how
aggressively stale CDs are replaced.

HIVE-1058
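
As a sketch, the steady-state check plus single deletion described above could be wired up roughly like this (all names, the annotation key, and the helper shape are illustrative, not the actual hive implementation):

package sketch

import (
	"context"

	hivev1 "github.com/openshift/hive/apis/hive/v1"
	"sigs.k8s.io/controller-runtime/pkg/client"
)

// poolSpecHashAnnotation is an assumed key; the thread above only confirms the
// suffix "cluster-pool-spec-hash".
const poolSpecHashAnnotation = "hive.openshift.io/cluster-pool-spec-hash"

// replaceOneStaleCD deletes at most one stale unclaimed ClusterDeployment, and
// only when the pool is at steady state (at capacity with everything
// installed), so capacity is never exceeded and churn is limited to one
// cluster at a time.
func replaceOneStaleCD(ctx context.Context, c client.Client, pool *hivev1.ClusterPool, unclaimed []*hivev1.ClusterDeployment, poolVersion string) error {
	if len(unclaimed) < int(pool.Spec.Size) {
		return nil // below capacity: a replacement is already on its way
	}
	for _, cd := range unclaimed {
		if !cd.Spec.Installed {
			return nil // something is still installing: not steady state yet
		}
	}
	for _, cd := range unclaimed {
		if cd.Annotations[poolSpecHashAnnotation] != poolVersion {
			return c.Delete(ctx, cd) // delete exactly one; the next reconcile replaces it
		}
	}
	return nil
}
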
2uasimojo added a commit to 2uasimojo/hive that referenced this pull request Aug 24, 2021
2uasimojo added a commit to 2uasimojo/hive that referenced this pull request Aug 26, 2021
2uasimojo added a commit to 2uasimojo/hive that referenced this pull request Aug 30, 2021
2uasimojo added a commit to 2uasimojo/hive that referenced this pull request Aug 30, 2021