-
Notifications
You must be signed in to change notification settings - Fork 1.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
NodeRef controller #1011
NodeRef controller #1011
Conversation
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: vincepri The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
69a9991
to
ba2230a
Compare
ba2230a
to
b17478c
Compare
b17478c
to
9153851
Compare
@ncdc ptal |
78053dd
to
4229b86
Compare
25ddda8
to
38199c8
Compare
Definitely add a release note to this PR please. |
38199c8
to
d2581ff
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is there a doc we can update to indicate CAPI now has read access to all secrets across all namespaces (and why), along with guidance on how to scope it narrower? |
d2581ff
to
fe09ec0
Compare
1cbe941
to
0f5cdf1
Compare
Could we please update the PR desc with the purpose of this PR? |
@enxebre Done, thanks for keeping me honest :) |
4464292
to
152c1c5
Compare
152c1c5
to
73e1d4f
Compare
73e1d4f
to
a5b733a
Compare
} | ||
|
||
// Update Machine. | ||
machine.Status.NodeRef = nodeRef |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There are currently workflows related to the existing Node and Machine controllers that will attempt to perform actions when the nodeRef is set.
- Node deletion:
cluster-api/pkg/controller/machine/machine_controller.go
Lines 182 to 188 in bd0628f
if m.Status.NodeRef != nil { klog.Infof("Deleting node %q for machine %q", m.Status.NodeRef.Name, m.Name) if err := r.deleteNode(ctx, m.Status.NodeRef.Name); err != nil { klog.Errorf("Error deleting node %q for machine %q", name, err) return reconcile.Result{}, err } } - isDeleteAllowed:
cluster-api/pkg/controller/machine/machine_controller.go
Lines 256 to 275 in bd0628f
func (r *ReconcileMachine) isDeleteAllowed(machine *clusterv1.Machine) bool { if r.nodeName == "" || machine.Status.NodeRef == nil { return true } if machine.Status.NodeRef.Name != r.nodeName { return true } node := &corev1.Node{} if err := r.Client.Get(context.Background(), client.ObjectKey{Name: r.nodeName}, node); err != nil { klog.Infof("Failed to determine if controller's node %q is associated with machine %q: %v", r.nodeName, machine.Name, err) return true } // When the UID of the machine's node reference and this controller's actual node match then then the request is to // delete the machine this machine-controller is running on. Return false to not allow machine controller to delete its // own machine. return node.UID != machine.Status.NodeRef.UID } - MachineSet status:
cluster-api/pkg/controller/machineset/status.go
Lines 58 to 63 in e325dfa
if noderefutil.IsNodeReady(node) { readyReplicasCount++ if noderefutil.IsNodeAvailable(node, ms.Spec.MinReadySeconds, metav1.Now()) { availableReplicasCount++ } }
Have these been tested with these changes?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Those all fail as you’d expect. I’ll follow up with other PRs to tackle them separately
|
||
// Equals returns true if both the CloudProvider and ID match. | ||
func (p *ProviderID) Equals(o *ProviderID) bool { | ||
return p.CloudProvider() == o.CloudProvider() && p.ID() == o.ID() |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Wouldn't this result in aws://// being != aws:////?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think we disallow empty IDs for validation, but that should work if you only test Equals
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
oops, markdown killed my comparison... I meant aws:////i-xxxxxx
being != aws:///us-east-1a/i-xxxxxx
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ID() will only return the last part of the ProviderID. That comparison you mentioned should be tested here https://github.com/kubernetes-sigs/cluster-api/pull/1011/files#diff-a5350fd6ed4fabde029f67abad7b17a5R110
Signed-off-by: Vince Prignano <[email protected]>
a5b733a
to
99c24e4
Compare
/test pull-cluster-api-test |
/lgtm |
Signed-off-by: Vince Prignano <[email protected]> (cherry picked from commit 9e89546)
* Remove duplicate checks for deletionTimestamp (#987) Signed-off-by: Vince Prignano <[email protected]> (cherry picked from commit aacb0c6) * Rename pkg/controller/controller (#998) Signed-off-by: Vince Prignano <[email protected]> (cherry picked from commit eafafe8) * Add remote/util.go helpers to work with KubeConfig Secrets (#1004) Signed-off-by: Vince Prignano <[email protected]> (cherry picked from commit d48025d) * Add remote.ClusterClient to access remote workload clusters (#1006) Signed-off-by: Vince Prignano <[email protected]> (cherry picked from commit 0b25546) * Add events to MachineSet operations (#1012) Add events to the following MachineSet operations - Adopt Machine - Create Machine - Delete Machine (cherry picked from commit b0170a0) * Add patch to events rbac (#1021) Signed-off-by: Vince Prignano <[email protected]> (cherry picked from commit 7365154) * Update some bazel dependencies (#1019) (cherry picked from commit 77836d8) * Update boilerplate check and generated files (#1010) Signed-off-by: Vince Prignano <[email protected]> (cherry picked from commit 0b215f4) * Add CLUSTER_API_KUBECONFIG_READY_TIMEOUT to clusterctl (#1026) Signed-off-by: Vince Prignano <[email protected]> (cherry picked from commit 9cd85f7) * Add events to MachineDeployment operations (#1014) (cherry picked from commit a5e5fdb) * Update controller-runtime and controller-tools (#1032) (cherry picked from commit da61280) * Support for wrapped RequeueAfterError/interface (#1020) This patch adds support for wrapped "RequeueAfterError" errors as well as a new interface for supporting custom "RequeueAfterError" errors. (cherry picked from commit bd0628f) * Use klog in manager and setup controller-runtime logger (#1022) Signed-off-by: Vince Prignano <[email protected]> (cherry picked from commit 0ebf076) * Bazel updates (#1043) - Remove use of deprecated bazel attribute label single_file - Add bazel version checking - Include go_rules fix for cross-compiling darwin binaries (cherry picked from commit 2123eed) * clean up shell scripts in cluster-api (#1045) (cherry picked from commit d63d4be) * Update k8s/code-generator to latest release-1.13 (#1048) Signed-off-by: Vince Prignano <[email protected]> (cherry picked from commit c2faf86) * downgrade log of util.ExecuteCommand (#975) As this error E0531 is not that ueful and no need to be mark this as 'E', use Info level should be fine as this won't affect return value. (cherry picked from commit 97f25c9) * Make ClusterNetwork ClusterNetworkingConfig field optional within (#919) Cluster resources. This is useful for Managed Control Planes and Bare Metal where networking configuration options may be limited. (cherry picked from commit 9e1eb56) * NodeRef controller (#1011) Signed-off-by: Vince Prignano <[email protected]> (cherry picked from commit 9e89546) * Update generated files, add workaround for controller-tools (#1050) Signed-off-by: Vince Prignano <[email protected]> (cherry picked from commit 1c10a3f) * Update controller-tools to 0.1.11 (#1053) Signed-off-by: Vince Prignano <[email protected]> (cherry picked from commit 1cb3deb) * Update dep check (#1056) Signed-off-by: Vince Prignano <[email protected]> (cherry picked from commit aecec29) * Fix register bootstrap options in alpha phases (#1064) Fixes issue: #1063 (cherry picked from commit 8141304) * Use remote NodeRef in Machine and MachineSet controllers (#1052) Signed-off-by: Vince Prignano <[email protected]> (cherry picked from commit 1eaf864) * Bazel fixes Signed-off-by: Andy Goldstein <[email protected]>
return nil, err | ||
} | ||
|
||
for _, node := range nodeList.Items { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
In a follow up we could use an indexer here and filter the request by node.Spec.ProviderID
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@enxebre if you'd like to see that happen, please file an issue. Otherwise, I fear this comment will got lost. Thanks!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
sure, I'll also put a PR
Signed-off-by: Vince Prignano [email protected]
What this PR does / why we need it:
This PR adds a new controller to set
Machine.Status.NodeRef
. The primary goal of this work is to support controllers running in a management cluster. The PR is based on parts of #997 proposal while being compatible with v1alpha1.Which issue(s) this PR fixes (optional, in
fixes #<issue number>(, fixes #<issue_number>, ...)
format, will close the issue(s) when PR gets merged):Fixes #520
Special notes for your reviewer:
Please confirm that if this PR changes any image versions, then that's the sole change this PR makes.
Release note: