
Conversation

@patrickdillon (Contributor):

Create an enhancement to outline work for installing and supporting OpenShift on Azure Stack Hub.

cc @openshift/openshift-team-installer
cc @staebler

/label priority/important-soon

@openshift-ci-robot:

@patrickdillon: The label(s) /label priority/important-soon cannot be applied. These labels are supported: platform/aws, platform/azure, platform/baremetal, platform/google, platform/libvirt, platform/openstack, ga, tide/merge-method-merge, tide/merge-method-rebase, tide/merge-method-squash, px-approved, docs-approved, qe-approved, downstream-change-needed


In response to this:

> Create an enhancement to outline work for installing and supporting OpenShift on Azure Stack Hub.
>
> cc @openshift/openshift-team-installer
> cc @staebler
>
> /label priority/important-soon

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.


The Installer will need to construct this json configuration file from user input and include the file as part of the cloud-provider-config configmap. The file will also need to be present on nodes for the kubelet. `AZURE_ENVIRONMENT_FILEPATH` will need to be set programmatically for ASH on the kubelet, presumably through the kubelet's systemd unit [(see open questions)](#open-questions).

Operators will set the `AZURE_ENVIRONMENT_FILEPATH` and mount the endpoints JSON file from the cloud-provider-config configmap. All operators using the Azure SDK will need to do this.
Contributor:

Is there a way for operators to configure the Azure client directly with the custom endpoints rather than relying on a JSON file? It would be cumbersome for an operator with a static manifest to do this with mounts, given that the mounted ConfigMap must be copied over to the namespace. The mounting option is feasible for controllers that the operator starts, but not easily for the operators themselves.

Contributor:

Following up on this: you can definitely set the endpoint programmatically when creating the client. For example, see the following:
https://github.com/staebler/cluster-ingress-operator/blob/3bb8ed560892ae648806e6133d18803da9ec341b/pkg/dns/azure/client/client.go#L88
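
For reference, a minimal sketch of that pattern with go-autorest and a track-1 SDK client, assuming the environment is loaded from the ASH endpoints file; the package path, API version, and helper name here are illustrative, not the exact code behind the link:

```go
// Sketch only: the general track-1 Azure SDK pattern of wiring a client to
// custom Azure Stack Hub endpoints instead of the public-cloud defaults.
// The dns package/API version and function name are illustrative assumptions.
package azclient

import (
	"github.com/Azure/azure-sdk-for-go/services/dns/mgmt/2018-05-01/dns"
	"github.com/Azure/go-autorest/autorest"
	"github.com/Azure/go-autorest/autorest/adal"
	"github.com/Azure/go-autorest/autorest/azure"
)

// newZonesClient builds a DNS zones client against ASH endpoints loaded from
// the JSON environment file rather than the built-in cloud definitions.
func newZonesClient(endpointsFile, tenantID, clientID, clientSecret, subscriptionID string) (dns.ZonesClient, error) {
	env, err := azure.EnvironmentFromFile(endpointsFile)
	if err != nil {
		return dns.ZonesClient{}, err
	}

	// Authenticate against the stamp's Active Directory endpoint and ARM audience.
	oauthCfg, err := adal.NewOAuthConfig(env.ActiveDirectoryEndpoint, tenantID)
	if err != nil {
		return dns.ZonesClient{}, err
	}
	token, err := adal.NewServicePrincipalToken(*oauthCfg, clientID, clientSecret, env.TokenAudience)
	if err != nil {
		return dns.ZonesClient{}, err
	}

	// Point the client at the ASH Resource Manager endpoint instead of the default BaseURI.
	client := dns.NewZonesClientWithBaseURI(env.ResourceManagerEndpoint, subscriptionID)
	client.Authorizer = autorest.NewBearerAuthorizer(token)
	return client, nil
}
```

The same `New*WithBaseURI` constructors are generated for the other track-1 management clients, so the pattern is not specific to DNS.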

@patrickdillon (Contributor, Author):

@staebler I have updated this and I think these revisions make more sense.

ASH endpoints are user-provided and, therefore, the Azure SDK treats ASH endpoints differently from the already-known endpoints of other Azure environments. When the cloud environment is set to
`AZURESTACKCLOUD`, the SDK expects the environment variable `AZURE_ENVIRONMENT_FILEPATH` to point to a [JSON configuration file](https://kubernetes-sigs.github.io/cloud-provider-azure/install/configs/#azure-stack-configuration), which is [typically located at `/etc/kubernetes/azurestackcloud.json`](https://github.com/kubernetes-sigs/cloud-provider-azure/issues/151).

The Installer will need to construct this json configuration file from user input and include the file as part of the cloud-provider-config configmap. The file will also need to be present on nodes for the kubelet. `AZURE_ENVIRONMENT_FILEPATH` will need to be set programmatically for ASH on the kubelet, presumably through the kubelet's systemd unit [(see open questions)](#open-questions).
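
For illustration, a rough sketch (not the installer's actual code) of how such a file could be built from user-provided endpoints using go-autorest's `Environment` type, whose JSON tags match what `AZURE_ENVIRONMENT_FILEPATH` consumers expect; the field selection and helper name are assumptions:

```go
// Sketch of the installer-side piece: render the AZURESTACKCLOUD environment file
// from user-provided endpoints so it can be placed in the cloud-provider-config
// ConfigMap and written to /etc/kubernetes/azurestackcloud.json on nodes.
// Field selection, defaults, and the helper name are assumptions.
package azurestack

import (
	"encoding/json"

	"github.com/Azure/go-autorest/autorest/azure"
)

// endpointsJSON produces a go-autorest Environment that the Azure SDK can load
// via AZURE_ENVIRONMENT_FILEPATH when the cloud name is AZURESTACKCLOUD.
func endpointsJSON(armEndpoint, adEndpoint, storageSuffix, keyVaultSuffix string) ([]byte, error) {
	env := azure.Environment{
		Name:                    "AzureStackCloud",
		ResourceManagerEndpoint: armEndpoint,   // e.g. https://management.<region>.<stamp-fqdn>
		ActiveDirectoryEndpoint: adEndpoint,    // AAD or AD FS endpoint for the stamp
		StorageEndpointSuffix:   storageSuffix, // e.g. <region>.<stamp-fqdn>
		KeyVaultDNSSuffix:       keyVaultSuffix,
		TokenAudience:           armEndpoint, // simplification; the real audience comes from the ARM metadata endpoint
	}
	return json.MarshalIndent(env, "", "  ")
}
```

The kubelet and operators would then read this same payload, either from `/etc/kubernetes/azurestackcloud.json` on the node or from a mounted copy of the ConfigMap.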
Contributor:

Where will this user input come from? Is the expectation that they will have already set up an `AZURE_ENVIRONMENT_FILEPATH` on the install machine, given that the installer will presumably need to know the endpoints for its own access to the Azure API?

@patrickdillon (Contributor, Author):

I think the update should clarify this.

### Open Questions

1. We need to explore how to pass the JSON endpoints file to the Kubelet:
   1. Can the endpoints file be simply passed from the installer through ignition? Does this [need to be set through the kubelet unit's `EnvironmentFile`s](https://github.com/openshift/machine-config-operator/blob/master/templates/master/01-master-kubelet/_base/units/kubelet.service.yaml#L19)? If so, which & how (ignition or MCO)?
1. Should ASH produce a cluster DNS manifest or follow the baremetal approach and not create one?
1. Suggestions for determining all operators that need to adapt for the ASH config.

Contributor:

Could this be MachineConfigs that the installer adds for master and workers?

Does the bootstrap machine need this as well? I am assuming not since you have been able to run kubelet successfully on the bootstrap machine with the Azure platform.
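
To make the ignition/MachineConfig option above concrete, a hypothetical sketch of the two pieces such a config would carry: the endpoints file path and a kubelet systemd drop-in exporting `AZURE_ENVIRONMENT_FILEPATH`. Paths and names are assumptions, not settled design:

```go
// Hypothetical sketch of what a MachineConfig (or bootstrap ignition) could lay down
// so the kubelet resolves AZURESTACKCLOUD endpoints: the endpoints file itself plus a
// systemd drop-in exporting AZURE_ENVIRONMENT_FILEPATH. Paths and names are assumptions.
package azurestack

const (
	// Path used by the cloud-provider-azure docs for the Azure Stack endpoints file.
	azureStackCloudFile = "/etc/kubernetes/azurestackcloud.json"

	// Drop-in for kubelet.service (e.g. /etc/systemd/system/kubelet.service.d/20-azurestack.conf),
	// which avoids editing the base unit or its EnvironmentFile list shipped by the MCO templates.
	kubeletAzureStackDropIn = `[Service]
Environment="AZURE_ENVIRONMENT_FILEPATH=` + azureStackCloudFile + `"
`
)
```

Whether this content lands via ignition at install time or via a MachineConfig owned by the MCO is exactly the open question above.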

@patrickdillon force-pushed the azurestack branch 2 times, most recently from 2cea816 to 8f1b8b4 (March 31, 2021).
@patrickdillon (Contributor, Author):

/retest

@patrickdillon (Contributor, Author):

@gnufied brought up CSI support for Azure Stack in openshift/kubernetes#643 (comment).

We do not have a good background on storage, so I'm glad you reached out. Some quick googling led me to this: https://github.com/kubernetes-sigs/azuredisk-csi-driver

Can we discuss what more needs to be done to add storage support? We can move it over to JIRA too, which might be more appropriate.

- requires different rhcos image
- limited instance metadata service (IMDS) for VMs
- does not support private DNS zones
- limited subset of Azure infrastructure available (ergo different Terraform provider)
Contributor:

One other aspect that may fall under this category of limited subset of infrastructure is the limited set of API versions available. We need to be careful that in the future we are cognizant of API version selection and are specific about which API versions an Azure Stack deployment must support in order to run OpenShift.

[cloud provider configmap](https://github.com/openshift/installer/blob/master/pkg/asset/manifests/cloudproviderconfig.go#L126-L141):

```go
cloudProviderEndpointsKey = "endpoints"
```
Contributor:

Suggested change: `cloudProviderEndpointsKey = "endpoints"` → `cloudProviderEndpointsKey = "endpoints.json"`
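
For context, a hedged sketch of how the endpoints payload might sit next to the existing `config` key in the cloud-provider-config ConfigMap; the function, namespace handling, and surrounding asset code are illustrative assumptions rather than the installer's actual implementation:

```go
// Hedged sketch: add the serialized endpoints file to the cloud-provider-config
// ConfigMap next to the existing "config" key so in-cluster consumers can mount it.
// The function and surrounding asset wiring are illustrative assumptions.
package manifests

import (
	corev1 "k8s.io/api/core/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
)

const cloudProviderEndpointsKey = "endpoints" // or "endpoints.json" per the suggestion above

func cloudProviderConfigMap(cloudConf, endpointsJSON string) *corev1.ConfigMap {
	cm := &corev1.ConfigMap{
		TypeMeta:   metav1.TypeMeta{APIVersion: "v1", Kind: "ConfigMap"},
		ObjectMeta: metav1.ObjectMeta{Namespace: "openshift-config", Name: "cloud-provider-config"},
		Data: map[string]string{
			"config": cloudConf, // existing cloud.conf payload
		},
	}
	if endpointsJSON != "" { // only Azure Stack Hub needs the extra key
		cm.Data[cloudProviderEndpointsKey] = endpointsJSON
	}
	return cm
}
```

Consumers would still need to mount this key and point `AZURE_ENVIRONMENT_FILEPATH` at the mounted path, as discussed in the thread above.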

Comment on lines 173 to 175
This provider lacks the ability to create a service principal for the resource group and assign a contributor role, which is required by the Ingress controller.

These actions are achievable through the CLI; if necessary, these commands could be run as a Terraform post hook.
Contributor:

Is this conflating the service principal used by the ingress operator with the user-assigned identity attached to the VMs, which is used by the kube-controller-manager? For public Azure, the terraform creates a user-assigned identity but does not create any service principals. Azure Stack does not support user-assigned identities, even through the CLI.

@patrickdillon (Contributor, Author), Apr 3, 2021:

My statement in the enhancement is confused. The issue is, as you say, the lack of user-assigned identities (not service principals), but it looks like the "main" identity is scoped to the resource group, so I don't think it's the VM identities you mention. Here is the code from IPI. The UPI doc states:

> Grant the Contributor role to the Azure identity so that the Ingress Operator can create a public IP and its load balancer.

I'm not sure I have a firm grasp on the problem. From reading the docs, it seems we need to create this identity to give operators access to create resources in the resource group. The machine API operator would probably need this as well, but perhaps it is not mentioned because these are UPI docs.

Really, what I am missing is why/how this isn't handled by the CCO. If we're using passthrough, I assume this isn't a problem at all, because those creds clearly have permissions to create resources in the resource group.

Let me know what you think and I'll update the doc accordingly.

Contributor:

This is handled by the CCO. All of the in-cluster operators use credentials that are granted by the CCO (assuming mint mode). The UPI docs are a bit misleading when they say that the Contributor role is needed by the Ingress operator. The Ingress operator is creating loadbalancer Services, but it is the kube-controller-manager that is creating the cloud resources for those Services.

In public Azure, the kube-controller-manager uses the user-assigned identity assigned to the VM. Azure Stack does not support user-assigned identities. Presumably, we could allow Azure Stack to create system-assigned identities for each VM. The installer would then need to assign the Contributor role to each system-assigned identity after each VM was created. The kubelet also uses the managed identity assigned to the VM, so the managed identity is needed for worker machines too. However, the installer does not create the worker VMs, so it cannot assign the Contributor role to the system-assigned identities for the worker VMs. As far as I am aware, the machine-api does not have a feature for assigning roles to the system-assigned identities of the VMs that the machine-api creates.

A possible solution is to add that feature to the machine-api. Another solution is to supply a Service Principal in the cloud config that the kubelet and kube-controller-manager would use when accessing the Azure API. The latter solution is what I think you are using currently in your UPI work. There may be problems with utilizing that solution long-term due to how the user would replace that Service Principal. I don't think that simply changing the Service Principal in the cloud-provider-config ConfigMap would cause the replacement Service Principal to roll out to all of the kubelets and kube-controller-managers.
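
To make the two credential modes being compared concrete, a hedged sketch of the relevant fields in the Azure cloud provider config (`azure.json`/`cloud.conf`) that the kubelet and kube-controller-manager read; values are placeholders and the exact ASH configuration is not settled here:

```go
// Hedged sketch of the credential-related knobs in the Azure cloud provider config
// read by the kubelet and kube-controller-manager. Values are placeholders.
package azurestack

// Public Azure today: rely on the managed identity attached to the VM.
const cloudConfManagedIdentity = `{
  "cloud": "AzurePublicCloud",
  "useManagedIdentityExtension": true
}`

// Azure Stack Hub workaround discussed above: embed an explicit service principal,
// accepting that rotating it means rolling new config out to every node.
const cloudConfServicePrincipal = `{
  "cloud": "AZURESTACKCLOUD",
  "useManagedIdentityExtension": false,
  "aadClientId": "<service-principal-client-id>",
  "aadClientSecret": "<service-principal-client-secret>"
}`
```

As noted above, the service-principal route raises the rotation question, since changing the secret in the cloud-provider-config ConfigMap does not automatically reach every kubelet and kube-controller-manager.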

@patrickdillon (Contributor, Author):

I get it now. The piece I was missing is that the main identity is assigned to the VMs in TF and the machinesets. Then you explained the rest; thank you for that!

If we want to manage the roles after creation, then VM extensions may provide this:
https://registry.terraform.io/providers/hashicorp/azurestack/latest/docs/resources/virtual_machine_extension

But it seems the MAO may be more involved. Yes, the UPI solution writes the cloud provider config with the service principal ID & secret. So I guess the question is: does updating the cloud-provider-config configmap in-cluster cause a new machine config to be written? And does a new machine config cause a reboot, in which case the new creds would be picked up by the kubelet?

Create an enhancement to outline work for installing and supporting OpenShift on Azure Stack Hub.
@patrickdillon (Contributor, Author):

@staebler This has been revised. PTAL

@staebler (Contributor) left a comment:

/lgtm

@openshift-ci-robot added the `lgtm` label (indicates that a PR is ready to be merged) on Apr 6, 2021.
@openshift-ci-robot:

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: staebler

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of the required OWNERS files.

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-ci-robot added the `approved` label (indicates a PR has been approved by an approver from all required OWNERS files) on Apr 6, 2021.
@openshift-merge-robot merged commit 5da7a7d into openshift:master on Apr 6, 2021.