Conversation

@r4f4
Contributor

@r4f4 r4f4 commented Dec 8, 2022

When we moved to using Image Galleries in Azure/IPI, we also changed the
location that MAO uses for the rhcos bootimages. These changes update
our ARM templates so that the bootimages are picked from the right
location. It fixes the following error when scaling compute nodes:

Error Message: failed to reconcile machine "maxu-upi2-gc7n8-worker-eastus3-68gdx": failed to create vm maxu-upi2-gc7n8-worker-eastus3-68gdx: failure sending request for machine maxu-upi2-gc7n8-worker-eastus3-68gdx: cannot create vm: compute.VirtualMachinesClient#CreateOrUpdate: Failure sending request: StatusCode=404 -- Original Error: Code="GalleryImageNotFound" Message=""The gallery image /subscriptions/$sub_id/resourceGroups/$resource_group/providers/Microsoft.Compute/galleries/$gallery_name/images/maxu-upi2-gc7n8-gen2/versions/412.86.20220930 is not available in eastus region. Please contact image owner to replicate to this region, or change your requested region."" Target="imageReference"

It is also one step closer to getting Azure UPI for aarch64.
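
To make the failing reference easier to read, here is a minimal Python sketch that splits a gallery image version ID, like the one in the error above, into its parts. The helper name `parse_gallery_image_id` and the `GalleryImageVersion` type are illustrative assumptions and are not part of the installer or MAO; the path layout is taken from the error message itself.

```python
import re
from typing import NamedTuple


class GalleryImageVersion(NamedTuple):
    """Components of a gallery image version resource ID (hypothetical helper type)."""
    subscription: str
    resource_group: str
    gallery: str
    image: str
    version: str


# Path layout copied from the GalleryImageNotFound message in the description.
_GALLERY_IMAGE_ID = re.compile(
    r"^/subscriptions/(?P<subscription>[^/]+)"
    r"/resourceGroups/(?P<resource_group>[^/]+)"
    r"/providers/Microsoft\.Compute/galleries/(?P<gallery>[^/]+)"
    r"/images/(?P<image>[^/]+)"
    r"/versions/(?P<version>[^/]+)$",
    re.IGNORECASE,
)


def parse_gallery_image_id(resource_id: str) -> GalleryImageVersion:
    """Split a gallery image version ID into its named components."""
    match = _GALLERY_IMAGE_ID.match(resource_id)
    if match is None:
        raise ValueError(f"not a gallery image version ID: {resource_id!r}")
    return GalleryImageVersion(**match.groupdict())


if __name__ == "__main__":
    rid = (
        "/subscriptions/$sub_id/resourceGroups/$resource_group"
        "/providers/Microsoft.Compute/galleries/$gallery_name"
        "/images/maxu-upi2-gc7n8-gen2/versions/412.86.20220930"
    )
    print(parse_gallery_image_id(rid))
```

Seeing the image definition (`maxu-upi2-gc7n8-gen2`) and version separately makes it easier to compare what MAO requests against what the UPI templates actually created.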

@openshift-ci-robot openshift-ci-robot added the jira/valid-bug and bugzilla/valid-bug labels Dec 8, 2022
@openshift-ci-robot
Contributor

@r4f4: This pull request references Jira Issue OCPBUGS-4654, which is valid. The bug has been moved to the POST state.

3 validation(s) were run on this bug
  • bug is open, matching expected state (open)
  • bug target version (4.13.0) matches configured target version for branch (4.13.0)
  • bug is in the state New, which is one of the valid states (NEW, ASSIGNED, POST)

Requesting review from QA contact:
/cc @gpei

The bug has been updated to refer to the pull request using the external bug tracker.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@r4f4
Contributor Author

r4f4 commented Dec 8, 2022

/cc @Prashanth684

@r4f4
Contributor Author

r4f4 commented Dec 8, 2022

CI Azure UPI jobs are expected to fail until openshift/release#34678 is merged.

@gpei
Contributor

gpei commented Dec 9, 2022

/jira refresh

@openshift-ci-robot
Contributor

@gpei: This pull request references Jira Issue OCPBUGS-4654, which is valid.

3 validation(s) were run on this bug
  • bug is open, matching expected state (open)
  • bug target version (4.13.0) matches configured target version for branch (4.13.0)
  • bug is in the state POST, which is one of the valid states (NEW, ASSIGNED, POST)

Requesting review from QA contact:
/cc @MayXuQQ

@openshift-ci openshift-ci bot requested a review from MayXuQQ December 9, 2022 03:16
@r4f4
Contributor Author

r4f4 commented Dec 12, 2022

/hold
We need openshift/release#34678 merged first so this can be tested in CI

@openshift-ci openshift-ci bot added the do-not-merge/hold label (indicates that a PR should not merge because someone has issued a /hold command) Dec 12, 2022
Contributor

Could we just use the Marketplace image?

Contributor Author

Would that work for aarch64/Arm64?

$: az vm image list --all --offer rh-ocp-worker --publisher redhat | jq '.[] | .architecture' | sort | uniq
"x64"
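
For reference, the same check can be sketched in Python. This assumes the JSON printed by `az vm image list` is a list of objects with an `architecture` field, as the `jq` filter above implies; the function name `unique_architectures` and the sample input are hypothetical.

```python
import json
from typing import List


def unique_architectures(images_json: str) -> List[str]:
    """Return the sorted, de-duplicated architecture values from the image list JSON."""
    images = json.loads(images_json)
    # Mirrors `jq '.[] | .architecture' | sort | uniq`, skipping entries without the field.
    return sorted({img["architecture"] for img in images if img.get("architecture")})


if __name__ == "__main__":
    # Hypothetical sample shaped like the CLI output; real entries carry more fields.
    sample = json.dumps([
        {"offer": "rh-ocp-worker", "architecture": "x64"},
        {"offer": "rh-ocp-worker", "architecture": "x64"},
    ])
    print(unique_architectures(sample))
```

If an Arm64 image were published under the offer, it would appear as an additional entry in the returned list.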

Contributor

@Prashanth684 Prashanth684 Dec 14, 2022

Marketplace images are not available for arm64, only for x86.

@r4f4 r4f4 force-pushed the azure-upi-img-gallery branch from 3c0dcb6 to 2b2b175 December 14, 2022 17:34
r4f4 added a commit to r4f4/release that referenced this pull request Dec 14, 2022
This change adds support for both Managed Images and Image Galleries in
the UPI workflow. Image Galleries support was added to the
openshift-installer to enable aarch64 IPI deployments, and
openshift/installer#6684 will soon enable it for
UPI as well.
Contributor

How about using the previous image name "[concat(parameters('baseName'), '-image')]"?

Contributor Author

@r4f4
Contributor Author

r4f4 commented Dec 16, 2022

/test golint

openshift-merge-robot pushed a commit to openshift/release that referenced this pull request Dec 16, 2022
This change adds support for both Managed Images and Image Galleries in
the UPI workflow. Image Galleries support was added to the
openshift-installer to enable aarch64 IPI deployments, and
openshift/installer#6684 will soon enable it for
UPI as well.
@r4f4
Contributor Author

r4f4 commented Dec 16, 2022

/test e2e-azure-ovn-upi

@r4f4
Contributor Author

r4f4 commented Dec 19, 2022

/hold cancel

@openshift-ci-robot
Contributor

/retest-required

Remaining retests: 0 against base HEAD 26a3caa and 2 for PR HEAD 12292f0 in total

@openshift-ci-robot
Contributor

/retest-required

Remaining retests: 0 against base HEAD 21d9012 and 1 for PR HEAD 12292f0 in total

@openshift-ci-robot
Contributor

/retest-required

Remaining retests: 0 against base HEAD c5bb817 and 0 for PR HEAD 12292f0 in total

@openshift-ci-robot
Contributor

/hold

Revision 12292f0 was retested 3 times: holding

@openshift-ci openshift-ci bot added the do-not-merge/hold label Jan 14, 2023
@r4f4
Contributor Author

r4f4 commented Jan 16, 2023

/hold cancel

@openshift-ci openshift-ci bot removed the do-not-merge/hold label Jan 16, 2023
@r4f4
Contributor Author

r4f4 commented Jan 16, 2023

/skip

@r4f4
Contributor Author

r4f4 commented Jan 16, 2023

/retest-required

@openshift-ci-robot
Contributor

/retest-required

Remaining retests: 0 against base HEAD c5bb817 and 2 for PR HEAD 12292f0 in total

@r4f4
Contributor Author

r4f4 commented Jan 17, 2023

UPI job will fail until openshift/release#34984 merges.

@openshift-ci-robot
Contributor

/retest-required

Remaining retests: 0 against base HEAD ff68351 and 1 for PR HEAD 12292f0 in total

@openshift-ci-robot
Contributor

/retest-required

Remaining retests: 0 against base HEAD 68089b6 and 0 for PR HEAD 12292f0 in total

@openshift-ci-robot
Contributor

/hold

Revision 12292f0 was retested 3 times: holding

@openshift-ci openshift-ci bot added the do-not-merge/hold label Jan 18, 2023
@r4f4
Contributor Author

r4f4 commented Jan 18, 2023

The same tests

Failing tests:
[sig-network-edge][Conformance][Area:Networking][Feature:Router] The HAProxy router should pass the gRPC interoperability tests [apigroup:config.openshift.io][apigroup:route.openshift.io][apigroup:operator.openshift.io] [Suite:openshift/conformance/parallel/minimal]
[sig-network-edge][Conformance][Area:Networking][Feature:Router][apigroup:route.openshift.io][apigroup:config.openshift.io] The HAProxy router should pass the http2 tests [apigroup:image.openshift.io][apigroup:operator.openshift.io] [Suite:openshift/conformance/parallel/minimal]

are also perma-failing in the periodic UPI job, so they were not introduced by this PR. There is no point in continuing to retest.

/override ci/prow/e2e-azure-ovn-upi
/hold cancel

@openshift-ci openshift-ci bot removed the do-not-merge/hold label Jan 18, 2023
@openshift-ci
Contributor

openshift-ci bot commented Jan 18, 2023

@r4f4: Overrode contexts on behalf of r4f4: ci/prow/e2e-azure-ovn-upi

@openshift-ci-robot
Contributor

/retest-required

Remaining retests: 0 against base HEAD aa31e26 and 2 for PR HEAD 12292f0 in total

@r4f4
Contributor Author

r4f4 commented Jan 18, 2023

/override ci/prow/e2e-azure-ovn-upi
same 2 e2e tests failing

@openshift-ci
Contributor

openshift-ci bot commented Jan 18, 2023

@r4f4: Overrode contexts on behalf of r4f4: ci/prow/e2e-azure-ovn-upi

@openshift-ci
Contributor

openshift-ci bot commented Jan 18, 2023

@r4f4: The following tests failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

| Test name | Commit | Details | Required | Rerun command |
| --- | --- | --- | --- | --- |
| ci/prow/e2e-aws-ovn-upgrade | 12292f0 | link | false | /test e2e-aws-ovn-upgrade |
| ci/prow/e2e-aws-ovn-disruptive | 12292f0 | link | false | /test e2e-aws-ovn-disruptive |
| ci/prow/e2e-azure-ovn-shared-vpc | 12292f0 | link | false | /test e2e-azure-ovn-shared-vpc |
| ci/prow/e2e-azure-ovn-resourcegroup | 12292f0 | link | false | /test e2e-azure-ovn-resourcegroup |
| ci/prow/e2e-metal-assisted | 12292f0 | link | false | /test e2e-metal-assisted |
| ci/prow/e2e-aws-ovn-workers-rhel8 | 12292f0 | link | false | /test e2e-aws-ovn-workers-rhel8 |
| ci/prow/e2e-azurestack-upi | 12292f0 | link | false | /test e2e-azurestack-upi |
| ci/prow/e2e-azurestack | 12292f0 | link | false | /test e2e-azurestack |

@r4f4
Contributor Author

r4f4 commented Jan 18, 2023

/override ci/prow/e2e-azure-ovn
This change only touches the UPI ARM files for Azure, so IPI is not affected.

@openshift-ci
Contributor

openshift-ci bot commented Jan 18, 2023

@r4f4: Overrode contexts on behalf of r4f4: ci/prow/e2e-azure-ovn

@openshift-merge-robot openshift-merge-robot merged commit 9aed759 into openshift:master Jan 18, 2023
@openshift-ci-robot
Contributor

@r4f4: All pull requests linked via external trackers have merged:

Jira Issue OCPBUGS-4654 has been moved to the MODIFIED state.

@jinyunma
Contributor

/cherry-pick release-4.12

@openshift-cherrypick-robot

@jinyunma: new pull request created: #7054

Labels

  • approved: Indicates a PR has been approved by an approver from all required OWNERS files.
  • bugzilla/valid-bug: Indicates that a referenced Bugzilla bug is valid for the branch this PR is targeting.
  • jira/valid-bug: Indicates that a referenced Jira bug is valid for the branch this PR is targeting.
  • lgtm: Indicates that a PR is ready to be merged.
  • qe-approved: Signifies that QE has signed off on this PR.