@petr-muller
Member

  • multiarch: add scaffolding for hetero->hetero upgrade jobs
  • make update

@openshift-ci openshift-ci bot added the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Nov 4, 2022
@openshift-ci-robot
Contributor

@petr-muller: the following rehearsable tests have been affected by this change:

| Test name | Repo | Type | Reason |
| --- | --- | --- | --- |
| periodic-ci-openshift-multiarch-master-nightly-4.13-ocp-e2e-aws-ovn-heterogeneous-upgrade | N/A | periodic | Periodic changed |

Prior to this PR being merged, you will need to either run and acknowledge or opt to skip these rehearsals.

Interacting with pj-rehearse

Comment: /pj-rehearse to run up to 10 rehearsals
Comment: /pj-rehearse skip to opt-out of rehearsals
Comment: /pj-rehearse more to run up to 20 rehearsals
Comment: /pj-rehearse max to run up to 35 rehearsals

Once you are satisfied with the results of the rehearsals, comment: /pj-rehearse ack to unblock merge. When the rehearsals-ack label is present on your PR, merge will no longer be blocked by rehearsals
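
As a rough illustration of the limits listed above, the choice of command can be written as a small lookup. This is only a sketch: the helper name `pick_rehearse_command` is hypothetical and is not part of pj-rehearse; the three limits are the ones quoted in this comment.

```python
# Rehearsal limits exactly as listed in the pj-rehearse comment above.
REHEARSAL_LIMITS = [
    ("/pj-rehearse", 10),
    ("/pj-rehearse more", 20),
    ("/pj-rehearse max", 35),
]


def pick_rehearse_command(affected_jobs: int) -> str:
    """Return the smallest command whose limit covers all affected jobs.

    Hypothetical helper for illustration; pj-rehearse itself only reads
    the PR comment and does not expose such a function.
    """
    if affected_jobs < 0:
        raise ValueError("affected_jobs must be non-negative")
    for command, limit in REHEARSAL_LIMITS:
        if affected_jobs <= limit:
            return command
    # More affected jobs than any limit covers: use the largest one.
    return REHEARSAL_LIMITS[-1][0]
```

For the single affected periodic in the table above, plain `/pj-rehearse` is enough.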

@petr-muller
Member Author

/pj-rehearse

@petr-muller
Member Author

petr-muller commented Nov 4, 2022

   * failed to generate steps from config: failed to resolve release arm64-latest: failed to request latest release: server responded with 404: no release configuration exists with the requested name 

It seems we do not have 4.13 heterogeneous releases yet...

@petr-muller petr-muller force-pushed the ota-656-hetero-updates branch from 25feab5 to 855afd9 Compare November 4, 2022 14:06
@petr-muller
Member Author

/pj-rehearse

@petr-muller petr-muller force-pushed the ota-656-hetero-updates branch from 855afd9 to 86ab588 Compare November 4, 2022 14:13
@petr-muller
Member Author

/pj-rehearse

@petr-muller
Member Author

Seeing some noise in the https://steps.ci.openshift.org/reference/ipi-install-heterogeneous step:

Creating aarch64 worker MachineSet
Warning: incorrect GroupVersionKind for AWSMachineProviderConfig object: machine.openshift.io/v1beta1, Kind=AWSMachineProviderConfig
machineset.machine.openshift.io/ci-op-8lix2l21-008f2-qtw4c-worker-us-west-2a-additional created
W1104 15:12:13.138293      96 loader.go:223] Config not found: /tmp/installer/auth/kubeconfig
Error from server (Forbidden): routes.route.openshift.io "console" is forbidden: User "system:serviceaccount:ci-op-8lix2l21:ocp-e2e-aws-ovn-heterogeneous-upgrade" cannot get resource "routes" in API group "route.openshift.io" in the namespace "openshift-console"
Exiting with ret=0

@petr-muller petr-muller force-pushed the ota-656-hetero-updates branch from 86ab588 to f4c63fc Compare November 4, 2022 18:11
@petr-muller
Member Author

/pj-rehearse

@petr-muller
Member Author

  1. nightly-4.11-ocp-e2e-aws-heterogeneous
    a. `templateinstance readiness test should report ready soon after all annotated objects are ready` does not seem to be related, but I want more runs.
    b. Heterogeneous setup looks good
  2. nightly-4.12-upgrade-from-nightly-4.11 looks good
  3. nightly-4.12-upgrade-from-stable-4.11 went well but failed on a CI infra flake in the post phase
  4. nightly-4.12-ocp-e2e-aws-ovn-heterogeneous
    a. `alert TelemeterClientFailures fired for 144 seconds with labels: {namespace="openshift-monitoring", severity="warning"}` does not seem to be related, but I want more runs
    b. `Warning: incorrect GroupVersionKind for AWSMachineProviderConfig object: machine.openshift.io/v1beta1, Kind=AWSMachineProviderConfig` in the heterogeneous setup step seems to appear in all initial 4.12 jobs and does not seem to be caused by the step itself: this content is pulled from the cluster, modified, and re-applied
  5. nightly-4.13-* jobs all fail with `failed to resolve release arm64-latest: failed to request latest release: server responded with 404: no release configuration exists with the requested name`: there are no 4.13 releases on https://multi.ocp.releases.ci.openshift.org/ yet

@openshift-merge-robot openshift-merge-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Nov 7, 2022
@petr-muller petr-muller force-pushed the ota-656-hetero-updates branch from f4c63fc to 7abc5d0 Compare November 7, 2022 16:13
@openshift-merge-robot openshift-merge-robot removed the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Nov 7, 2022
Member Author

I am not sure about a good set of reviewers here; this is a copy-paste of the other upgrade workflows' OWNERS file.

Member

It is probably worth including someone from multi-arch in here, so they can approve new upgrade-aws-heterogeneous-* steps/chains/workflows if we need more in the future. @deepsm007 , would that be you? Or can you recommend someone else?

I don't think we need to block on having someone from multi-arch though. If we stick with the parent-dir approver set, hack/dedup-owners.sh makes it easy to set up a DRYer symlink.

Contributor

@deepsm007 and @Prashanth684 to own heterogeneous

@petr-muller petr-muller force-pushed the ota-656-hetero-updates branch from 7abc5d0 to b4dc07d Compare November 7, 2022 16:18
@petr-muller
Member Author

/pj-rehearse

@petr-muller petr-muller changed the title WIP: OTA-656: Add CI jobs for heterogeneous cluster upgrades OTA-656: Add CI jobs for heterogeneous cluster upgrades Nov 7, 2022
@openshift-ci openshift-ci bot removed the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Nov 7, 2022
@petr-muller petr-muller force-pushed the ota-656-hetero-updates branch from b4dc07d to 8fe905f Compare November 7, 2022 16:35
@petr-muller
Member Author

/pj-rehearse

@petr-muller
Member Author

/uncc @fabianofranz @jaypoulz
/cc @deepsm007 @aleskandro @jottofar @wking

I think this is now reviewable. Several questions:

  1. I selected random hours of the day for the cron patterns and set the day to Monday to avoid collisions with MIXEDARCH-37: Arm worker nodes for heterogeneous workflow #31965, but I do not have the full context about why the crons are actually needed, so I'd welcome insights about whether these are set appropriately
  2. Is the scope correct? Do we need these jobs? Fewer of them? Are some missing?
  3. Who are good owners for the workflow? OTA? Multiarch folks?
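
For reference, a schedule of the kind described in question 1 (a random time of day, always on Monday) could be generated as below. This is only a sketch: the helper name `monday_cron` is hypothetical, and it assumes the standard five-field cron syntax in which day-of-week `1` means Monday. It does not reproduce the actual values used in this PR.

```python
import random


def monday_cron(rng: random.Random) -> str:
    """Build a five-field cron pattern firing once every Monday.

    Field order: minute, hour, day-of-month, month, day-of-week.
    The minute and hour are drawn at random so that several jobs
    created this way are unlikely to all start at the same moment.
    """
    minute = rng.randrange(60)  # 0..59
    hour = rng.randrange(24)    # 0..23
    return f"{minute} {hour} * * 1"


# Usage: a seeded generator makes the choice reproducible across runs.
pattern = monday_cron(random.Random(656))
```

Passing the random generator in explicitly keeps the helper deterministic under test, which matters if the patterns are regenerated by tooling rather than picked by hand.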

@openshift-ci openshift-ci bot requested review from aleskandro, deepsm007 and jottofar and removed request for fabianofranz and jaypoulz November 7, 2022 16:37
@openshift-ci openshift-ci bot added the lgtm Indicates that a PR is ready to be merged. label Nov 8, 2022
@petr-muller
Member Author

/pj-rehearse ack

@openshift-ci-robot openshift-ci-robot added the rehearsals-ack Signifies that rehearsal jobs have been acknowledged label Nov 9, 2022
Member

@wking wking left a comment

We can refresh the LGTM after the OWNERS tweak.

/approve
/hold for OWNERS tweak

@openshift-ci openshift-ci bot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Nov 9, 2022
@openshift-ci openshift-ci bot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Nov 9, 2022
- `$KUBECONFIG` is set by ci-operator
- Console URL extraction only needs to be done by steps that *install* a
  cluster
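
A minimal Python sketch of the first note above, assuming nothing beyond what it states: a step reads the kubeconfig location from `$KUBECONFIG` instead of relying on a fixed installer path such as the `/tmp/installer/auth/kubeconfig` seen in the earlier `Config not found` warning. The function name `resolve_kubeconfig` is hypothetical, and the real step script is not shown in this conversation.

```python
import os
from typing import Mapping


def resolve_kubeconfig(env: Mapping[str, str]) -> str:
    """Return the kubeconfig path provided through the environment.

    Hypothetical helper: it trusts $KUBECONFIG (set by ci-operator,
    per the note above) and fails loudly when it is missing rather
    than guessing at an installer-specific location.
    """
    path = env.get("KUBECONFIG", "")
    if not path:
        raise RuntimeError("KUBECONFIG is not set in the step environment")
    return path


# Usage inside a step: pass the process environment.
# kubeconfig = resolve_kubeconfig(os.environ)
```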
@petr-muller petr-muller force-pushed the ota-656-hetero-updates branch from 9ce7d4e to 0debcc7 Compare November 10, 2022 13:35
@openshift-ci openshift-ci bot removed the lgtm Indicates that a PR is ready to be merged. label Nov 10, 2022
@petr-muller petr-muller force-pushed the ota-656-hetero-updates branch from 0debcc7 to 287b13f Compare November 10, 2022 13:39
@openshift-ci
Contributor

openshift-ci bot commented Nov 10, 2022

@petr-muller: The following tests failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

| Test name | Commit | Details | Required | Rerun command |
| --- | --- | --- | --- | --- |
| ci/rehearse/periodic-ci-openshift-multiarch-master-nightly-4.12-upgrade-from-nightly-4.11-ocp-ovn-remote-libvirt-ppc64le | f4c63fc640a1d62e9ca04fa720ff240df86c9bc5 | link | unknown | /test pj-rehearse |
| ci/rehearse/periodic-ci-openshift-multiarch-master-nightly-4.12-upgrade-from-nightly-4.11-ocp-e2e-aws-heterogeneous-upgrade | 8fe905f8faac1ecc5bc1f89ed051ce10844da891 | link | unknown | /pj-rehearse |
| ci/rehearse/periodic-ci-openshift-multiarch-master-nightly-4.13-upgrade-from-stable-4.12-ocp-e2e-aws-heterogeneous-upgrade | 8fe905f8faac1ecc5bc1f89ed051ce10844da891 | link | unknown | /pj-rehearse |
| ci/rehearse/periodic-ci-openshift-multiarch-master-nightly-4.12-ocp-e2e-aws-ovn-heterogeneous-upgrade | 8fe905f8faac1ecc5bc1f89ed051ce10844da891 | link | unknown | /pj-rehearse |
| ci/rehearse/periodic-ci-openshift-multiarch-master-nightly-4.13-upgrade-from-nightly-4.12-ocp-ovn-remote-libvirt-s390x | 8fe905f8faac1ecc5bc1f89ed051ce10844da891 | link | unknown | /pj-rehearse |
| ci/rehearse/periodic-ci-openshift-multiarch-master-nightly-4.13-upgrade-from-nightly-4.12-ocp-e2e-aws-sdn-arm64 | 8fe905f8faac1ecc5bc1f89ed051ce10844da891 | link | unknown | /pj-rehearse |
| ci/rehearse/periodic-ci-openshift-multiarch-master-nightly-4.13-ocp-e2e-aws-ovn-heterogeneous-upgrade | b4dc07d5dc48e31b07598bca18ba10d6c0d4d24d | link | unknown | /pj-rehearse |
| ci/rehearse/periodic-ci-openshift-multiarch-master-nightly-4.13-upgrade-from-nightly-4.12-ocp-e2e-aws-heterogeneous-upgrade | b4dc07d5dc48e31b07598bca18ba10d6c0d4d24d | link | unknown | /pj-rehearse |
| ci/rehearse/periodic-ci-openshift-openshift-tests-private-release-4.12-multi-nightly-e2e-aws-ipi-p2 | 8fe905f8faac1ecc5bc1f89ed051ce10844da891 | link | unknown | /pj-rehearse |
| ci/rehearse/periodic-ci-openshift-multiarch-master-nightly-4.13-upgrade-from-nightly-4.12-ocp-ovn-remote-libvirt-ppc64le | b4dc07d5dc48e31b07598bca18ba10d6c0d4d24d | link | unknown | /pj-rehearse |
| ci/rehearse/periodic-ci-openshift-multiarch-master-nightly-4.13-ocp-e2e-aws-ovn-heterogeneous | 8fe905f8faac1ecc5bc1f89ed051ce10844da891 | link | unknown | /pj-rehearse |

Full PR test history. Your PR dashboard.

Details

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.

@petr-muller
Member Author

/hold cancel
/pj-rehearse

Updated OWNERS, also submitted #33932 separately so that I can help with maintaining all of these.

Can I have a re-/lgtm?

@openshift-ci openshift-ci bot removed the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Nov 10, 2022
Member

@wking wking left a comment

/lgtm

@openshift-ci openshift-ci bot added the lgtm Indicates that a PR is ready to be merged. label Nov 10, 2022
@openshift-ci
Contributor

openshift-ci bot commented Nov 10, 2022

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: aleskandro, deepsm007, petr-muller, wking

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-merge-robot openshift-merge-robot merged commit ec16f10 into openshift:master Nov 10, 2022
@openshift-ci
Contributor

openshift-ci bot commented Nov 10, 2022

@petr-muller: Updated the following 2 configmaps:

  • ci-operator-master-configs configmap in namespace ci at cluster app.ci using the following files:
    • key openshift-multiarch-master__nightly-4.11.yaml using file ci-operator/config/openshift/multiarch/openshift-multiarch-master__nightly-4.11.yaml
    • key openshift-multiarch-master__nightly-4.12-upgrade-from-nightly-4.11.yaml using file ci-operator/config/openshift/multiarch/openshift-multiarch-master__nightly-4.12-upgrade-from-nightly-4.11.yaml
    • key openshift-multiarch-master__nightly-4.12-upgrade-from-stable-4.11.yaml using file ci-operator/config/openshift/multiarch/openshift-multiarch-master__nightly-4.12-upgrade-from-stable-4.11.yaml
    • key openshift-multiarch-master__nightly-4.12.yaml using file ci-operator/config/openshift/multiarch/openshift-multiarch-master__nightly-4.12.yaml
    • key openshift-multiarch-master__nightly-4.13-upgrade-from-nightly-4.12.yaml using file ci-operator/config/openshift/multiarch/openshift-multiarch-master__nightly-4.13-upgrade-from-nightly-4.12.yaml
    • key openshift-multiarch-master__nightly-4.13-upgrade-from-stable-4.12.yaml using file ci-operator/config/openshift/multiarch/openshift-multiarch-master__nightly-4.13-upgrade-from-stable-4.12.yaml
    • key openshift-multiarch-master__nightly-4.13.yaml using file ci-operator/config/openshift/multiarch/openshift-multiarch-master__nightly-4.13.yaml
  • job-config-master-periodics configmap in namespace ci at cluster app.ci using the following files:
    • key openshift-multiarch-master-periodics.yaml using file ci-operator/jobs/openshift/multiarch/openshift-multiarch-master-periodics.yaml
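
The key and file names in the listing above appear to follow a regular pattern. The sketch below reconstructs it from these examples only: the function names are hypothetical, and the pattern is inferred from this listing rather than taken from ci-operator documentation.

```python
def config_key(org: str, repo: str, branch: str, variant: str = "") -> str:
    """Configmap key for a ci-operator config, as seen in the listing.

    A variant (e.g. "nightly-4.11") is appended after a double
    underscore; without a variant the suffix is assumed to be omitted.
    """
    base = f"{org}-{repo}-{branch}"
    return f"{base}__{variant}.yaml" if variant else f"{base}.yaml"


def config_path(org: str, repo: str, branch: str, variant: str = "") -> str:
    """Repository path of the file backing a ci-operator config key."""
    return f"ci-operator/config/{org}/{repo}/{config_key(org, repo, branch, variant)}"


def periodics_path(org: str, repo: str, branch: str) -> str:
    """Repository path of the generated periodic job definitions."""
    return f"ci-operator/jobs/{org}/{repo}/{org}-{repo}-{branch}-periodics.yaml"
```

For example, `config_path("openshift", "multiarch", "master", "nightly-4.11")` yields the first file path in the listing.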
Details

In response to this:

  • multiarch: add scaffolding for hetero->hetero upgrade jobs
  • make update

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.
