@jianlinliu
Contributor

@jianlinliu jianlinliu commented Jul 30, 2025

  1. Add run_if_changed: (gcp|google) to the GCP custom-dns jobs to help us catch regressions early, e.g. CORS-4027: Updating feature gate name to GCPClusterHostedDNSInstall #67624 (comment)
  2. Override the test phase to drop e2e testing from the installer presubmit GCP custom-dns jobs, which saves cloud cost.
  3. Update the gcp-private job at the same time to drop unnecessary steps.
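
As a rough illustration of the first two items, a ci-operator test entry along the following lines would combine the path trigger with an install-only test phase. This is a minimal sketch, not the actual diff: the env values and the health-check chain come from the snippet quoted in the review thread on this PR, the test name is inferred from the presubmit job names, and the cluster profile and workflow names are hypothetical placeholders.

```yaml
# Sketch of one presubmit entry under `tests:` in a ci-operator config.
tests:
- as: gcp-custom-dns                    # assumed name, inferred from the job names
  run_if_changed: (gcp|google)          # run only when a changed file path matches this regex
  steps:
    cluster_profile: example-gcp-profile          # hypothetical placeholder
    env:
      FEATURE_SET: CustomNoUpgrade
      USER_PROVISIONED_DNS: "yes"
    test:                               # replaces the workflow's test phase: health check only, no e2e
    - chain: cucushift-installer-check-cluster-health
    workflow: example-gcp-custom-dns-workflow     # hypothetical placeholder
```

With a layout like this, the workflow still provisions and tears down the cluster, while the overriding test list keeps the job limited to verifying the installation.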

@openshift-ci openshift-ci bot requested review from droslean and lihongan July 30, 2025 06:22
@jianlinliu
Contributor Author

/pj-rehearse periodic-ci-openshift-release-master-nightly-4.20-e2e-gcp-custom-dns

@openshift-ci-robot
Contributor

@jianlinliu: now processing your pj-rehearse request. Please allow up to 10 minutes for jobs to trigger or cancel.

@jianlinliu
Contributor Author

/close

@openshift-ci openshift-ci bot closed this Jul 30, 2025
@openshift-ci
Contributor

openshift-ci bot commented Jul 30, 2025

@jianlinliu: Closed this PR.

Details

In response to this:

/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

@jianlinliu
Contributor Author

In favor of #67624

@jianlinliu
Contributor Author

/reopen

@openshift-ci openshift-ci bot reopened this Aug 1, 2025
@openshift-ci
Contributor

openshift-ci bot commented Aug 1, 2025

@jianlinliu: Reopened this PR.

Details

In response to this:

/reopen


@jianlinliu jianlinliu changed the title from "update gcp custom dns ci jobs to use new featureGate name" to "refine gcp private and custom-dns presubmit jobs" Aug 1, 2025
@jianlinliu jianlinliu force-pushed the gcp-custom-dns-rename branch 5 times, most recently from d4f1f01 to e831e1a Compare August 1, 2025 06:35
@jianlinliu
Contributor Author

/pj-rehearse pull-ci-openshift-installer-release-4.19-gcp-private pull-ci-openshift-installer-release-4.20-gcp-custom-dns

@openshift-ci-robot
Contributor

@jianlinliu: now processing your pj-rehearse request. Please allow up to 10 minutes for jobs to trigger or cancel.

@openshift-ci
Contributor

openshift-ci bot commented Aug 1, 2025

@jianlinliu: The following test failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

| Test name | Commit | Details | Required | Rerun command |
| --- | --- | --- | --- | --- |
| ci/rehearse/periodic-ci-openshift-release-master-nightly-4.20-e2e-gcp-custom-dns | 69a72f9 | link | unknown | /pj-rehearse periodic-ci-openshift-release-master-nightly-4.20-e2e-gcp-custom-dns |

Full PR test history. Your PR dashboard.

Details

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

@jianlinliu
Contributor Author

/pj-rehearse pull-ci-openshift-installer-release-4.20-gcp-custom-dns

@openshift-ci-robot
Contributor

@jianlinliu: now processing your pj-rehearse request. Please allow up to 10 minutes for jobs to trigger or cancel.

@jianlinliu
Contributor Author

/pj-rhearse abort

@jianlinliu jianlinliu force-pushed the gcp-custom-dns-rename branch from e831e1a to 24a3384 Compare August 1, 2025 10:32
@openshift-ci-robot
Contributor

[REHEARSALNOTIFIER]
@jianlinliu: the pj-rehearse plugin accommodates running rehearsal tests for the changes in this PR. Expand 'Interacting with pj-rehearse' for usage details. The following rehearsable tests have been affected by this change:

| Test name | Repo | Type | Reason |
| --- | --- | --- | --- |
| pull-ci-openshift-installer-main-gcp-custom-dns | openshift/installer | presubmit | Presubmit changed |
| pull-ci-openshift-installer-release-4.19-gcp-custom-dns | openshift/installer | presubmit | Presubmit changed |
| pull-ci-openshift-installer-release-4.20-gcp-custom-dns | openshift/installer | presubmit | Presubmit changed |
| pull-ci-openshift-installer-release-4.21-gcp-custom-dns | openshift/installer | presubmit | Presubmit changed |
| pull-ci-openshift-installer-release-4.21-gcp-private | openshift/installer | presubmit | Ci-operator config changed |
| pull-ci-openshift-installer-release-4.19-gcp-private | openshift/installer | presubmit | Ci-operator config changed |
| pull-ci-openshift-installer-main-gcp-private | openshift/installer | presubmit | Ci-operator config changed |
| pull-ci-openshift-installer-release-4.20-gcp-private | openshift/installer | presubmit | Ci-operator config changed |

Interacting with pj-rehearse

Comment: /pj-rehearse to run up to 5 rehearsals
Comment: /pj-rehearse skip to opt-out of rehearsals
Comment: /pj-rehearse {test-name}, with each test separated by a space, to run one or more specific rehearsals
Comment: /pj-rehearse more to run up to 10 rehearsals
Comment: /pj-rehearse max to run up to 25 rehearsals
Comment: /pj-rehearse auto-ack to run up to 5 rehearsals, and add the rehearsals-ack label on success
Comment: /pj-rehearse list to get an up-to-date list of affected jobs
Comment: /pj-rehearse abort to abort all active rehearsals
Comment: /pj-rehearse network-access-allowed to allow rehearsals of tests that have the restrict_network_access field set to false. This must be executed by an openshift org member who is not the PR author

Once you are satisfied with the results of the rehearsals, comment: /pj-rehearse ack to unblock merge. When the rehearsals-ack label is present on your PR, merge will no longer be blocked by rehearsals.
If you would like the rehearsals-ack label removed, comment: /pj-rehearse reject to re-block merging.

@jianlinliu
Contributor Author

/pj-rehearse pull-ci-openshift-installer-release-4.20-gcp-custom-dns

@openshift-ci-robot
Contributor

@jianlinliu: now processing your pj-rehearse request. Please allow up to 10 minutes for jobs to trigger or cancel.

@jianlinliu
Contributor Author

/pj-rehearse ack

@openshift-ci-robot
Contributor

@jianlinliu: now processing your pj-rehearse request. Please allow up to 10 minutes for jobs to trigger or cancel.

@openshift-ci-robot openshift-ci-robot added the rehearsals-ack label (Signifies that rehearsal jobs have been acknowledged) Aug 1, 2025
@jianlinliu
Contributor Author

cc @gpei and @yunjiang29 to review

@gpei
Contributor

gpei commented Aug 4, 2025

/lgtm

@openshift-ci openshift-ci bot added the lgtm label (Indicates that a PR is ready to be merged.) Aug 4, 2025
@yunjiang29
Contributor

/lgtm

    FEATURE_SET: CustomNoUpgrade
    USER_PROVISIONED_DNS: "yes"
  test:
  - chain: cucushift-installer-check-cluster-health
Contributor

So this will stop e2e tests from running on the custom DNS job?

Are we doing this because the e2e tests are permafailing? I am ok with that, but not sure if there are other options?

@sadasu & I were discussing this today, and roughly 10 tests failed on both runs of the custom DNS job. I believe @gpei mentioned this before, and that it would not make sense to add ignores for each individual test.

Contributor Author

> because the e2e tests are permafailing

This is one of the reasons. Some others:

  1. We set a public domain that resides in the QE project, so the job depends on QE's cluster profile. The QE cloud project has a small budget, so skipping e2e testing shortens the run time and saves cloud cost.
  2. The job is an installer presubmit job, where most of the time we focus on installation results, so I dropped e2e to avoid noise from unexpected e2e failures.

Contributor

The ideal solution in practice would be to fix the tests in the origin repo, either by making them pass or by skipping them, so that we see the most accurate and reasonable e2e test results.
But I believe that having each team update their test code to adapt all test cases to every new cluster type we introduce would be a big workload for them, potentially with a very lengthy timeline.
During our initial testing of custom DNS, I informed the Ingress team that certain tests were consistently failing. They submitted PR openshift/origin#29985 to fix these issues, but the PR is still under review to this day.

Contributor

@jianlinliu and @gpei thanks for that explanation. I agree that installer pre-submit jobs can be limited to verifying a successful install.

We will be able to catch other errors during the periodic e2e custom-dns jobs. I am concerned about scenarios like openshift/origin#29985 and how that will affect the health of our periodic job within CR.

@patrickdillon
Contributor

/approve

@openshift-ci
Contributor

openshift-ci bot commented Aug 6, 2025

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: gpei, jianlinliu, patrickdillon, yunjiang29

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-ci openshift-ci bot added the approved label (Indicates a PR has been approved by an approver from all required OWNERS files.) Aug 6, 2025
@sadasu
Contributor

sadasu commented Aug 6, 2025

LGTM thanks for working on this.

@openshift-merge-bot openshift-merge-bot bot merged commit a923eaa into openshift:master Aug 6, 2025
16 of 17 checks passed
mcornea pushed a commit to mcornea/release that referenced this pull request Aug 7, 2025
tbuskey pushed a commit to tbuskey/release that referenced this pull request Aug 7, 2025
etmurasaki pushed a commit to etmurasaki/release that referenced this pull request Sep 3, 2025

Labels

approved: Indicates a PR has been approved by an approver from all required OWNERS files.
lgtm: Indicates that a PR is ready to be merged.
rehearsals-ack: Signifies that rehearsal jobs have been acknowledged

6 participants