Skip to content

e2e: stalld and sysctl_d_override e2e test changes#298

Merged
openshift-merge-robot merged 1 commit intoopenshift:masterfrom
jmencak:4.10-e2e-fixes
Dec 13, 2021
Merged

e2e: stalld and sysctl_d_override e2e test changes#298
openshift-merge-robot merged 1 commit intoopenshift:masterfrom
jmencak:4.10-e2e-fixes

Conversation

@jmencak
Copy link
Copy Markdown
Contributor

@jmencak jmencak commented Dec 13, 2021

Simplify the e2e stalld test. It is unnecessarily complex and only adds
to the time it takes to pass all e2e tests by approximately 1 node
reboot. Add more debugging for any future troubleshooting.

Adjust the sysctl_d_override golang e2e test to show the contents of the
sysctl override file for debugging and synchronize [fsync()] it to
persistent storage after write.

Simplify the e2e stalld test.  It is unnecessarily complex and only adds
to the time it takes to pass all e2e tests by approximately 1 node
reboot.  Add more debugging for any future troubleshooting.

Adjust the sysctl_d_override golang e2e test to show the contents of the
sysctl override file for debugging and synchronize [fsync()] it to
persistent storage after write.
@openshift-ci
Copy link
Copy Markdown
Contributor

openshift-ci Bot commented Dec 13, 2021

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: jmencak

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-ci openshift-ci Bot requested review from dagrayvid and kpouget December 13, 2021 07:20
@openshift-ci openshift-ci Bot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Dec 13, 2021
@jmencak
Copy link
Copy Markdown
Contributor Author

jmencak commented Dec 13, 2021

sigh
/retest verify

@openshift-ci
Copy link
Copy Markdown
Contributor

openshift-ci Bot commented Dec 13, 2021

@jmencak: The /retest command does not accept any targets.
The following commands are available to trigger required jobs:

  • /test e2e-aws
  • /test e2e-aws-operator
  • /test e2e-upgrade
  • /test images
  • /test unit
  • /test verify

Use /test all to run all jobs.

Details

In response to this:

sigh
/retest verify

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@jmencak
Copy link
Copy Markdown
Contributor Author

jmencak commented Dec 13, 2021

/test verify

@jmencak
Copy link
Copy Markdown
Contributor Author

jmencak commented Dec 13, 2021

A note to reviewers. I've tested this by running ~500 e2e tests on 4.10 with no issues, so hopefully the most prominent operator e2e tests issues are resolved now. (Famous last words...).

@jmencak
Copy link
Copy Markdown
Contributor Author

jmencak commented Dec 13, 2021

/retest

@jmencak
Copy link
Copy Markdown
Contributor Author

jmencak commented Dec 13, 2021

OK, time to look into this again:

$ git clone https://github.com/openshift/cluster-node-tuning-operator
Cloning into 'cluster-node-tuning-operator'...
remote: Enumerating objects: 27663, done.
remote: Counting objects: 100% (4050/4050), done.
remote: Compressing objects: 100% (2057/2057), done.
remote: Total 27663 (delta 1959), reused 3472 (delta 1836), pack-reused 23613
Receiving objects: 100% (27663/27663), 31.51 MiB | 12.71 MiB/s, done.
Resolving deltas: 100% (12726/12726), done.

$ git fetch origin pull/298/head:298
remote: Enumerating objects: 15, done.
remote: Counting objects: 100% (15/15), done.
remote: Compressing objects: 100% (3/3), done.
remote: Total 8 (delta 5), reused 8 (delta 5), pack-reused 0
Unpacking objects: 100% (8/8), 1013 bytes | 202.00 KiB/s, done.
From https://github.com/openshift/cluster-node-tuning-operator
 * [new ref]           refs/pull/298/head -> 298

$ git checkout 298
Switched to branch '298'

$ make verify
Installing controller-gen into '_output/tools/bin/controller-gen-v0.6.0'
mkdir -p '_output/tools/bin/'
curl -s -f -L https://github.com/openshift/kubernetes-sigs-controller-tools/releases/download/v0.6.0/controller-gen-linux-amd64 -o '_output/tools/bin/controller-gen-v0.6.0'
chmod +x '_output/tools/bin/controller-gen-v0.6.0';
Installing yq into '_output/tools/bin/yq-2.4.0'
mkdir -p '_output/tools/bin/'
curl -s -f -L https://github.com/mikefarah/yq/releases/download/2.4.0/yq_linux_amd64 -o '_output/tools/bin/yq-2.4.0'
chmod +x '_output/tools/bin/yq-2.4.0';
Installing yaml-patch into '_output/tools/bin/yaml-patch-v0.0.10'
mkdir -p '_output/tools/bin/'
curl -s -f -L https://github.com/krishicks/yaml-patch/releases/download/v0.0.10/yaml_patch_linux -o '_output/tools/bin/yaml-patch-v0.0.10'
make: *** [vendor/github.com/openshift/build-machinery-go/make/targets/openshift/operator/../yaml-patch.mk:18: ensure-yaml-patch] Error 22

$ wget https://github.com/krishicks/yaml-patch/releases/download/v0.0.10/yaml_patch_linux
--2021-12-13 11:25:11--  https://github.com/krishicks/yaml-patch/releases/download/v0.0.10/yaml_patch_linux
Resolving github.com (github.com)... 140.82.121.3
Connecting to github.com (github.com)|140.82.121.3|:443... connected.
HTTP request sent, awaiting response... 404 Not Found
2021-12-13 11:25:11 ERROR 404: Not Found.

$ wget https://github.com/krishicks/yaml-patch/
--2021-12-13 11:29:02--  https://github.com/krishicks/yaml-patch/
Resolving github.com (github.com)... 140.82.121.4
Connecting to github.com (github.com)|140.82.121.4|:443... connected.
HTTP request sent, awaiting response... 404 Not Found
2021-12-13 11:29:03 ERROR 404: Not Found.

Awesome... As the tests pass with my local copy
/override verify
for the time being.

@jmencak
Copy link
Copy Markdown
Contributor Author

jmencak commented Dec 13, 2021

/override ci/prow/verify

@openshift-ci
Copy link
Copy Markdown
Contributor

openshift-ci Bot commented Dec 13, 2021

@jmencak: Overrode contexts on behalf of jmencak: ci/prow/verify

Details

In response to this:

/override ci/prow/verify

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@openshift-ci
Copy link
Copy Markdown
Contributor

openshift-ci Bot commented Dec 13, 2021

@jmencak: /override requires a failed status context or a job name to operate on.
The following unknown contexts were given:

  • verify

Only the following contexts were expected:

  • ci/prow/e2e-aws
  • ci/prow/e2e-aws-operator
  • ci/prow/e2e-upgrade
  • ci/prow/images
  • ci/prow/unit
  • ci/prow/verify
  • pull-ci-openshift-cluster-node-tuning-operator-release-4.1-e2e-aws
  • pull-ci-openshift-cluster-node-tuning-operator-release-4.1-images
  • pull-ci-openshift-cluster-node-tuning-operator-release-4.1-unit
  • pull-ci-openshift-cluster-node-tuning-operator-release-4.1-verify
  • pull-ci-openshift-cluster-node-tuning-operator-release-4.10-e2e-aws-operator
  • pull-ci-openshift-cluster-node-tuning-operator-release-4.10-e2e-upgrade
  • tide
Details

In response to this:

OK, time to look into this again:

$ git clone https://github.com/openshift/cluster-node-tuning-operator
Cloning into 'cluster-node-tuning-operator'...
remote: Enumerating objects: 27663, done.
remote: Counting objects: 100% (4050/4050), done.
remote: Compressing objects: 100% (2057/2057), done.
remote: Total 27663 (delta 1959), reused 3472 (delta 1836), pack-reused 23613
Receiving objects: 100% (27663/27663), 31.51 MiB | 12.71 MiB/s, done.
Resolving deltas: 100% (12726/12726), done.

$ git fetch origin pull/298/head:298
remote: Enumerating objects: 15, done.
remote: Counting objects: 100% (15/15), done.
remote: Compressing objects: 100% (3/3), done.
remote: Total 8 (delta 5), reused 8 (delta 5), pack-reused 0
Unpacking objects: 100% (8/8), 1013 bytes | 202.00 KiB/s, done.
From https://github.com/openshift/cluster-node-tuning-operator
* [new ref]           refs/pull/298/head -> 298

$ git checkout 298
Switched to branch '298'

$ make verify
Installing controller-gen into '_output/tools/bin/controller-gen-v0.6.0'
mkdir -p '_output/tools/bin/'
curl -s -f -L https://github.com/openshift/kubernetes-sigs-controller-tools/releases/download/v0.6.0/controller-gen-linux-amd64 -o '_output/tools/bin/controller-gen-v0.6.0'
chmod +x '_output/tools/bin/controller-gen-v0.6.0';
Installing yq into '_output/tools/bin/yq-2.4.0'
mkdir -p '_output/tools/bin/'
curl -s -f -L https://github.com/mikefarah/yq/releases/download/2.4.0/yq_linux_amd64 -o '_output/tools/bin/yq-2.4.0'
chmod +x '_output/tools/bin/yq-2.4.0';
Installing yaml-patch into '_output/tools/bin/yaml-patch-v0.0.10'
mkdir -p '_output/tools/bin/'
curl -s -f -L https://github.com/krishicks/yaml-patch/releases/download/v0.0.10/yaml_patch_linux -o '_output/tools/bin/yaml-patch-v0.0.10'
make: *** [vendor/github.com/openshift/build-machinery-go/make/targets/openshift/operator/../yaml-patch.mk:18: ensure-yaml-patch] Error 22

$ wget https://github.com/krishicks/yaml-patch/releases/download/v0.0.10/yaml_patch_linux
--2021-12-13 11:25:11--  https://github.com/krishicks/yaml-patch/releases/download/v0.0.10/yaml_patch_linux
Resolving github.com (github.com)... 140.82.121.3
Connecting to github.com (github.com)|140.82.121.3|:443... connected.
HTTP request sent, awaiting response... 404 Not Found
2021-12-13 11:25:11 ERROR 404: Not Found.

$ wget https://github.com/krishicks/yaml-patch/
--2021-12-13 11:29:02--  https://github.com/krishicks/yaml-patch/
Resolving github.com (github.com)... 140.82.121.4
Connecting to github.com (github.com)|140.82.121.4|:443... connected.
HTTP request sent, awaiting response... 404 Not Found
2021-12-13 11:29:03 ERROR 404: Not Found.

Awesome... As the tests pass with my local copy
/override verify
for the time being.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@jmencak
Copy link
Copy Markdown
Contributor Author

jmencak commented Dec 13, 2021

The verify issue is fixed by:
openshift/build-machinery-go#58
we need to vendor it in.

@jmencak
Copy link
Copy Markdown
Contributor Author

jmencak commented Dec 13, 2021

#299 should fix the verify step.

@kpouget
Copy link
Copy Markdown

kpouget commented Dec 13, 2021

/test verify

@kpouget
Copy link
Copy Markdown

kpouget commented Dec 13, 2021

thanks @jmencak, the PR looks good to me

/lgtm

/hold
I let you unhold to merge when you want

@openshift-ci openshift-ci Bot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Dec 13, 2021
@openshift-ci openshift-ci Bot added the lgtm Indicates that a PR is ready to be merged. label Dec 13, 2021
@jmencak
Copy link
Copy Markdown
Contributor Author

jmencak commented Dec 13, 2021

Thank you Kevin!
/hold cancel

@openshift-ci openshift-ci Bot removed the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Dec 13, 2021
@openshift-bot
Copy link
Copy Markdown
Contributor

/retest-required

Please review the full test history for this PR and help us cut down flakes.

1 similar comment
@openshift-bot
Copy link
Copy Markdown
Contributor

/retest-required

Please review the full test history for this PR and help us cut down flakes.

@openshift-ci
Copy link
Copy Markdown
Contributor

openshift-ci Bot commented Dec 13, 2021

@jmencak: all tests passed!

Full PR test history. Your PR dashboard.

Details

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.

@openshift-merge-robot openshift-merge-robot merged commit bc9e58a into openshift:master Dec 13, 2021
@jmencak jmencak deleted the 4.10-e2e-fixes branch December 14, 2021 07:02
IlyaTyomkin pushed a commit to IlyaTyomkin/cluster-node-tuning-operator that referenced this pull request May 23, 2023
Simplify the e2e stalld test.  It is unnecessarily complex and only adds
to the time it takes to pass all e2e tests by approximately 1 node
reboot.  Add more debugging for any future troubleshooting.

Adjust the sysctl_d_override golang e2e test to show the contents of the
sysctl override file for debugging and synchronize [fsync()] it to
persistent storage after write.

Co-authored-by: Jiri Mencak <jmencak@users.noreply.github.com>
IlyaTyomkin pushed a commit to IlyaTyomkin/cluster-node-tuning-operator that referenced this pull request Jun 13, 2023
Simplify the e2e stalld test.  It is unnecessarily complex and only adds
to the time it takes to pass all e2e tests by approximately 1 node
reboot.  Add more debugging for any future troubleshooting.

Adjust the sysctl_d_override golang e2e test to show the contents of the
sysctl override file for debugging and synchronize [fsync()] it to
persistent storage after write.

Co-authored-by: Jiri Mencak <jmencak@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

approved Indicates a PR has been approved by an approver from all required OWNERS files. lgtm Indicates that a PR is ready to be merged.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants