re-enable all node flaky test, move node performance test to new job #19352

Closed
wants to merge 1 commit into from

MHBauer
Contributor

@MHBauer MHBauer commented Sep 24, 2020

The test filter in perf-image-config.yaml means that only tests matching
"Node Performance Testing" run in the flaky job.

  • Move the flaky test job back to the general image-config.yaml.
  • Create a new job config specific to "Node Performance Testing".
  • Decrease the run frequency of the new "Node Performance Testing" job:
    a 12h CI interval instead of 2h.

A separate PR in kubernetes/kubernetes is needed to take the "Node
Performance Testing" tests out of the flaky job: PR #95040.
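
The behaviour described above can be modelled with a small sketch. It assumes, based only on what this PR describes, that a test list attached to an image config replaces the job-level focus rather than narrowing it. The function `select_tests`, its parameters, and the `[Flaky]` test name are hypothetical, and the real node e2e runner may work differently.

```python
import re


def select_tests(all_tests, job_focus, image_tests=None):
    """Toy model of test selection (an assumption, not the real runner logic).

    If the image config lists test patterns, they replace the job-level focus;
    otherwise the job-level focus regex decides what runs.
    """
    patterns = image_tests if image_tests else [job_focus]
    return [t for t in all_tests if any(re.search(p, t) for p in patterns)]


ALL_TESTS = [
    # Name shortened from a test reported in this thread.
    "[sig-node] Node Performance Testing [Serial] [Slow] Run node performance testing with pre-defined workloads",
    # Hypothetical name, for illustration only.
    "[sig-node] Hypothetical Feature [Flaky] should eventually succeed",
]

# Flaky job pointed at a perf-style image config that carries its own test list.
with_perf_config = select_tests(
    ALL_TESTS, r"\[Flaky\]", image_tests=["Node Performance Testing"]
)

# Flaky job pointed at a general image config with no per-image test list.
with_general_config = select_tests(ALL_TESTS, r"\[Flaky\]")
```

Under that assumption, `with_perf_config` holds only the performance test and `with_general_config` only the `[Flaky]` one, which is the motivation for pointing the flaky job back at image-config.yaml.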

@k8s-ci-robot k8s-ci-robot added cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. size/M Denotes a PR that changes 30-99 lines, ignoring generated files. labels Sep 24, 2020
@k8s-ci-robot k8s-ci-robot added area/jobs sig/node Categorizes an issue or PR as relevant to SIG Node. sig/testing Categorizes an issue or PR as relevant to SIG Testing. labels Sep 24, 2020
@MHBauer
Contributor Author

MHBauer commented Oct 5, 2020

Previous work, #17687, including my own comment, #17687 (comment)

@MHBauer
Contributor Author

MHBauer commented Oct 5, 2020

/assign @sjenning

@MHBauer
Contributor Author

MHBauer commented Oct 5, 2020

kubernetes/kubernetes#95040 should go in first, probably.

@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jan 3, 2021
@spiffxp
Member

spiffxp commented Jan 19, 2021

/test pull-test-infra-integration
#20498 (comment)

@MHBauer
Contributor Author

MHBauer commented Jan 26, 2021

@SergeyKanzhelev This is still relevant: without it, nothing marked 'Flaky' runs anywhere, ever. That does mean those tests never fail, but at that point we might as well delete them all.

Additionally, we should de-footgun this by making it an error to specify tests in the image config when tests are also selected with Focus=.
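
A guard along those lines could look roughly like the sketch below. Everything in it is hypothetical: the names `ConflictingSelectionError` and `check_selection`, and the idea that the config is available as an image name, a list of test patterns, and a focus string. It shows only the shape of the check, not where it would live in the real tooling.

```python
class ConflictingSelectionError(ValueError):
    """Raised when tests are selected in two places at once (hypothetical)."""


def check_selection(image_name, image_tests, focus):
    """Reject a config that sets both an image-level test list and a focus.

    image_tests: list of test patterns from the image config (may be empty).
    focus: job-level focus regex as a string (may be empty).
    """
    if image_tests and focus:
        raise ConflictingSelectionError(
            f"image {image_name!r} lists tests {image_tests!r} "
            f"but the job also sets focus {focus!r}; use only one"
        )
```

Failing early this way would turn the silent override into a visible configuration error.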

TODO: make a list of all tests marked as Flaky.
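
One rough way to build such a list is to scan the test sources for the tag. The sketch below assumes that flaky tests carry the literal text `[Flaky]` somewhere in a `.go` file, typically in the test description; the function name `find_flaky_lines` is made up for illustration, and a plain text search like this will also pick up comments that mention the tag.

```python
from pathlib import Path


def find_flaky_lines(root):
    """Return (path, line number, line text) for each line containing '[Flaky]'.

    Only files ending in .go under `root` are scanned; this is a plain
    substring search, not a parse of the test definitions.
    """
    hits = []
    for path in sorted(Path(root).rglob("*.go")):
        text = path.read_text(encoding="utf-8", errors="replace")
        for lineno, line in enumerate(text.splitlines(), start=1):
            if "[Flaky]" in line:
                hits.append((str(path), lineno, line.strip()))
    return hits
```

Calling `find_flaky_lines` on a checkout of the test directory and printing the tuples would give a first draft of the list to review by hand.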

@fejta-bot

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Send feedback to sig-contributor-experience at kubernetes/community.
/lifecycle rotten

@k8s-ci-robot k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Feb 25, 2021
@fejta-bot

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-contributor-experience at kubernetes/community.
/close

@k8s-ci-robot
Contributor

@fejta-bot: Closed this PR.

In response to this:

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-contributor-experience at kubernetes/community.
/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@spiffxp
Member

spiffxp commented Mar 27, 2021

/remove-lifecycle rotten
/reopen
Given @MHBauer's comment above

@k8s-ci-robot k8s-ci-robot reopened this Mar 27, 2021
@k8s-ci-robot
Contributor

@spiffxp: Reopened this PR.

In response to this:

/remove-lifecycle rotten
/reopen
Given @MHBauer's comment above

@k8s-ci-robot k8s-ci-robot removed the lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. label Mar 27, 2021
@k8s-ci-robot
Contributor

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: MHBauer
To complete the pull request process, please ask for approval from sjenning after the PR has been reviewed.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@SergeyKanzhelev
Member

/assign

@@ -189,7 +189,7 @@ periodics:
 - --deployment=node
 - --gcp-project-type=node-e2e-project
 - --gcp-zone=us-west1-b
-- --node-args=--image-config-file=/workspace/test-infra/jobs/e2e_node/perf-image-config.yaml
+- --node-args=--image-config-file=/workspace/test-infra/jobs/e2e_node/image-config.yaml
Member

Should this tab exclude the "Node Performance Testing" tests using the skip? I am very confused about why this task picks up performance tests.

This definition says: - --test_args=--nodes=1 --focus="\[Flaky\]"

And the names of the tests don't include "Flaky":

E2eNode Suite.[sig-node] Node Performance Testing [Serial] [Slow] Run node performance testing with pre-defined workloads NAS parallel benchmark (NPB) suite - Embarrassingly Parallel (EP) workload [cos-stable1]
E2eNode Suite.[sig-node] Node Performance Testing [Serial] [Slow] Run node performance testing with pre-defined workloads NAS parallel benchmark (NPB) suite - Embarrassingly Parallel (EP) workload [ubuntu]
E2eNode Suite.[sig-node] Node Performance Testing [Serial] [Slow] Run node performance testing with pre-defined workloads NAS parallel benchmark (NPB) suite - Integer Sort (IS) workload [cos-stable1]
E2eNode Suite.[sig-node] Node Performance Testing [Serial] [Slow] Run node performance testing with pre-defined workloads NAS parallel benchmark (NPB) suite - Integer Sort (IS) workload [ubuntu]
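
The mismatch pointed out here can be checked directly. The sketch below takes the focus pattern exactly as written in the job definition and applies it as a regular expression to the four reported test names; the variable names are illustrative, and it says nothing about how the runner itself evaluates the focus.

```python
import re

# Focus pattern as it appears in the job definition quoted above.
FOCUS = re.compile(r"\[Flaky\]")

PREFIX = (
    "E2eNode Suite.[sig-node] Node Performance Testing [Serial] [Slow] "
    "Run node performance testing with pre-defined workloads "
    "NAS parallel benchmark (NPB) suite - "
)

# The four test names reported in this comment.
REPORTED = [
    PREFIX + "Embarrassingly Parallel (EP) workload [cos-stable1]",
    PREFIX + "Embarrassingly Parallel (EP) workload [ubuntu]",
    PREFIX + "Integer Sort (IS) workload [cos-stable1]",
    PREFIX + "Integer Sort (IS) workload [ubuntu]",
]

# Names that the focus pattern would select on its own.
matched = [name for name in REPORTED if FOCUS.search(name)]
print(matched)  # prints []
```

None of the names match, so the focus alone cannot explain why these tests ran in the flaky job.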

@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-contributor-experience at kubernetes/community.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jul 7, 2021
@k8s-ci-robot k8s-ci-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Aug 3, 2021
@k8s-ci-robot
Contributor

@MHBauer: PR needs rebase.

@SergeyKanzhelev
Member

/cc @manugupt1

@k8s-ci-robot
Contributor

@SergeyKanzhelev: GitHub didn't allow me to request PR reviews from the following users: manugupt1.

Note that only kubernetes members and repo collaborators can review this PR, and authors cannot review their own PRs.

In response to this:

/cc @manugupt1

@ehashman
Member

ehashman commented Aug 4, 2021

cc @manugupt1

@ehashman
Member

ehashman commented Aug 5, 2021

/close

@manugupt1 to submit a follow-up

@k8s-ci-robot
Contributor

@ehashman: Closed this PR.

In response to this:

/close

@manugupt1 to submit a follow-up

manugupt1 pushed a commit to manugupt1/test-infra that referenced this pull request Aug 5, 2021
From: kubernetes#19352

Filtering of test runs in perf-image-config.yaml means only specific
tests matching "Node Performance Testing" run in flaky job.

Move flaky test job back to general image-config.yaml.
Create new job config specific to "Node Performance Testing".
Decrease ci interval of new "Node Performance Testing" job to 12h
instead of 2h.
manugupt1 pushed a commit to manugupt1/test-infra that referenced this pull request Aug 11, 2021
From: kubernetes#19352

Filtering of test runs in perf-image-config.yaml means only specific
tests matching "Node Performance Testing" run in flaky job.

Move flaky test job back to general image-config.yaml.
Create new job config specific to "Node Performance Testing".
Decrease ci interval of new "Node Performance Testing" job to 12h
instead of 2h.
Labels
area/config Issues or PRs related to code in /config
area/jobs
cncf-cla: yes Indicates the PR's author has signed the CNCF CLA.
lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale.
needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD.
sig/node Categorizes an issue or PR as relevant to SIG Node.
sig/testing Categorizes an issue or PR as relevant to SIG Testing.
size/M Denotes a PR that changes 30-99 lines, ignoring generated files.
Projects
Archived in project
7 participants