Test flakiness is in a critical state #30824

Closed
BridgeAR opened this issue Dec 6, 2019 · 8 comments
Labels
arm Issues and PRs related to the ARM platform. build Issues and PRs related to build files or the CI. flaky-test Issues and PRs related to the tests with unstable failures on the CI. help wanted Issues that need assistance from volunteers or PRs that need help to proceed. linux Issues and PRs related to the Linux platform. test Issues and PRs related to the tests. windows Issues and PRs related to the Windows platform.

Comments

@BridgeAR
Member

BridgeAR commented Dec 6, 2019

In the last 100 CI runs we only had 4 successful ones (https://nodejs-ci-health.mmarchini.me/#/job-summary). This is the worst ratio I can remember.

This is probably due to build issues and flaky tests together. I would just like to encourage everyone to have a look into some of these to get the CI results back to a level that we can better rely upon.

Most issues are around Windows (node-test-commit-windows-fanned) followed by arm (node-test-commit-arm-fanned) and containered builds (node-test-commit-linux-containered).

// cc @nodejs/collaborators

@BridgeAR BridgeAR added windows Issues and PRs related to the Windows platform. build Issues and PRs related to build files or the CI. test Issues and PRs related to the tests. arm Issues and PRs related to the ARM platform. flaky-test Issues and PRs related to the tests with unstable failures on the CI. linux Issues and PRs related to the Linux platform. labels Dec 6, 2019
@Trott
Member

Trott commented Dec 6, 2019

There seem to be a number of jobs where the subtasks are all green but the main task is red. Hopefully this was just a network problem with the Jenkins server that has since resolved, or something like that, but if it recurs, we'll definitely want to enlist @nodejs/build folks to take a look.

@sam-github
Contributor

https://ci.nodejs.org/job/node-daily-master/ is also useful as evidence of build stability: it's had one green in the last 14 days. If master isn't passing tests in CI, that'll show up in PRs.

@sam-github
Contributor

And fwiw, @AshCripps has been hacking a bit on @joyeecheung's ncu-ci utility, to add support for reporting on master. There are always going to be some failures on PRs, so separating PR-related failures from unrelated ones might require human judgement, but master build failures are instability in the tests or CI infrastructure pretty much by definition.

@sam-github
Contributor

openjs-foundation/summit#213 (comment) --- Perhaps worth having a community corner discussion on this? Though I see there's been a bunch of test fixes in the last few days.

@joaocgreis
Member

Just opened #30848 to mark a few tests as flaky, so we should see less red and more yellow. Getting to see more green is the next step, but it's not as simple, so it will take longer.

@BridgeAR
Member Author

Just a heads up: recently a few flakes were looked at and we are back to 12 passing out of 100. That is significantly better. I'll close this as soon as we reach 20 out of 100. We should probably try to reach at least 25 passing though.
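
For illustration only, the thresholds above can be expressed as a simple check over the most recent runs. This is a minimal sketch, assuming the run results are available as a list of booleans; the function name `pass_count`, the constant names, and the sample data are hypothetical and are not taken from the CI health dashboard or any existing tooling.

```python
def pass_count(results, window=100):
    """Count passing runs among the most recent `window` results."""
    recent = results[-window:]
    return sum(1 for ok in recent if ok)


# Thresholds as stated in the comment above.
CLOSE_THRESHOLD = 20   # close the issue at 20 passing out of 100
TARGET_THRESHOLD = 25  # the level that should ideally be reached

# Hypothetical sample data: 12 passing runs out of the last 100.
runs = [True] * 12 + [False] * 88

passing = pass_count(runs)
can_close = passing >= CLOSE_THRESHOLD
target_met = passing >= TARGET_THRESHOLD
```

With the sample data, neither threshold is met, which matches the state described at this point in the thread.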

@BridgeAR BridgeAR added the help wanted Issues that need assistance from volunteers or PRs that need help to proceed. label Dec 25, 2019
@BridgeAR
Member Author

Thanks a lot to everyone who looked into fixing some of these flaky tests by the way!

@BridgeAR
Member Author

The CI is back to a "normal" level of flakes.


4 participants