-
Notifications
You must be signed in to change notification settings - Fork 588
HDDS-8374. Fix Flaky Unit Test for TestContainerStateCounts in Recon #4534
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
adoroszlai
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks @ArafatKhan2198 for the fix. I'll try repeated runs. In the meantime, please fix checkstyle error.
| // then container is not added until the next container report is processed | ||
| StorageContainerDatanodeProtocolProtos.SCMHeartbeatRequestProto | ||
| heartbeatRequestProto = | ||
| StorageContainerDatanodeProtocolProtos.SCMHeartbeatRequestProto.newBuilder() |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
hadoop-ozone/recon/src/test/java/org/apache/hadoop/ozone/recon/api/TestContainerStateCounts.java
406: Line is longer than 80 characters (found 84).
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for pointing it out. For some reason the ./checkStyle script in dev tools did not find it out. But I have changed it
|
@ArafatKhan2198 seems like the test failed in the first run: |
My apologise @adoroszlai. By mistake, I reverted back to the previous value after running the tests successfully. Please run the tests now. I have already run them a total of 20 times, and they have passed. |
|
Thanks @ArafatKhan2198 for the update, but the test is still flaky, failed in 14/100 runs: https://github.com/adoroszlai/hadoop-ozone/actions/runs/4610115793 |
|
Thanks @ArafatKhan2198 for updating the patch. Have you run repeated tests in CI? Can you please share the link to the CI run? |
|
Thanks @ArafatKhan2198 for updating the patch and for repeated runs, the results look good. I'll defer review of the actual fix to folks more familiar with Recon. |
|
@devmadhuu @adoroszlai Can you please take a look |
smengcl
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks @ArafatKhan2198 . lgtm
9d251f6 to
4fb868f
Compare
|
Please try to avoid force-push when updating the PR. Here are some great articles that explain why: https://developers.mattermost.com/blog/submitting-great-prs/#4-avoid-force-pushing |
|
@ArafatKhan2198 I may be missing something, but the latest commit 3fb721a doesn't seem to be related to the fix. |
Hi @adoroszlai |
It would be better to avoid incorporating unrelated changes.
|
Understood, I have removed those changes. |

What changes were proposed in this pull request?
The testDeletedContainerCount() method of TestContainerStateCounts is considered flaky due to the processing order of the container report. If the pipeline does not exist, then the container is not added until the next container report is processed.
What is the link to the Apache JIRA
https://issues.apache.org/jira/browse/HDDS-8374
How was this patch tested?
Ran the UT several times to make sure it's not flaky