-
Notifications
You must be signed in to change notification settings - Fork 1.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix unit test testFailsHealthOnHungIOBeyondHealthyTimeout() by incresing the max waiting time before assertion #1692
Conversation
…toring the FS status in the test Signed-off-by: Tianli Feng <[email protected]>
Signed-off-by: Tianli Feng <[email protected]>
Can one of the admins verify this patch? |
Hi @Bukhtawar, I would like to hear your opinion on this change. 😁 Thank you. |
I'm going to merge it since it visibly reduces the amount of test failures. |
This failed to backport automatically in https://github.com/opensearch-project/OpenSearch/runs/4486636906?check_suite_focus=true and will need to be done by hand, please, cc: @tlfeng. |
Hi @dblock, I realized that the PR #1167 that introduced the test |
@tlfeng can you confirm if the test fixes have been merged to 1.x? I tried pulling in these changes as well |
Description
Try to fix the unit test
testFailsHealthOnHungIOBeyondHealthyTimeout()
in classFsHealthServiceTests
, according to the idea in comment #1567 (comment)2x
multiplier to3x
that applied torefreshInterval
.Note:
1
The test failure can not be reproduced easily locally, and only sometime shows up in the CI workflow. I guess the reason of the test failure is there are dozens of file directories have to be restoring from the IO hanging status in the below step,
OpenSearch/server/src/test/java/org/opensearch/monitor/fs/FsHealthServiceTests.java
Lines 237 to 238 in 538b40a
but the max waiting time in below code is not enough for all those file directories restored to a healthy state.
OpenSearch/server/src/test/java/org/opensearch/monitor/fs/FsHealthServiceTests.java
Line 242 in 538b40a
2
Before removing the duplicate assertion statement:
After removing, the error message can be more intuitive:
Issues Resolved
#1567 - I wish it could be resolved after the PR.
#1307 and #1450 - These 2 issues reporting the same test failure as #1567.
Check List
By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.