hemantk-12 opened a new pull request, #4749:
URL: https://github.com/apache/ozone/pull/4749
## What changes were proposed in this pull request?
`TestOzoneManagerHASnapshot` intermittently fails because the DB is closed
after Ratis reports that the OM DB is in an inconsistent state.
```
Error: org.apache.hadoop.ozone.om.TestOzoneManagerHASnapshot.testSnapshotNameConsistency Time elapsed: 3.094 s <<< ERROR!
java.lang.RuntimeException: java.io.IOException: Rocks Database is closed
    at org.apache.hadoop.ozone.om.TestOzoneManagerHASnapshot.lambda$testSnapshotNameConsistency$0(TestOzoneManagerHASnapshot.java:76)
    at org.awaitility.core.CallableCondition$ConditionEvaluationWrapper.eval(CallableCondition.java:99)
    at org.awaitility.core.ConditionAwaiter$ConditionPoller.call(ConditionAwaiter.java:248)
    at org.awaitility.core.ConditionAwaiter$ConditionPoller.call(ConditionAwaiter.java:235)
    at java.util.concurrent.FutureTask.run(FutureTask.java:266)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
    at java.lang.Thread.run(Thread.java:750)
    Suppressed: org.apache.ratis.util.ExitUtils$ExitException: Cannot load OM DB as it is in an inconsistent state.
        at org.apache.ratis.util.ExitUtils.terminate(ExitUtils.java:141)
        at org.apache.ratis.util.ExitUtils.terminate(ExitUtils.java:151)
        at org.apache.ratis.util.ExitUtils.terminate(ExitUtils.java:155)
```
`TestOzoneManagerHASnapshot` extends `TestOzoneManagerHA` and uses the cluster
created by the base class, which is initialized statically in `@BeforeAll`.
Although the tests do not run in parallel, it is possible that one test
interferes with another through the shared cluster and causes the inconsistency.
This change creates a dedicated mini cluster for `TestOzoneManagerHASnapshot`
instead of using the inherited one.
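
The interference described above can be modelled with a small, self-contained sketch. It does not use the Ozone or Ratis APIs: `FakeCluster`, `BaseSuite`, `InheritingSuite`, `IsolatedSuite`, and `ClusterIsolationDemo` are hypothetical names made up for illustration, and the "closed database" is only simulated with a flag. It assumes the failure mode is that some other test leaves the shared, statically initialized cluster unusable.

```java
import java.util.concurrent.atomic.AtomicBoolean;

/** Hypothetical stand-in for a mini cluster; this is NOT the Ozone API. */
final class FakeCluster implements AutoCloseable {
  private final AtomicBoolean open = new AtomicBoolean(true);

  /** Simulates a DB read that fails once the cluster has been closed. */
  String read(String key) {
    if (!open.get()) {
      throw new IllegalStateException("Database is closed");
    }
    return "value-of-" + key;
  }

  @Override
  public void close() {
    open.set(false);
  }
}

/** Base suite: one cluster shared by all subclasses through a static field. */
class BaseSuite {
  static final FakeCluster SHARED = new FakeCluster();

  FakeCluster cluster() {
    return SHARED;
  }
}

/** Subclass that keeps using the inherited, shared cluster. */
class InheritingSuite extends BaseSuite {
}

/** Subclass that owns its own cluster, independent of the shared one. */
class IsolatedSuite extends BaseSuite {
  private final FakeCluster own = new FakeCluster();

  @Override
  FakeCluster cluster() {
    return own;
  }
}

public class ClusterIsolationDemo {
  /** Returns true if the suite can still read from its cluster. */
  static boolean canRead(BaseSuite suite) {
    try {
      suite.cluster().read("snap1");
      return true;
    } catch (IllegalStateException e) {
      return false;
    }
  }

  public static void main(String[] args) {
    InheritingSuite inheriting = new InheritingSuite();
    IsolatedSuite isolated = new IsolatedSuite();

    // Simulate another test leaving the shared cluster unusable.
    BaseSuite.SHARED.close();

    System.out.println("inheriting suite can read: " + canRead(inheriting)); // false
    System.out.println("isolated suite can read: " + canRead(isolated));     // true
  }
}
```

The trade-off is that a dedicated cluster costs extra startup time, but the suite no longer depends on the state other tests leave behind.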
## What is the link to the Apache JIRA
https://issues.apache.org/jira/browse/HDDS-8645
## How was this patch tested?
Ran this test 25 times in a workflow, and it passed every time.
Run:
https://github.com/hemantk-12/ozone/actions/runs/5019667607/jobs/9000405979
I tried to run it 50 times too, but that run failed due to disk space:
https://github.com/hemantk-12/ozone/actions/runs/5019375127/jobs/8999791082
CI passed on the first attempt on my fork:
https://github.com/hemantk-12/ozone/actions/runs/5019860313/jobs/9000784799