errose28 commented on PR #3534:
URL: https://github.com/apache/ozone/pull/3534#issuecomment-1165872080
Looks like there is an intermittent failure in
`TestScmHaFinalization#testSnapshotFinalization`. However, it is the leader,
not the SCM that finalized from a snapshot, that hit the error:
```
2022-06-24 00:37:59,413 [Listener at 127.0.0.1/43961] INFO
upgrade.TestScmHAFinalization
(TestScmHAFinalization.java:testSnapshotFinalization(251)) - Inactive SCM node
ID: scmNode-3
...
2022-06-24 00:29:26,723 [Listener at 0.0.0.0/35361] INFO
upgrade.TestHddsUpgradeUtils
(TestHddsUpgradeUtils.java:lambda$waitForFinalization$0(76)) - Waiting for
upgrade finalization to complete from client. Current status is
FINALIZATION_DONE.
2022-06-24 00:29:26,725 [Listener at 0.0.0.0/35361] INFO
upgrade.TestScmHAFinalization
(TestScmHAFinalization.java:lambda$waitForScmToFinalize$2(298)) - Waiting for
SCM scmNode-2 (leader? false) to finalize. Current finalization checkpoint is
FINALIZATION_COMPLETE
2022-06-24 00:29:26,725 [Listener at 0.0.0.0/35361] INFO
upgrade.TestScmHAFinalization
(TestScmHAFinalization.java:lambda$waitForScmToFinalize$2(298)) - Waiting for
SCM scmNode-3 (leader? false) to finalize. Current finalization checkpoint is
FINALIZATION_COMPLETE
2022-06-24 00:29:26,725 [Listener at 0.0.0.0/35361] INFO
upgrade.TestScmHAFinalization
(TestScmHAFinalization.java:lambda$waitForScmToFinalize$2(298)) - Waiting for
SCM scmNode-1 (leader? true) to finalize. Current finalization checkpoint is
FINALIZATION_COMPLETE
2022-06-24 00:29:26,725 [Listener at 0.0.0.0/35361] INFO
upgrade.TestHddsUpgradeUtils
(TestHddsUpgradeUtils.java:testPostUpgradeConditionsSCM(103)) - Testing post
upgrade conditions on SCM with node ID: scmNode-2
2022-06-24 00:29:26,728 [Listener at 0.0.0.0/35361] INFO
upgrade.TestHddsUpgradeUtils
(TestHddsUpgradeUtils.java:testPostUpgradeConditionsSCM(103)) - Testing post
upgrade conditions on SCM with node ID: scmNode-3
2022-06-24 00:29:26,729 [Listener at 0.0.0.0/35361] INFO
upgrade.TestHddsUpgradeUtils
(TestHddsUpgradeUtils.java:testPostUpgradeConditionsSCM(103)) - Testing post
upgrade conditions on SCM with node ID: scmNode-1
2022-06-24 00:29:26,739 [Listener at 0.0.0.0/35361] INFO
ozone.MiniOzoneClusterImpl (MiniOzoneClusterImpl.java:shutdown(448)) - Shutting
down the Mini Ozone Cluster
```
The error is
```
org.apache.hadoop.hdds.upgrade.TestScmHAFinalization.testSnapshotFinalization
Time elapsed: 73.328 s <<< FAILURE!
java.lang.AssertionError
at org.junit.Assert.fail(Assert.java:87)
at org.junit.Assert.assertTrue(Assert.java:42)
at org.junit.Assert.assertTrue(Assert.java:53)
at
org.apache.hadoop.hdds.upgrade.TestHddsUpgradeUtils.testPostUpgradeConditionsSCM(TestHddsUpgradeUtils.java:112)
at
org.apache.hadoop.hdds.upgrade.TestHddsUpgradeUtils.testPostUpgradeConditionsSCM(TestHddsUpgradeUtils.java:105)
at
org.apache.hadoop.hdds.upgrade.TestScmHAFinalization.testSnapshotFinalization(TestScmHAFinalization.java:266)
```
Which indicates the leader scmNode-1 did not cross the
`FINALIZATION_COMPLETE` checkpoint in this check, even though the logs show
that it already had earlier. I will need investigate this.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]