[
https://issues.apache.org/jira/browse/HDFS-10985?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Mingliang Liu updated HDFS-10985:
---------------------------------
Description: {{TestZKFailoverController#testGracefulFailoverMultipleZKfcs}}
uses fixed time sleep before assertions. This may fail sometimes though 10
seconds are generally long enough. I think we can use
{{GenericTestUtils.waitFor()}} to retry the assertions. (was: h6. Error Message
{quote}
Unable to become active. Local node did not get an opportunity to do so from
ZooKeeper, or the local node took too long to transition to active.
{quote}
h6. Stacktrace
{quote}
org.apache.hadoop.ha.ServiceFailedException: Unable to become active. Local
node did not get an opportunity to do so from ZooKeeper, or the local node took
too long to transition to active.
at
org.apache.hadoop.ha.ZKFailoverController.doGracefulFailover(ZKFailoverController.java:690)
at
org.apache.hadoop.ha.ZKFailoverController.access$400(ZKFailoverController.java:62)
at
org.apache.hadoop.ha.ZKFailoverController$3.run(ZKFailoverController.java:607)
at
org.apache.hadoop.ha.ZKFailoverController$3.run(ZKFailoverController.java:604)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:422)
at
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1795)
at
org.apache.hadoop.ha.ZKFailoverController.gracefulFailoverToYou(ZKFailoverController.java:604)
at
org.apache.hadoop.ha.ZKFCRpcServer.gracefulFailover(ZKFCRpcServer.java:94)
at
org.apache.hadoop.ha.TestZKFailoverController.testGracefulFailoverMultipleZKfcs(TestZKFailoverController.java:590)
{quote}
See recent failing build:
#
https://builds.apache.org/job/PreCommit-HADOOP-Build/10705/testReport/org.apache.hadoop.ha/TestZKFailoverController/testGracefulFailoverMultipleZKfcs/
# to add more
I think we can use {{GenericTestUtils.waitFor()}} to retry the assertions.)
> o.a.h.ha.TestZKFailoverController#testGracefulFailoverMultipleZKfcs fails
> intermittently
> ----------------------------------------------------------------------------------------
>
> Key: HDFS-10985
> URL: https://issues.apache.org/jira/browse/HDFS-10985
> Project: Hadoop HDFS
> Issue Type: Sub-task
> Components: ha, test
> Reporter: Mingliang Liu
> Assignee: Mingliang Liu
> Attachments: HDFS-10985.000.patch
>
>
> {{TestZKFailoverController#testGracefulFailoverMultipleZKfcs}} uses fixed
> time sleep before assertions. This may fail sometimes though 10 seconds are
> generally long enough. I think we can use {{GenericTestUtils.waitFor()}} to
> retry the assertions.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]