[ https://issues.apache.org/jira/browse/HDFS-15032?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16992719#comment-16992719 ]
Erik Krogen edited comment on HDFS-15032 at 12/10/19 4:35 PM:
--------------------------------------------------------------

Thanks for the info [~shv], good to know. I've removed the {{toString()}} stuff in v5.

After seeing the Jenkins failure, I experimented and found that the new test timed out half of the time (5 of 10 runs) when run on my machine, but it succeeded every time when I increased the timeout to 2 minutes. I think it just needs longer since there is more overhead involved with the failure handling. To avoid spurious failures, I increased the timeout for the failure test to 3 minutes, and for the non-failure observer test to 2 minutes.
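For readers who want to reproduce this kind of check, the experiment described in the comment (run the same test repeatedly under a fixed timeout and count how many runs exceed it) can be sketched roughly as below. This is not code from the HDFS-15032 patch: the class {{TimeoutExperiment}} and the method {{countTimeouts}} are hypothetical names, the sleeping task only stands in for the real test, and the millisecond values are scaled-down placeholders rather than the 2 and 3 minute timeouts mentioned above.

```java
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

// Hypothetical helper for illustration; not part of Hadoop or the patch.
public class TimeoutExperiment {

  /** Runs the task the given number of times; returns how many runs exceeded the timeout. */
  public static int countTimeouts(Callable<Void> task, int runs, long timeoutMs)
      throws InterruptedException, ExecutionException {
    ExecutorService executor = Executors.newSingleThreadExecutor();
    int timedOut = 0;
    try {
      for (int i = 0; i < runs; i++) {
        Future<Void> future = executor.submit(task);
        try {
          // Wait for this run, but no longer than the timeout under test.
          future.get(timeoutMs, TimeUnit.MILLISECONDS);
        } catch (TimeoutException e) {
          timedOut++;
          future.cancel(true); // interrupt the run that is still in progress
        }
      }
    } finally {
      executor.shutdownNow();
    }
    return timedOut;
  }

  public static void main(String[] args) throws Exception {
    // Stand-in for a slow test: it simply sleeps for 200 ms.
    Callable<Void> slowTask = () -> {
      Thread.sleep(200);
      return null;
    };
    System.out.println("timeouts with 50 ms limit: " + countTimeouts(slowTask, 3, 50));
    System.out.println("timeouts with 2000 ms limit: " + countTimeouts(slowTask, 3, 2000));
  }
}
```

Comparing the count at the current limit with the count at a more generous one gives a quick indication of whether failures come from the limit being too tight rather than from the code under test.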
> Balancer crashes when it fails to contact an unavailable NN via ObserverReadProxyProvider
> -----------------------------------------------------------------------------------------
>
> Key: HDFS-15032
> URL: https://issues.apache.org/jira/browse/HDFS-15032
> Project: Hadoop HDFS
> Issue Type: Bug
> Components: balancer & mover
> Affects Versions: 2.10.0
> Reporter: Erik Krogen
> Assignee: Erik Krogen
> Priority: Major
> Attachments: HDFS-15032.000.patch, HDFS-15032.001.patch, HDFS-15032.002.patch, HDFS-15032.003.patch, HDFS-15032.004.patch, HDFS-15032.005.patch, debugger_with_tostring.png, debugger_without_tostring.png
>
> When trying to run the Balancer using ObserverReadProxyProvider (to allow it to read from the Observer Node as described in HDFS-14979), if one of the NNs isn't running, the Balancer will crash.

--
This message was sent by Atlassian Jira (v8.3.4#803005)