[ https://issues.apache.org/jira/browse/HBASE-13574?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14514275#comment-14514275 ]
Andrey Stepachev commented on HBASE-13574: ------------------------------------------ It seems that test is broken due of the new create retry policy in hadoop 2.6. Namenode proxy now created with custom RetryPolicy for AlreadyBeingCreatedException which is implies timeout on this operations up to HdfsConstants.LEASE_SOFTLIMIT_PERIOD (60seconds). So we should expect that create can execute as minimum as this constant if file already exists. So for now we can fix that by hardcoding greater timeouts to HBaseFsck (for now they are 15 seconds in test and 30seconds in HbaseFsck itself). This change should not break production usage of HBaseFsck, except of it will take longer to find out that other instance of HbaseFsck is running. > Broken TestHBaseFsck in master > ------------------------------ > > Key: HBASE-13574 > URL: https://issues.apache.org/jira/browse/HBASE-13574 > Project: HBase > Issue Type: Bug > Components: test > Affects Versions: 2.0.0 > Reporter: Andrey Stepachev > Priority: Critical > > Got following exception and it reproducible (I can see it in recent tests > runs from other patches). > {noformat} > Running org.apache.hadoop.hbase.util.TestHBaseFsck > Tests run: 51, Failures: 0, Errors: 1, Skipped: 1, Time elapsed: 348.628 sec > <<< FAILURE! - in org.apache.hadoop.hbase.util.TestHBaseFsck > testParallelWithRetriesHbck(org.apache.hadoop.hbase.util.TestHBaseFsck) Time > elapsed: 30.052 sec <<< ERROR! > java.util.concurrent.ExecutionException: java.io.IOException: Duplicate hbck > - Abort > at java.util.concurrent.FutureTask.report(FutureTask.java:122) > at java.util.concurrent.FutureTask.get(FutureTask.java:188) > at > org.apache.hadoop.hbase.util.TestHBaseFsck.testParallelWithRetriesHbck(TestHBaseFsck.java:634) > Caused by: java.io.IOException: Duplicate hbck - Abort > at org.apache.hadoop.hbase.util.HBaseFsck.connect(HBaseFsck.java:473) > at > org.apache.hadoop.hbase.util.hbck.HbckTestingUtil.doFsck(HbckTestingUtil.java:53) > at > org.apache.hadoop.hbase.util.hbck.HbckTestingUtil.doFsck(HbckTestingUtil.java:43) > at > org.apache.hadoop.hbase.util.hbck.HbckTestingUtil.doFsck(HbckTestingUtil.java:38) > at > org.apache.hadoop.hbase.util.TestHBaseFsck$2RunHbck.call(TestHBaseFsck.java:625) > at > org.apache.hadoop.hbase.util.TestHBaseFsck$2RunHbck.call(TestHBaseFsck.java:621) > at java.util.concurrent.FutureTask.run(FutureTask.java:262) > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) > at java.lang.Thread.run(Thread.java:745) > {noformat} -- This message was sent by Atlassian JIRA (v6.3.4#6332)