[ 
https://issues.apache.org/jira/browse/HBASE-13574?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14514275#comment-14514275
 ] 

Andrey Stepachev commented on HBASE-13574:
------------------------------------------

It seems the test is broken because of the new create retry policy in Hadoop 2.6.
The NameNode proxy is now created with a custom RetryPolicy for
AlreadyBeingCreatedException, which
implies that this operation can block for up to
HdfsConstants.LEASE_SOFTLIMIT_PERIOD (60 seconds).
So we should expect that create may run for at least this long if the file
already exists.
For now we can fix that by hardcoding greater timeouts in HBaseFsck (currently
they
are 15 seconds in the test and 30 seconds in HBaseFsck itself).
This change should not break production usage of HBaseFsck, except that it will
take longer to
find out that another instance of HBaseFsck is running.
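
To illustrate the timing problem described above, here is a minimal JDK-only sketch. It is not HBaseFsck or HDFS code: the class name, method names, and result strings are hypothetical, and the durations are scaled down to milliseconds so that it runs quickly. It assumes a create call that keeps retrying for a fixed "soft limit" before reporting that the lock file is held, and a caller that waits on it with its own timeout.

```java
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

public class CreateTimeoutSketch {

    /**
     * Hypothetical stand-in for a create() call on an existing lock file:
     * it "retries" for softLimitMs and then reports that the lock is held.
     */
    static Callable<Boolean> blockedCreate(long softLimitMs) {
        return () -> {
            Thread.sleep(softLimitMs); // simulates the retry window
            return false;              // lock file is still owned by someone else
        };
    }

    /**
     * Waits for the simulated create with a caller-side timeout.
     * Returns "ACQUIRED", "LOCK_HELD", or "TIMED_OUT".
     */
    static String tryLock(long softLimitMs, long callerTimeoutMs) throws Exception {
        ExecutorService pool = Executors.newSingleThreadExecutor();
        try {
            Future<Boolean> result = pool.submit(blockedCreate(softLimitMs));
            try {
                boolean acquired = result.get(callerTimeoutMs, TimeUnit.MILLISECONDS);
                return acquired ? "ACQUIRED" : "LOCK_HELD";
            } catch (TimeoutException e) {
                // The caller gave up before the create call could answer.
                result.cancel(true);
                return "TIMED_OUT";
            }
        } finally {
            pool.shutdownNow();
        }
    }

    public static void main(String[] args) throws Exception {
        // Caller timeout shorter than the retry window: no answer in time.
        System.out.println(tryLock(200, 100));
        // Caller timeout longer than the retry window: a definite answer.
        System.out.println(tryLock(200, 1000));
    }
}
```

In this sketch, a caller timeout shorter than the retry window never sees the outcome of the create call, which is the analogue of a 15 or 30 second timeout against a 60 second retry window.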

> Broken TestHBaseFsck in master
> ------------------------------
>
>                 Key: HBASE-13574
>                 URL: https://issues.apache.org/jira/browse/HBASE-13574
>             Project: HBase
>          Issue Type: Bug
>          Components: test
>    Affects Versions: 2.0.0
>            Reporter: Andrey Stepachev
>            Priority: Critical
>
> Got the following exception, and it is reproducible (I can see it in recent test 
> runs from other patches).
> {noformat}
> Running org.apache.hadoop.hbase.util.TestHBaseFsck
> Tests run: 51, Failures: 0, Errors: 1, Skipped: 1, Time elapsed: 348.628 sec <<< FAILURE! - in org.apache.hadoop.hbase.util.TestHBaseFsck
> testParallelWithRetriesHbck(org.apache.hadoop.hbase.util.TestHBaseFsck)  Time elapsed: 30.052 sec  <<< ERROR!
> java.util.concurrent.ExecutionException: java.io.IOException: Duplicate hbck - Abort
>       at java.util.concurrent.FutureTask.report(FutureTask.java:122)
>       at java.util.concurrent.FutureTask.get(FutureTask.java:188)
>       at org.apache.hadoop.hbase.util.TestHBaseFsck.testParallelWithRetriesHbck(TestHBaseFsck.java:634)
> Caused by: java.io.IOException: Duplicate hbck - Abort
>       at org.apache.hadoop.hbase.util.HBaseFsck.connect(HBaseFsck.java:473)
>       at org.apache.hadoop.hbase.util.hbck.HbckTestingUtil.doFsck(HbckTestingUtil.java:53)
>       at org.apache.hadoop.hbase.util.hbck.HbckTestingUtil.doFsck(HbckTestingUtil.java:43)
>       at org.apache.hadoop.hbase.util.hbck.HbckTestingUtil.doFsck(HbckTestingUtil.java:38)
>       at org.apache.hadoop.hbase.util.TestHBaseFsck$2RunHbck.call(TestHBaseFsck.java:625)
>       at org.apache.hadoop.hbase.util.TestHBaseFsck$2RunHbck.call(TestHBaseFsck.java:621)
>       at java.util.concurrent.FutureTask.run(FutureTask.java:262)
>       at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>       at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>       at java.lang.Thread.run(Thread.java:745)
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
