[jira] [Commented] (HBASE-14362) org.apache.hadoop.hbase.master.procedure.TestWALProcedureStoreOnHDFS is super duper flaky

Heng Chen (JIRA) Mon, 28 Sep 2015 18:00:13 -0700

    [ 
https://issues.apache.org/jira/browse/HBASE-14362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14934427#comment-14934427
 ]


Heng Chen commented on HBASE-14362:
-----------------------------------

If i set param 
{code}
    conf.setInt("hbase.procedure.store.wal.max.roll.retries", 1);
    conf.setInt("hbase.procedure.store.wal.sync.failure.roll.max", 1);
{code}

This issue will be reproduced locally.  

I think the reason is, as original logic  
{code}
 store.insert(new TestProcedure(i, -1), null);
 waitForNumReplicas(3);
{code}

after we restart dn,  we insert immediately,  when dn is not started fully, the 
testcase will failed.

So we use {{waitForNumReplicas(3)}} before insert,  in 
{{waitForNumReplicas(3)}} it will wait dn start fully. 

after do that,  the testcase will not failed again locally.



> org.apache.hadoop.hbase.master.procedure.TestWALProcedureStoreOnHDFS is super 
> duper flaky
> -----------------------------------------------------------------------------------------
>
>                 Key: HBASE-14362
>                 URL: https://issues.apache.org/jira/browse/HBASE-14362
>             Project: HBase
>          Issue Type: Bug
>          Components: test
>    Affects Versions: 2.0.0
>            Reporter: Dima Spivak
>            Priority: Critical
>         Attachments: HBASE-14362.patch
>
>
> [As seen in 
> Jenkins|https://builds.apache.org/job/HBase-TRUNK/lastCompletedBuild/testReport/org.apache.hadoop.hbase.master.procedure/TestWALProcedureStoreOnHDFS/history/],
>  this test has been super flaky and we should probably address it.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-14362) org.apache.hadoop.hbase.master.procedure.TestWALProcedureStoreOnHDFS is super duper flaky

Reply via email to