[jira] [Commented] (HBASE-25389) [Flakey Tests] branch-2 TestMetaShutdownHandler

Hudson (Jira) Wed, 16 Dec 2020 09:53:16 -0800


    [ 
https://issues.apache.org/jira/browse/HBASE-25389?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17250498#comment-17250498
 ]


Hudson commented on HBASE-25389:
--------------------------------

Results for branch branch-2
        [build #129 on 
builds.a.o|https://ci-hadoop.apache.org/job/HBase/job/HBase%20Nightly/job/branch-2/129/]:
 (/) *{color:green}+1 overall{color}*
----
details (if available):

(/) {color:green}+1 general checks{color}
-- For more information [see general 
report|https://ci-hadoop.apache.org/job/HBase/job/HBase%20Nightly/job/branch-2/129/General_20Nightly_20Build_20Report/]




(/) {color:green}+1 jdk8 hadoop2 checks{color}
-- For more information [see jdk8 (hadoop2) 
report|https://ci-hadoop.apache.org/job/HBase/job/HBase%20Nightly/job/branch-2/129/JDK8_20Nightly_20Build_20Report_20_28Hadoop2_29/]


(/) {color:green}+1 jdk8 hadoop3 checks{color}
-- For more information [see jdk8 (hadoop3) 
report|https://ci-hadoop.apache.org/job/HBase/job/HBase%20Nightly/job/branch-2/129/JDK8_20Nightly_20Build_20Report_20_28Hadoop3_29/]


(/) {color:green}+1 jdk11 hadoop3 checks{color}
-- For more information [see jdk11 
report|https://ci-hadoop.apache.org/job/HBase/job/HBase%20Nightly/job/branch-2/129/JDK11_20Nightly_20Build_20Report_20_28Hadoop3_29/]


(/) {color:green}+1 source release artifact{color}
-- See build output for details.


(/) {color:green}+1 client integration test{color}


> [Flakey Tests] branch-2 TestMetaShutdownHandler
> -----------------------------------------------
>
>                 Key: HBASE-25389
>                 URL: https://issues.apache.org/jira/browse/HBASE-25389
>             Project: HBase
>          Issue Type: Task
>          Components: flakies
>            Reporter: Michael Stack
>            Assignee: Michael Stack
>            Priority: Major
>             Fix For: 3.0.0-alpha-1, 2.5.0, 2.4.1
>
>
> I see this in local runs fail regularly. We kill the server hosting meta and 
> then check it came up in a new location after waiting on recovery. In the 
> test, when it fails, the assert on new location fails because we have not 
> waited for the CRASH to happen. Here is excerpt from log:
> {code}
>  2020-12-11 13:20:27,298 INFO  [Listener at localhost/62149] 
> master.TestMetaShutdownHandler(111): Deleted the znode for the RegionServer 
> hosting hbase:meta; waiting on SSH
> ...
>  2020-12-11 13:20:27,310 INFO  [Listener at localhost/62149] 
> master.TestMetaShutdownHandler(122): Past wait on RIT
> ...
>  2020-12-11 13:20:27,351 DEBUG [RegionServerTracker-0] 
> procedure2.ProcedureExecutor(1048): Stored pid=9, 
> state=RUNNABLE:SERVER_CRASH_START; ServerCrashProcedure 
> stack.XXX.example.com,62201,1607721618377, splitWal=true, meta=true
> {code}
> The first line is where we remove the ephemeral node for the regionserver 
> carrying hbase:meta. The second line is supposed to log AFTER SCP is done (it 
> calls it SSH in this old test above). Notice how the 3rd line, after the 2nd, 
> is first mention of SCP being queued.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

[jira] [Commented] (HBASE-25389) [Flakey Tests] branch-2 TestMetaShutdownHandler

Reply via email to