[jira] [Commented] (HBASE-21334) TestMergeTableRegionsProcedure is flakey
[ https://issues.apache.org/jira/browse/HBASE-21334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16671508#comment-16671508 ] Duo Zhang commented on HBASE-21334: --- OK finally I got the stack trace... {noformat} 2018-10-31 16:22:01,302 ERROR [Time-limited test] assignment.TestMergeTableRegionsProcedure(305): error! java.lang.NullPointerException at org.apache.hadoop.hbase.master.assignment.MergeTableRegionsProcedure.getStateId(MergeTableRegionsProcedure.java:386) at org.apache.hadoop.hbase.master.assignment.MergeTableRegionsProcedure.getStateId(MergeTableRegionsProcedure.java:84) at org.apache.hadoop.hbase.procedure2.StateMachineProcedure.getCurrentStateId(StateMachineProcedure.java:276) at org.apache.hadoop.hbase.master.procedure.MasterProcedureTestingUtility.testRecoveryAndDoubleExecution(MasterProcedureTestingUtility.java:414) at org.apache.hadoop.hbase.master.assignment.TestMergeTableRegionsProcedure.testMergeWithoutPONR(TestMergeTableRegionsProcedure.java:296) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) at java.lang.reflect.Method.invoke(Method.java:498) at org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:50) at org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12) at org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:47) at org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17) at org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26) at org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27) at org.junit.rules.TestWatcher$1.evaluate(TestWatcher.java:55) at org.junit.rules.RunRules.evaluate(RunRules.java:20) at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:325) at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:78) at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:57) at org.junit.runners.ParentRunner$3.run(ParentRunner.java:290) at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:71) at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:288) at org.junit.runners.ParentRunner.access$000(ParentRunner.java:58) at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:268) at org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26) at org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27) at org.junit.internal.runners.statements.FailOnTimeout$CallableStatement.call(FailOnTimeout.java:298) at org.junit.internal.runners.statements.FailOnTimeout$CallableStatement.call(FailOnTimeout.java:292) at java.util.concurrent.FutureTask.run(FutureTask.java:266) at java.lang.Thread.run(Thread.java:748) {noformat} > TestMergeTableRegionsProcedure is flakey > > > Key: HBASE-21334 > URL: https://issues.apache.org/jira/browse/HBASE-21334 > Project: HBase > Issue Type: Bug > Components: amv2, proc-v2, test >Reporter: Duo Zhang >Assignee: Duo Zhang >Priority: Major > Fix For: 3.0.0, 2.2.0, 2.1.1, 2.0.3 > > Attachments: HBASE-21334.patch, > org.apache.hadoop.hbase.master.assignment.TestMergeTableRegionsProcedure-output.txt > > > {noformat} > Error Message > found 5 corrupted procedure(s) on replay > Stacktrace > java.io.IOException: found 5 corrupted procedure(s) on replay > at > org.apache.hadoop.hbase.master.assignment.TestMergeTableRegionsProcedure.testMergeWithoutPONR(TestMergeTableRegionsProcedure.java:295) > {noformat} -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (HBASE-21334) TestMergeTableRegionsProcedure is flakey
[ https://issues.apache.org/jira/browse/HBASE-21334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16659150#comment-16659150 ] Hudson commented on HBASE-21334: Results for branch master [build #561 on builds.a.o|https://builds.apache.org/job/HBase%20Nightly/job/master/561/]: (x) *{color:red}-1 overall{color}* details (if available): (/) {color:green}+1 general checks{color} -- For more information [see general report|https://builds.apache.org/job/HBase%20Nightly/job/master/561//General_Nightly_Build_Report/] (x) {color:red}-1 jdk8 hadoop2 checks{color} -- For more information [see jdk8 (hadoop2) report|https://builds.apache.org/job/HBase%20Nightly/job/master/561//JDK8_Nightly_Build_Report_(Hadoop2)/] (x) {color:red}-1 jdk8 hadoop3 checks{color} -- For more information [see jdk8 (hadoop3) report|https://builds.apache.org/job/HBase%20Nightly/job/master/561//JDK8_Nightly_Build_Report_(Hadoop3)/] (/) {color:green}+1 source release artifact{color} -- See build output for details. (x) {color:red}-1 client integration test{color} -- Something went wrong with this stage, [check relevant console output|https://builds.apache.org/job/HBase%20Nightly/job/master/561//console]. > TestMergeTableRegionsProcedure is flakey > > > Key: HBASE-21334 > URL: https://issues.apache.org/jira/browse/HBASE-21334 > Project: HBase > Issue Type: Bug > Components: amv2, proc-v2, test >Reporter: Duo Zhang >Assignee: Duo Zhang >Priority: Major > Fix For: 3.0.0, 2.2.0, 2.1.1, 2.0.3 > > Attachments: HBASE-21334.patch, > org.apache.hadoop.hbase.master.assignment.TestMergeTableRegionsProcedure-output.txt > > > {noformat} > Error Message > found 5 corrupted procedure(s) on replay > Stacktrace > java.io.IOException: found 5 corrupted procedure(s) on replay > at > org.apache.hadoop.hbase.master.assignment.TestMergeTableRegionsProcedure.testMergeWithoutPONR(TestMergeTableRegionsProcedure.java:295) > {noformat} -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (HBASE-21334) TestMergeTableRegionsProcedure is flakey
[ https://issues.apache.org/jira/browse/HBASE-21334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16658928#comment-16658928 ] Hudson commented on HBASE-21334: Results for branch branch-2 [build #1426 on builds.a.o|https://builds.apache.org/job/HBase%20Nightly/job/branch-2/1426/]: (x) *{color:red}-1 overall{color}* details (if available): (/) {color:green}+1 general checks{color} -- For more information [see general report|https://builds.apache.org/job/HBase%20Nightly/job/branch-2/1426//General_Nightly_Build_Report/] (x) {color:red}-1 jdk8 hadoop2 checks{color} -- For more information [see jdk8 (hadoop2) report|https://builds.apache.org/job/HBase%20Nightly/job/branch-2/1426//JDK8_Nightly_Build_Report_(Hadoop2)/] (/) {color:green}+1 jdk8 hadoop3 checks{color} -- For more information [see jdk8 (hadoop3) report|https://builds.apache.org/job/HBase%20Nightly/job/branch-2/1426//JDK8_Nightly_Build_Report_(Hadoop3)/] (/) {color:green}+1 source release artifact{color} -- See build output for details. (/) {color:green}+1 client integration test{color} > TestMergeTableRegionsProcedure is flakey > > > Key: HBASE-21334 > URL: https://issues.apache.org/jira/browse/HBASE-21334 > Project: HBase > Issue Type: Bug > Components: amv2, proc-v2, test >Reporter: Duo Zhang >Assignee: Duo Zhang >Priority: Major > Fix For: 3.0.0, 2.2.0, 2.1.1, 2.0.3 > > Attachments: HBASE-21334.patch, > org.apache.hadoop.hbase.master.assignment.TestMergeTableRegionsProcedure-output.txt > > > {noformat} > Error Message > found 5 corrupted procedure(s) on replay > Stacktrace > java.io.IOException: found 5 corrupted procedure(s) on replay > at > org.apache.hadoop.hbase.master.assignment.TestMergeTableRegionsProcedure.testMergeWithoutPONR(TestMergeTableRegionsProcedure.java:295) > {noformat} -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (HBASE-21334) TestMergeTableRegionsProcedure is flakey
[ https://issues.apache.org/jira/browse/HBASE-21334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16658906#comment-16658906 ] Hudson commented on HBASE-21334: Results for branch branch-2.1 [build #511 on builds.a.o|https://builds.apache.org/job/HBase%20Nightly/job/branch-2.1/511/]: (/) *{color:green}+1 overall{color}* details (if available): (/) {color:green}+1 general checks{color} -- For more information [see general report|https://builds.apache.org/job/HBase%20Nightly/job/branch-2.1/511//General_Nightly_Build_Report/] (/) {color:green}+1 jdk8 hadoop2 checks{color} -- For more information [see jdk8 (hadoop2) report|https://builds.apache.org/job/HBase%20Nightly/job/branch-2.1/511//JDK8_Nightly_Build_Report_(Hadoop2)/] (/) {color:green}+1 jdk8 hadoop3 checks{color} -- For more information [see jdk8 (hadoop3) report|https://builds.apache.org/job/HBase%20Nightly/job/branch-2.1/511//JDK8_Nightly_Build_Report_(Hadoop3)/] (/) {color:green}+1 source release artifact{color} -- See build output for details. (/) {color:green}+1 client integration test{color} > TestMergeTableRegionsProcedure is flakey > > > Key: HBASE-21334 > URL: https://issues.apache.org/jira/browse/HBASE-21334 > Project: HBase > Issue Type: Bug > Components: amv2, proc-v2, test >Reporter: Duo Zhang >Assignee: Duo Zhang >Priority: Major > Fix For: 3.0.0, 2.2.0, 2.1.1, 2.0.3 > > Attachments: HBASE-21334.patch, > org.apache.hadoop.hbase.master.assignment.TestMergeTableRegionsProcedure-output.txt > > > {noformat} > Error Message > found 5 corrupted procedure(s) on replay > Stacktrace > java.io.IOException: found 5 corrupted procedure(s) on replay > at > org.apache.hadoop.hbase.master.assignment.TestMergeTableRegionsProcedure.testMergeWithoutPONR(TestMergeTableRegionsProcedure.java:295) > {noformat} -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (HBASE-21334) TestMergeTableRegionsProcedure is flakey
[ https://issues.apache.org/jira/browse/HBASE-21334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16658908#comment-16658908 ] Hudson commented on HBASE-21334: Results for branch branch-2.0 [build #994 on builds.a.o|https://builds.apache.org/job/HBase%20Nightly/job/branch-2.0/994/]: (x) *{color:red}-1 overall{color}* details (if available): (/) {color:green}+1 general checks{color} -- For more information [see general report|https://builds.apache.org/job/HBase%20Nightly/job/branch-2.0/994//General_Nightly_Build_Report/] (/) {color:green}+1 jdk8 hadoop2 checks{color} -- For more information [see jdk8 (hadoop2) report|https://builds.apache.org/job/HBase%20Nightly/job/branch-2.0/994//JDK8_Nightly_Build_Report_(Hadoop2)/] (x) {color:red}-1 jdk8 hadoop3 checks{color} -- For more information [see jdk8 (hadoop3) report|https://builds.apache.org/job/HBase%20Nightly/job/branch-2.0/994//JDK8_Nightly_Build_Report_(Hadoop3)/] (/) {color:green}+1 source release artifact{color} -- See build output for details. > TestMergeTableRegionsProcedure is flakey > > > Key: HBASE-21334 > URL: https://issues.apache.org/jira/browse/HBASE-21334 > Project: HBase > Issue Type: Bug > Components: amv2, proc-v2, test >Reporter: Duo Zhang >Assignee: Duo Zhang >Priority: Major > Fix For: 3.0.0, 2.2.0, 2.1.1, 2.0.3 > > Attachments: HBASE-21334.patch, > org.apache.hadoop.hbase.master.assignment.TestMergeTableRegionsProcedure-output.txt > > > {noformat} > Error Message > found 5 corrupted procedure(s) on replay > Stacktrace > java.io.IOException: found 5 corrupted procedure(s) on replay > at > org.apache.hadoop.hbase.master.assignment.TestMergeTableRegionsProcedure.testMergeWithoutPONR(TestMergeTableRegionsProcedure.java:295) > {noformat} -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (HBASE-21334) TestMergeTableRegionsProcedure is flakey
[ https://issues.apache.org/jira/browse/HBASE-21334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16658466#comment-16658466 ] Duo Zhang commented on HBASE-21334: --- No useful information in the log. Anyway let me commit the patch to all branches first, at least it solves one of the problems. Will keep an eye on the flakey dashboard. > TestMergeTableRegionsProcedure is flakey > > > Key: HBASE-21334 > URL: https://issues.apache.org/jira/browse/HBASE-21334 > Project: HBase > Issue Type: Bug > Components: amv2, proc-v2, test >Reporter: Duo Zhang >Assignee: Duo Zhang >Priority: Major > Fix For: 3.0.0, 2.2.0, 2.1.1, 2.0.3 > > Attachments: HBASE-21334.patch, > org.apache.hadoop.hbase.master.assignment.TestMergeTableRegionsProcedure-output.txt > > > {noformat} > Error Message > found 5 corrupted procedure(s) on replay > Stacktrace > java.io.IOException: found 5 corrupted procedure(s) on replay > at > org.apache.hadoop.hbase.master.assignment.TestMergeTableRegionsProcedure.testMergeWithoutPONR(TestMergeTableRegionsProcedure.java:295) > {noformat} -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (HBASE-21334) TestMergeTableRegionsProcedure is flakey
[ https://issues.apache.org/jira/browse/HBASE-21334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16658455#comment-16658455 ] Duo Zhang commented on HBASE-21334: --- Much stable now but TestMergeTableRegionsProcedure still failed once, a NPE... Let me dig. > TestMergeTableRegionsProcedure is flakey > > > Key: HBASE-21334 > URL: https://issues.apache.org/jira/browse/HBASE-21334 > Project: HBase > Issue Type: Bug > Components: amv2, proc-v2, test >Reporter: Duo Zhang >Assignee: Duo Zhang >Priority: Major > Fix For: 3.0.0, 2.2.0, 2.1.1, 2.0.3 > > Attachments: HBASE-21334.patch, > org.apache.hadoop.hbase.master.assignment.TestMergeTableRegionsProcedure-output.txt > > > {noformat} > Error Message > found 5 corrupted procedure(s) on replay > Stacktrace > java.io.IOException: found 5 corrupted procedure(s) on replay > at > org.apache.hadoop.hbase.master.assignment.TestMergeTableRegionsProcedure.testMergeWithoutPONR(TestMergeTableRegionsProcedure.java:295) > {noformat} -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (HBASE-21334) TestMergeTableRegionsProcedure is flakey
[ https://issues.apache.org/jira/browse/HBASE-21334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16658240#comment-16658240 ] Duo Zhang commented on HBASE-21334: --- Pushed to master. Let's see how it works. > TestMergeTableRegionsProcedure is flakey > > > Key: HBASE-21334 > URL: https://issues.apache.org/jira/browse/HBASE-21334 > Project: HBase > Issue Type: Bug > Components: amv2, proc-v2, test >Reporter: Duo Zhang >Assignee: Duo Zhang >Priority: Major > Fix For: 3.0.0, 2.2.0, 2.1.1, 2.0.3 > > Attachments: HBASE-21334.patch, > org.apache.hadoop.hbase.master.assignment.TestMergeTableRegionsProcedure-output.txt > > > {noformat} > Error Message > found 5 corrupted procedure(s) on replay > Stacktrace > java.io.IOException: found 5 corrupted procedure(s) on replay > at > org.apache.hadoop.hbase.master.assignment.TestMergeTableRegionsProcedure.testMergeWithoutPONR(TestMergeTableRegionsProcedure.java:295) > {noformat} -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (HBASE-21334) TestMergeTableRegionsProcedure is flakey
[ https://issues.apache.org/jira/browse/HBASE-21334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16658235#comment-16658235 ] Hadoop QA commented on HBASE-21334: --- | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem || Runtime || Comment || | {color:blue}0{color} | {color:blue} reexec {color} | {color:blue} 0m 49s{color} | {color:blue} Docker mode activated. {color} | || || || || {color:brown} Prechecks {color} || | {color:green}+1{color} | {color:green} hbaseanti {color} | {color:green} 0m 0s{color} | {color:green} Patch does not have any anti-patterns. {color} | | {color:green}+1{color} | {color:green} @author {color} | {color:green} 0m 0s{color} | {color:green} The patch does not contain any @author tags. {color} | | {color:green}+1{color} | {color:green} test4tests {color} | {color:green} 0m 0s{color} | {color:green} The patch appears to include 1 new or modified test files. {color} | || || || || {color:brown} master Compile Tests {color} || | {color:green}+1{color} | {color:green} mvninstall {color} | {color:green} 8m 14s{color} | {color:green} master passed {color} | | {color:green}+1{color} | {color:green} compile {color} | {color:green} 1m 54s{color} | {color:green} master passed {color} | | {color:green}+1{color} | {color:green} checkstyle {color} | {color:green} 1m 7s{color} | {color:green} master passed {color} | | {color:green}+1{color} | {color:green} shadedjars {color} | {color:green} 4m 5s{color} | {color:green} branch has no errors when building our shaded downstream artifacts. {color} | | {color:green}+1{color} | {color:green} findbugs {color} | {color:green} 1m 58s{color} | {color:green} master passed {color} | | {color:green}+1{color} | {color:green} javadoc {color} | {color:green} 0m 29s{color} | {color:green} master passed {color} | || || || || {color:brown} Patch Compile Tests {color} || | {color:green}+1{color} | {color:green} mvninstall {color} | {color:green} 4m 45s{color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} compile {color} | {color:green} 1m 45s{color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} javac {color} | {color:green} 1m 45s{color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} checkstyle {color} | {color:green} 1m 6s{color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} whitespace {color} | {color:green} 0m 0s{color} | {color:green} The patch has no whitespace issues. {color} | | {color:green}+1{color} | {color:green} shadedjars {color} | {color:green} 4m 10s{color} | {color:green} patch has no errors when building our shaded downstream artifacts. {color} | | {color:green}+1{color} | {color:green} hadoopcheck {color} | {color:green} 10m 0s{color} | {color:green} Patch does not cause any errors with Hadoop 2.7.4 or 3.0.0. {color} | | {color:green}+1{color} | {color:green} findbugs {color} | {color:green} 2m 6s{color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} javadoc {color} | {color:green} 0m 29s{color} | {color:green} the patch passed {color} | || || || || {color:brown} Other Tests {color} || | {color:red}-1{color} | {color:red} unit {color} | {color:red}247m 22s{color} | {color:red} hbase-server in the patch failed. {color} | | {color:green}+1{color} | {color:green} asflicense {color} | {color:green} 0m 22s{color} | {color:green} The patch does not generate ASF License warnings. {color} | | {color:black}{color} | {color:black} {color} | {color:black}291m 9s{color} | {color:black} {color} | \\ \\ || Reason || Tests || | Failed junit tests | hadoop.hbase.io.asyncfs.TestSaslFanOutOneBlockAsyncDFSOutput | \\ \\ || Subsystem || Report/Notes || | Docker | Client=17.05.0-ce Server=17.05.0-ce Image:yetus/hbase:b002b0b | | JIRA Issue | HBASE-21334 | | JIRA Patch URL | https://issues.apache.org/jira/secure/attachment/12944885/HBASE-21334.patch | | Optional Tests | dupname asflicense javac javadoc unit findbugs shadedjars hadoopcheck hbaseanti checkstyle compile | | uname | Linux fcf4a43f965b 4.4.0-133-generic #159-Ubuntu SMP Fri Aug 10 07:31:43 UTC 2018 x86_64 GNU/Linux | | Build tool | maven | | Personality | /home/jenkins/jenkins-slave/workspace/PreCommit-HBASE-Build/component/dev-support/hbase-personality.sh | | git revision | master / ae5308ac4a | | maven | version: Apache Maven 3.5.4 (1edded0938998edf8bf061f1ceb3cfdeccf443fe; 2018-06-17T18:33:14Z) | | Default Java | 1.8.0_181 | | findbugs | v3.1.0-RC3 | | unit | https://builds.apache.org/job/PreCommit-HBASE-Build/14780/artifact/patchprocess/patch-unit-hbase-server.txt | | Test Results | https://builds.apache.org/job/PreCommit-HBASE-Build/14780/testReport/ | | Max. process+thread count | 4851 (vs. ulimit of 1) | | modules | C: hbase-server U: hbase-server | |
[jira] [Commented] (HBASE-21334) TestMergeTableRegionsProcedure is flakey
[ https://issues.apache.org/jira/browse/HBASE-21334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16658121#comment-16658121 ] Duo Zhang commented on HBASE-21334: --- OK I think this is test issue. For MergeTableRegionsProcedure and SplitTableRegionProcedure, we will schedule TRSPs to bring the region online, and since the MergeTableRegionsProcedure or SplitTableRegionProcedure still holds the lock when rolling back, the TRSPs can only be executed after the rollback is finished, and since we have set kill after every step so these TRSPs may also be effected. We have a piece of code in MasterProcedureTestingUtility to deal with this but obviously it does not always work... {code} if (waitForAsyncProcs) { // Sometimes there are other procedures still executing (including asynchronously spawned by // procId) and due to KillAndToggleBeforeStoreUpdate flag ProcedureExecutor is stopped before // store update. Let all pending procedures finish normally. if (!procExec.isRunning()) { LOG.warn("ProcedureExecutor not running, may have been stopped by pending procedure due to" + " KillAndToggleBeforeStoreUpdate flag."); ProcedureTestingUtility.setKillAndToggleBeforeStoreUpdate(procExec, false); restartMasterProcedureExecutor(procExec); ProcedureTestingUtility.waitNoProcedureRunning(procExec); } } {code} Let me think how to make it more stable... > TestMergeTableRegionsProcedure is flakey > > > Key: HBASE-21334 > URL: https://issues.apache.org/jira/browse/HBASE-21334 > Project: HBase > Issue Type: Bug > Components: amv2, proc-v2 >Reporter: Duo Zhang >Priority: Major > Attachments: > org.apache.hadoop.hbase.master.assignment.TestMergeTableRegionsProcedure-output.txt > > > {noformat} > Error Message > found 5 corrupted procedure(s) on replay > Stacktrace > java.io.IOException: found 5 corrupted procedure(s) on replay > at > org.apache.hadoop.hbase.master.assignment.TestMergeTableRegionsProcedure.testMergeWithoutPONR(TestMergeTableRegionsProcedure.java:295) > {noformat} -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (HBASE-21334) TestMergeTableRegionsProcedure is flakey
[ https://issues.apache.org/jira/browse/HBASE-21334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16655179#comment-16655179 ] Duo Zhang commented on HBASE-21334: --- No sure. It does not always happen, and the failing methods are not only the rollback one. Let me dig more. Maybe first I need to reimplement the WALProcedureMap, it is too complicated. > TestMergeTableRegionsProcedure is flakey > > > Key: HBASE-21334 > URL: https://issues.apache.org/jira/browse/HBASE-21334 > Project: HBase > Issue Type: Bug > Components: amv2, proc-v2 >Reporter: Duo Zhang >Priority: Major > Attachments: > org.apache.hadoop.hbase.master.assignment.TestMergeTableRegionsProcedure-output.txt > > > {noformat} > Error Message > found 5 corrupted procedure(s) on replay > Stacktrace > java.io.IOException: found 5 corrupted procedure(s) on replay > at > org.apache.hadoop.hbase.master.assignment.TestMergeTableRegionsProcedure.testMergeWithoutPONR(TestMergeTableRegionsProcedure.java:295) > {noformat} -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (HBASE-21334) TestMergeTableRegionsProcedure is flakey
[ https://issues.apache.org/jira/browse/HBASE-21334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16655160#comment-16655160 ] Allan Yang commented on HBASE-21334: is it with HBASE-21278 or not? > TestMergeTableRegionsProcedure is flakey > > > Key: HBASE-21334 > URL: https://issues.apache.org/jira/browse/HBASE-21334 > Project: HBase > Issue Type: Bug > Components: amv2, proc-v2 >Reporter: Duo Zhang >Priority: Major > Attachments: > org.apache.hadoop.hbase.master.assignment.TestMergeTableRegionsProcedure-output.txt > > > {noformat} > Error Message > found 5 corrupted procedure(s) on replay > Stacktrace > java.io.IOException: found 5 corrupted procedure(s) on replay > at > org.apache.hadoop.hbase.master.assignment.TestMergeTableRegionsProcedure.testMergeWithoutPONR(TestMergeTableRegionsProcedure.java:295) > {noformat} -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (HBASE-21334) TestMergeTableRegionsProcedure is flakey
[ https://issues.apache.org/jira/browse/HBASE-21334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16654606#comment-16654606 ] Duo Zhang commented on HBASE-21334: --- {noformat} 2018-10-17 21:47:44,999 ERROR [Time-limited test] procedure2.ProcedureExecutor$2(444): Corrupt pid=22, ppid=19, state=RUNNABLE, hasLock=false; org.apache.hadoop.hbase.master.assignment.CloseRegionProcedure 2018-10-17 21:47:45,007 ERROR [Time-limited test] procedure2.ProcedureExecutor$2(444): Corrupt pid=19, ppid=18, state=WAITING:REGION_STATE_TRANSITION_CONFIRM_CLOSED, hasLock=false; TransitRegionStateProcedure table=testMergeWithoutPONR, region=3b7371ecf932aa0f7fa0b9a03df56bf2, UNASSIGN 2018-10-17 21:47:45,008 ERROR [Time-limited test] procedure2.ProcedureExecutor$2(444): Corrupt pid=20, ppid=18, state=RUNNABLE:REGION_STATE_TRANSITION_CONFIRM_CLOSED, hasLock=false; TransitRegionStateProcedure table=testMergeWithoutPONR, region=55f7f23154a35661b02f91ff421b58d5, UNASSIGN 2018-10-17 21:47:45,009 ERROR [Time-limited test] procedure2.ProcedureExecutor$2(444): Corrupt pid=21, ppid=20, state=SUCCESS, hasLock=false; org.apache.hadoop.hbase.master.assignment.CloseRegionProcedure 2018-10-17 21:47:45,010 ERROR [Time-limited test] procedure2.ProcedureExecutor$2(444): Corrupt pid=18, state=WAITING:MERGE_TABLE_REGIONS_CHECK_CLOSED_REGIONS, hasLock=false; MergeTableRegionsProcedure table=testMergeWithoutPONR, regions=[3b7371ecf932aa0f7fa0b9a03df56bf2, 55f7f23154a35661b02f91ff421b58d5], forcibly=false {noformat} It is a bit strange, we have the root procedure there(pid = 18). Let me dig more... > TestMergeTableRegionsProcedure is flakey > > > Key: HBASE-21334 > URL: https://issues.apache.org/jira/browse/HBASE-21334 > Project: HBase > Issue Type: Bug > Components: amv2, proc-v2 >Reporter: Duo Zhang >Priority: Major > Attachments: > org.apache.hadoop.hbase.master.assignment.TestMergeTableRegionsProcedure-output.txt > > > {noformat} > Error Message > found 5 corrupted procedure(s) on replay > Stacktrace > java.io.IOException: found 5 corrupted procedure(s) on replay > at > org.apache.hadoop.hbase.master.assignment.TestMergeTableRegionsProcedure.testMergeWithoutPONR(TestMergeTableRegionsProcedure.java:295) > {noformat} -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (HBASE-21334) TestMergeTableRegionsProcedure is flakey
[ https://issues.apache.org/jira/browse/HBASE-21334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16654589#comment-16654589 ] Duo Zhang commented on HBASE-21334: --- I've also seen the 'corrupted procedure(s)' on our testing cluster. Let me dig. > TestMergeTableRegionsProcedure is flakey > > > Key: HBASE-21334 > URL: https://issues.apache.org/jira/browse/HBASE-21334 > Project: HBase > Issue Type: Bug > Components: amv2, proc-v2 >Reporter: Duo Zhang >Priority: Major > > {noformat} > Error Message > found 5 corrupted procedure(s) on replay > Stacktrace > java.io.IOException: found 5 corrupted procedure(s) on replay > at > org.apache.hadoop.hbase.master.assignment.TestMergeTableRegionsProcedure.testMergeWithoutPONR(TestMergeTableRegionsProcedure.java:295) > {noformat} -- This message was sent by Atlassian JIRA (v7.6.3#76005)