[ https://issues.apache.org/jira/browse/HBASE-21377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Duo Zhang updated HBASE-21377: ------------------------------ Summary: Missing procedure stack index when restarting (was: Missing stack id when restarting) > Missing procedure stack index when restarting > --------------------------------------------- > > Key: HBASE-21377 > URL: https://issues.apache.org/jira/browse/HBASE-21377 > Project: HBase > Issue Type: Sub-task > Components: proc-v2 > Reporter: Duo Zhang > Priority: Major > Fix For: 3.0.0, 2.2.0 > > Attachments: HBASE-21377-debuglog.patch > > > TestMergeTableRegionsProcedure is still flakey, and found this in the output > {noformat} > 2018-10-24 03:46:12,842 ERROR [Time-limited test] wal.WALProcedureTree(198): > Missing stack id 6, max stack id is 8, root procedure is Procedure(pid=42, > ppid=-1, > class=org.apache.hadoop.hbase.master.assignment.MergeTableRegionsProcedure) > 2018-10-24 03:46:12,847 ERROR [Time-limited test] > procedure2.ProcedureExecutor$2(451): Corrupt pid=42, > state=WAITING:MERGE_TABLE_REGIONS_CHECK_CLOSED_REGIONS, hasLock=false; > MergeTableRegionsProcedure table=testRollbackAndDoubleExecution, > regions=[72aed4d14ac73faaa1755e248a55b71a, a848f3ca26989865d5cccc9cd0683ae6], > forcibly=false > 2018-10-24 03:46:12,847 ERROR [Time-limited test] > procedure2.ProcedureExecutor$2(451): Corrupt pid=43, ppid=42, > state=RUNNABLE:REGION_STATE_TRANSITION_CONFIRM_CLOSED, hasLock=false; > TransitRegionStateProcedure table=testRollbackAndDoubleExecution, > region=72aed4d14ac73faaa1755e248a55b71a, UNASSIGN > 2018-10-24 03:46:12,848 ERROR [Time-limited test] > procedure2.ProcedureExecutor$2(451): Corrupt pid=44, ppid=42, > state=WAITING:REGION_STATE_TRANSITION_CONFIRM_CLOSED, hasLock=false; > TransitRegionStateProcedure table=testRollbackAndDoubleExecution, > region=a848f3ca26989865d5cccc9cd0683ae6, UNASSIGN > 2018-10-24 03:46:12,848 ERROR [Time-limited test] > procedure2.ProcedureExecutor$2(451): Corrupt pid=45, ppid=43, state=SUCCESS, > hasLock=false; org.apache.hadoop.hbase.master.assignment.CloseRegionProcedure > 2018-10-24 03:46:12,849 ERROR [Time-limited test] > procedure2.ProcedureExecutor$2(451): Corrupt pid=46, ppid=44, state=RUNNABLE, > hasLock=false; org.apache.hadoop.hbase.master.assignment.CloseRegionProcedure > {noformat} > Need to dig more. -- This message was sent by Atlassian JIRA (v7.6.3#76005)