[ https://issues.apache.org/jira/browse/HBASE-21377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Duo Zhang reassigned HBASE-21377:
---------------------------------
Assignee: Duo Zhang
> Add debug log for procedure stack id related operations
> -------------------------------------------------------
>
> Key: HBASE-21377
> URL: https://issues.apache.org/jira/browse/HBASE-21377
> Project: HBase
> Issue Type: Sub-task
> Components: proc-v2
> Reporter: Duo Zhang
> Assignee: Duo Zhang
> Priority: Major
> Fix For: 3.0.0, 2.2.0
>
> Attachments: HBASE-21377-debuglog.patch
>
>
> TestMergeTableRegionsProcedure is still flaky, and I found this in the output:
> {noformat}
> 2018-10-24 03:46:12,842 ERROR [Time-limited test] wal.WALProcedureTree(198): Missing stack id 6, max stack id is 8, root procedure is Procedure(pid=42, ppid=-1, class=org.apache.hadoop.hbase.master.assignment.MergeTableRegionsProcedure)
> 2018-10-24 03:46:12,847 ERROR [Time-limited test] procedure2.ProcedureExecutor$2(451): Corrupt pid=42, state=WAITING:MERGE_TABLE_REGIONS_CHECK_CLOSED_REGIONS, hasLock=false; MergeTableRegionsProcedure table=testRollbackAndDoubleExecution, regions=[72aed4d14ac73faaa1755e248a55b71a, a848f3ca26989865d5cccc9cd0683ae6], forcibly=false
> 2018-10-24 03:46:12,847 ERROR [Time-limited test] procedure2.ProcedureExecutor$2(451): Corrupt pid=43, ppid=42, state=RUNNABLE:REGION_STATE_TRANSITION_CONFIRM_CLOSED, hasLock=false; TransitRegionStateProcedure table=testRollbackAndDoubleExecution, region=72aed4d14ac73faaa1755e248a55b71a, UNASSIGN
> 2018-10-24 03:46:12,848 ERROR [Time-limited test] procedure2.ProcedureExecutor$2(451): Corrupt pid=44, ppid=42, state=WAITING:REGION_STATE_TRANSITION_CONFIRM_CLOSED, hasLock=false; TransitRegionStateProcedure table=testRollbackAndDoubleExecution, region=a848f3ca26989865d5cccc9cd0683ae6, UNASSIGN
> 2018-10-24 03:46:12,848 ERROR [Time-limited test] procedure2.ProcedureExecutor$2(451): Corrupt pid=45, ppid=43, state=SUCCESS, hasLock=false; org.apache.hadoop.hbase.master.assignment.CloseRegionProcedure
> 2018-10-24 03:46:12,849 ERROR [Time-limited test] procedure2.ProcedureExecutor$2(451): Corrupt pid=46, ppid=44, state=RUNNABLE, hasLock=false; org.apache.hadoop.hbase.master.assignment.CloseRegionProcedure
> {noformat}
> Need to dig more.
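
For readers unfamiliar with the "Missing stack id 6, max stack id is 8" message, the following is a minimal sketch of the kind of contiguity check that message suggests. It is not the WALProcedureTree code. It assumes that every stack id from 0 up to the largest one seen should be claimed by some procedure in the tree rooted at one root procedure. The class name StackIdCheck, the method findMissingStackIds, and the sample ids are hypothetical and chosen only for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.BitSet;
import java.util.List;

public class StackIdCheck {
    /**
     * Returns the stack ids in [0, max] that no procedure claims, where max is
     * the largest id seen. An empty result means the ids are contiguous.
     * Assumes all ids are non-negative.
     */
    public static List<Integer> findMissingStackIds(List<int[]> stackIdsPerProcedure) {
        BitSet seen = new BitSet();
        for (int[] ids : stackIdsPerProcedure) {
            for (int id : ids) {
                seen.set(id);
            }
        }
        List<Integer> missing = new ArrayList<>();
        // seen.length() is (highest set bit + 1), or 0 when nothing was recorded.
        for (int id = seen.nextClearBit(0); id < seen.length(); id = seen.nextClearBit(id + 1)) {
            missing.add(id);
        }
        return missing;
    }

    public static void main(String[] args) {
        // Hypothetical tree: a root and three descendants whose ids cover 0..8 except 6.
        List<int[]> tree = Arrays.asList(
            new int[] {0, 1, 5},
            new int[] {2, 7},
            new int[] {3, 8},
            new int[] {4});
        System.out.println(findMissingStackIds(tree)); // prints [6]
    }
}
```

Under this assumption, a gap like the one in the log would mean that a stack id was assigned to some procedure in the tree but the record carrying it was not found on load, which is why logging each stack id assignment and removal would help narrow the problem down.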
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)