[
https://issues.apache.org/jira/browse/HBASE-24015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Jasani resolved HBASE-24015.
----------------------------------
Hadoop Flags: Reviewed
Resolution: Fixed
> Coverage for Assign and Unassign of Regions on RegionServer on failure
> ----------------------------------------------------------------------
>
> Key: HBASE-24015
> URL: https://issues.apache.org/jira/browse/HBASE-24015
> Project: HBase
> Issue Type: Test
> Components: amv2
> Reporter: Michael Stack
> Assignee: Sandeep Pal
> Priority: Major
> Fix For: 3.0.0-alpha-1, 2.3.3, 1.7.0, 2.4.0
>
>
> Looking at 'HBASE-23984 [Flakey Tests] TestMasterAbortAndRSGotKilled fails in
> teardown', and at UnassignRegionHandler, AssignRegionHandler,
> CloseRegionHandler, and the work that is done inline w/ request vs that which
> to the side in executors, we need more coverage and specification of what
> happens around the edges. This coverage would be more to see if holes in our
> handling currently in a unit test case context before we see it out on
> clusters.
> HBASE-23984 addresses holes where UnassignRegionHandler and
> AssignRegionHandler could skip out w/o clearing Regions from the
> RegionServer#regionsInTransitionInRS Map of Regions In Transition if failed
> open or close because the RegionServer is aborting.
> Other holes seem lurking. On exception, we were leaving entries in the
> RegionServer# submittedRegionProcedure map added by HBASE-2204; not the end
> of the world but they should be cleared on error? HBASE-23984 adds clearning
> from submittedRegionProcedure but then procedures even if failed get added to
> the cache of procedures... so if we try to run the procedure again against
> this server it won't be scheduled.
> interesting stuff.
> This issue is about adding tests that fail assign/unassign/close on the
> RegionServer side making sure RS state is left in a good condition on fail.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)