Improve lease handling
----------------------
Key: HBASE-1156
URL: https://issues.apache.org/jira/browse/HBASE-1156
Project: Hadoop HBase
Issue Type: Improvement
Components: master, regionserver
Affects Versions: 0.19.0
Reporter: Jim Kellerman
Assignee: Jim Kellerman
Fix For: 0.20.0
Currently, if a region server crashes and then restarts, it cannot be given
work until its lease times out. This is because a lease is identified only by
ipaddress:portnumber. If leases were also identified by the start code, the
server could be given work immediately, because its log file includes the start
code and will not interfere with the recovery of the log from its previous
incarnation.
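
To illustrate the idea, here is a minimal sketch of a lease key that includes
the start code. It is not the HBase implementation: ServerKey, LeaseTable, and
the sample address, port, and start codes are all hypothetical names and values
chosen for this example.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.Objects;

final class LeaseKeyDemo {
    /** Hypothetical lease key: address, port, and start code of one server incarnation. */
    static final class ServerKey {
        final String host;
        final int port;
        final long startCode;

        ServerKey(String host, int port, long startCode) {
            this.host = host;
            this.port = port;
            this.startCode = startCode;
        }

        @Override
        public boolean equals(Object o) {
            if (this == o) return true;
            if (!(o instanceof ServerKey)) return false;
            ServerKey other = (ServerKey) o;
            return port == other.port
                && startCode == other.startCode
                && host.equals(other.host);
        }

        @Override
        public int hashCode() {
            return Objects.hash(host, port, startCode);
        }
    }

    /** Hypothetical lease table: maps a server incarnation to its lease expiry (ms). */
    static final class LeaseTable {
        private final Map<ServerKey, Long> expiryByServer = new HashMap<>();

        void grant(ServerKey key, long expiresAtMillis) {
            expiryByServer.put(key, expiresAtMillis);
        }

        boolean holdsLease(ServerKey key) {
            return expiryByServer.containsKey(key);
        }
    }

    public static void main(String[] args) {
        LeaseTable leases = new LeaseTable();

        // Lease held by the incarnation that later crashed.
        ServerKey crashed = new ServerKey("10.0.0.5", 60020, 1000L);
        leases.grant(crashed, 120_000L);

        // Same address and port, new start code: a different key, so the
        // old lease does not block the restarted server.
        ServerKey restarted = new ServerKey("10.0.0.5", 60020, 2000L);
        System.out.println("old lease blocks restart: " + leases.holdsLease(restarted));
        leases.grant(restarted, 240_000L);
    }
}
```

With a key of address and port alone, the two incarnations would collide and
the restarted server would have to wait for the old entry to expire.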
Additionally, we wait in a master server thread for the server to leave the
dead servers list, because dead servers are not identified by their start code
either. Waiting in a master server thread ties up that thread (possibly for
quite some time). Rather than waiting, we should throw an exception, since the
region server already knows how to deal with an exception thrown from a
regionServerStartup call.
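
The fail-fast approach could look roughly like the sketch below. It assumes a
simplified master that tracks dead servers by address only. The class names,
the PreviousIncarnationPendingException type, and the retry loop are
hypothetical and are not taken from the HBase code base.

```java
import java.util.HashSet;
import java.util.Set;

final class StartupCheckDemo {
    /** Hypothetical exception: the previous incarnation is still being recovered. */
    static final class PreviousIncarnationPendingException extends Exception {
        PreviousIncarnationPendingException(String message) {
            super(message);
        }
    }

    /** Simplified stand-in for the master; tracks dead servers by "host:port". */
    static final class Master {
        private final Set<String> deadServers = new HashSet<>();

        void markDead(String hostAndPort) {
            deadServers.add(hostAndPort);
        }

        void recoveryFinished(String hostAndPort) {
            deadServers.remove(hostAndPort);
        }

        /** Fails fast instead of blocking a master thread until recovery ends. */
        void regionServerStartup(String hostAndPort)
                throws PreviousIncarnationPendingException {
            if (deadServers.contains(hostAndPort)) {
                throw new PreviousIncarnationPendingException(
                    hostAndPort + " is still on the dead servers list");
            }
            // Otherwise the server would be registered and given work here.
        }
    }

    /** Region server side: retry on the exception, up to maxAttempts times. */
    static boolean reportForDuty(Master master, String hostAndPort, int maxAttempts) {
        for (int attempt = 1; attempt <= maxAttempts; attempt++) {
            try {
                master.regionServerStartup(hostAndPort);
                return true;
            } catch (PreviousIncarnationPendingException e) {
                // A real server would sleep before the next attempt.
            }
        }
        return false;
    }

    public static void main(String[] args) {
        Master master = new Master();
        master.markDead("10.0.0.5:60020");
        System.out.println("during recovery: " + reportForDuty(master, "10.0.0.5:60020", 3));
        master.recoveryFinished("10.0.0.5:60020");
        System.out.println("after recovery: " + reportForDuty(master, "10.0.0.5:60020", 3));
    }
}
```

The waiting moves to the caller, so no master thread is held while the old
incarnation is being cleaned up.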
Finally, there is a bit of code cleanup that needs to be done in the region
server when it receives a MSG_CALL_SERVER_STARTUP response from the master. It
should not set up the HLog until reportForDuty completes successfully (which
is what it does on the initial reportForDuty call).