Hello Users/Authors

 

We've observed in our cluster that HMaster went down after ZooKeeper
triggered a watched event of type session expired.

 

Why not reconnect to ZooKeeper (try at least once, and abort only if that
fails) and reset the trackers/watchers, instead of immediately
aborting/killing the HMaster/HRegionServers? This is what one of the
Abortable implementations, HConnectionImplementation in HConnectionManager,
already does.
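
To make the question concrete, here is a rough sketch of the behaviour I
have in mind. It is not HBase code: SessionExpiryHandler, Reconnector,
WatcherResetter and the local Abortable interface are hypothetical names
used only for illustration, and "reconnect" is assumed to mean opening a
fresh ZooKeeper session, since an expired session cannot be resumed.

```java
/**
 * Hypothetical sketch: on a session-expired event, try to establish a new
 * ZooKeeper session and re-register watchers a bounded number of times,
 * and abort only if every attempt fails. None of these types are HBase APIs.
 */
public class SessionExpiryHandler {

    /** Assumed hook that opens a brand-new ZooKeeper session. */
    public interface Reconnector {
        void reconnect() throws Exception;
    }

    /** Assumed hook that re-registers trackers/watchers on the new session. */
    public interface WatcherResetter {
        void resetWatchers() throws Exception;
    }

    /** Local stand-in for the process-level abort callback. */
    public interface Abortable {
        void abort(String why, Throwable cause);
    }

    private final Reconnector reconnector;
    private final WatcherResetter resetter;
    private final Abortable abortable;
    private final int maxAttempts;

    public SessionExpiryHandler(Reconnector reconnector, WatcherResetter resetter,
                                Abortable abortable, int maxAttempts) {
        this.reconnector = reconnector;
        this.resetter = resetter;
        this.abortable = abortable;
        this.maxAttempts = maxAttempts;
    }

    /**
     * Called when a session-expired event is received.
     * Returns true if recovery succeeded, false if the process was aborted.
     */
    public boolean onSessionExpired() {
        Exception lastFailure = null;
        for (int attempt = 1; attempt <= maxAttempts; attempt++) {
            try {
                reconnector.reconnect();   // new session
                resetter.resetWatchers();  // watches do not survive expiry
                return true;
            } catch (Exception e) {
                lastFailure = e;           // remember the cause, then retry
            }
        }
        abortable.abort("ZooKeeper session expired and reconnect failed", lastFailure);
        return false;
    }
}
```

With maxAttempts set to 1 this is the "try once, then abort" behaviour
asked about above. The sketch deliberately ignores what else is lost on
expiry (for example ephemeral znodes), which may well be the reason for
the current design.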

 

Could you kindly explain the reasoning behind this design?

 

Thanks

-Mohit
