[ https://issues.apache.org/jira/browse/HADOOP-1960?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jim Kellerman updated HADOOP-1960: ---------------------------------- Attachment: patch.txt TestMasterAbort - New test MiniHBaseCluster - Add getter that returns the HMaster object TestRegionServerAbort - Add check for scanner == null before trying to close it TestSplit - Enclose test body in try catch block so that exceptions can be dumped to the console at the point in the test where they occur. HRegionServer - If unable to communicate with the master for more than the lease timeout interval abort server. HMaster - Add abort method - If aborting, ignores region server reports for 1 1/2 times lease period > [hbase] If a region server cannot talk to the master after several attempts, > it should shut itself down > ------------------------------------------------------------------------------------------------------- > > Key: HADOOP-1960 > URL: https://issues.apache.org/jira/browse/HADOOP-1960 > Project: Hadoop > Issue Type: Improvement > Components: contrib/hbase > Affects Versions: 0.15.0 > Reporter: Jim Kellerman > Assignee: Jim Kellerman > Fix For: 0.15.0 > > Attachments: patch.txt > > > If a region server cannot contact the master after a configurable number of > tries, it should shut itself down. > If the region server cannot contact the master, > - if the master is alive but the network is partitioned, the master will > probably time out the region server's lease and try to recover the server's > log and reassign the regions the server is serving. > - if the master has died, and subsequently restarts, it will be reassigning > regions anyway, so the region server should stop serving the regions. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.