[ 
https://issues.apache.org/jira/browse/HBASE-2342?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12846772#action_12846772
 ] 

Todd Lipcon commented on HBASE-2342:
------------------------------------

Here's a crazy idea: JVM-TI (the JVM tool interface) allows you to hook events 
like GC start and GC stop. This could potentially be useful while watching the 
region server to understand whether the RS is "just sleeping" in GC or actually 
hosed in some way. It's also a pain in the butt because it requires native 
code, but worth thinking about.

> Consider adding a watchdog node next to region server
> -----------------------------------------------------
>
>                 Key: HBASE-2342
>                 URL: https://issues.apache.org/jira/browse/HBASE-2342
>             Project: Hadoop HBase
>          Issue Type: New Feature
>          Components: regionserver
>            Reporter: Todd Lipcon
>
> This idea has been bandied about a fair amount. The concept is to add a 
> second java process that runs next to each region server to act as a 
> watchdog. Several possible purposes:
> - monitor the RS for liveness - if it exhibits Juliet syndrome ("appears 
> dead") then we kill it agressively to prevent it from coming back to life
> - restart RS automatically in failure cases
> - potentially move the entire ZK session to the watchdog to decouple node 
> liveness from the particular JVM liveness
> Let's discuss in this JIRA.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to