[ 
https://issues.apache.org/jira/browse/HADOOP-1605?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12525800
 ] 

dhruba borthakur commented on HADOOP-1605:
------------------------------------------

On second thoughts, if the namenode catches exceptions and restarts, it still 
does not prevent the case when then JVM encounters a fault and exits. 

A better approach would be to have a WatchNode process that is told to monitor 
the Namenode. If the Namenode process exits, then the WatchNode process will 
restart the Namenode. A WatchNode process could be configured to watch multiple 
servers (Namenode, DataNode, Secondarynamenode, JobTracker, etc).



> Automatic namenode restart when it encounters an error situation
> ----------------------------------------------------------------
>
>                 Key: HADOOP-1605
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1605
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: dfs
>            Reporter: dhruba borthakur
>            Assignee: dhruba borthakur
>             Fix For: 0.15.0
>
>         Attachments: namenodeRestart4.patch
>
>
> The namenode dies when it encounters an unexpected Runtime Exception. 
> Instead, it can catch exceptions, clears up all its internal data structures 
> and restarts. This was attempted in HADOOP-1486 earlier.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to