Hi, Thanks for update. After spending quite a bit of time on Hadoop/HBase I couldn't find any thing awkward in logs. At last what I got to know is the reason for outage is IO Error thrown by the one of disk in which we are storing NameNode files.
One more suggestion we need is regarding NameNode HA. Since we are using hbase-0.94.1 which version of Hadoop we should apt for NameNode HA. We can't move away from HBase 0.94.1 in near future, and we want to adapt NameNode HA. Can someone suggest us some suitable solutions for us? Thanks,Sandeep. From: [email protected] Date: Wed, 27 Nov 2013 10:56:44 +0530 Subject: Re: Suddenly NameNode stopped responding To: [email protected] CC: [email protected] It is difficult to guess the reason behind this outage without the logs. Can we have a look at them? (pastebin). Did you configure HA for namenode? Did it failover to standby? On Wed, Nov 27, 2013 at 10:29 AM, Sandeep L <[email protected]> wrote: Hi, Couple of hours back all of sudden NameNode of our production cluster got stopped responding, due to this our HBase also stopped responding(as expected). Here mysterious thing is we unable to get any reason for NameNode interruption. I went through all log files of NameNode and I couldn't find any exception in logs. Can someone guess what could be the probable reason for this issue? Any one previously faced similar issue? We are using hbase-0.92.1 with hadoop-1.0.2 If you need any other information please let me know. Thanks,Sandeep. -- Bharath Vissapragada
