Please consider enabling debug logs: http://wiki.apache.org/hadoop/Hbase/FAQ#5
Also if you are having a problem with a process going down, please do 'kill -QUIT <pid>" to get a stack dump into the .out file. This helps us diagnose your problem. Until http://issues.apache.org/jira/browse/HADOOP-4379 is fixed, kill -9 of a regionserver will lose data. We are putting in extra logfile closes in 0.20, but you stand to risk up to ~60mbish of edits if you are under a heavy write load. People _are_ working on HADOOP-4379 as we speak. On Fri, May 15, 2009 at 12:39 AM, Sasha Dolgy <[email protected]> wrote: > you need to find the root cause. i've had this before and it worked out to > be a permissions issue on some of the files. log files are your friend... > > On Fri, May 15, 2009 at 8:14 AM, Ninad Raut <[email protected] > >wrote: > > > Hi Andy, > > But what if the daemon.sh keep running for ever? It just doent complete > > even > > after an hour or so. Any other workaround? > > > > On Thu, May 14, 2009 at 10:44 PM, Andrew Purtell <[email protected] > > >wrote: > > > > > NEVER KILL -9 A REGION SERVER!!! > > > > > > - Andy > > > > > > > > > > > > > > > ________________________________ > > > From: Ninad Raut <[email protected]> > > > To: [email protected] > > > Cc: Ranjit Nair <[email protected]> > > > Sent: Thursday, May 14, 2009 3:03:06 AM > > > Subject: Re: Administration tool for HBase > > > > > > Ryan, > > > Using bin/hbase-daemon.sh start regionserver and bin/hbase-daemon.sh > stop > > > regionserver can we add/remove slave nodes when the cluster is live? > > > How to handle a region not serving exception? Because in this scenario > > the > > > daemon.sh script seems to go on forever? > > > We usually kill the HRegion process using kill -9, which causes loss of > > > data > > > if it is not flushed. > > > Regards, > > > Ninad > > > > > > On Thu, May 14, 2009 at 12:26 PM, Ryan Rawson <[email protected]> > > wrote: > > > > > > > There are tools: > > > > > > > > bin/start-hbase.sh > > > > bin/stop-hbase.sh > > > > > > > > starts/stops cluster. > > > > > > > > per node: > > > > bin/hbase-daemon.sh start regionserver > > > > bin/hbase-daemon.sh stop regionserver > > > > > > > > you might need HBASE-1362 however. > > > > > > > > On Wed, May 13, 2009 at 11:46 PM, Ninad Raut < > > [email protected] > > > > >wrote: > > > > > > > > > Hi, > > > > > I have been asekd to automate the following processes in HBase: > > > > > > > > > > - Script to add/remove regionservers and datanodes from a cluster > > > when > > > > > cluster is live > > > > > > > > > > > > > > > - Script to handle master node failure > > > > > - Script for graceful shutdown/startup of hbase > > > > > > > > > > Is there any adminsitration tool present for hbase. How can we use > > > Hadoop > > > > > On > > > > > Demand and/or zookeeper to make the cluster failure resistant. > > > > > Regards, > > > > > Ninad > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > -- > Sasha Dolgy > [email protected] >
