Michael, The Namenode is also a SPOF.
On keeping a separate cluster for failover, until we add higher availability, it depends on how much uptime you need to provide I guess. I personally never saw a machine hosting a Master failing, so I'm not sure on how clean it can be regards the META, but I think that just closing your region servers, changing the config for the master then restart the cluster with a new master would be nearly enough provided that the Namenode was hosted on another machine. Maybe Jim or Stack can confirm? J-D On Fri, Nov 7, 2008 at 5:33 AM, Michael Dagaev <[EMAIL PROTECTED]>wrote: > Hi, all > > I guess that Hbase master server is a single point of failure. Is > it correct ? Does Hbase (I mean the whole stack -- HDFS + Hbase) have > any other single point of failure ? > > If Hbase has a single point of failure we should arrange a backup > Hbase cluster to switch to it in case of failure. Does it make sense ? > > Thank you for your cooperation, > M. >
