It does seem like a regression. If u kill the zk data dir and restart the cluster does it work? (root location is up in zk)
Stack On Jan 13, 2011, at 11:37, James Kennedy <james.kenn...@troove.net> wrote: > I'm currently validating the new 0.90.0 RC3 with the hbase-trx layer and our > own application. > > All seems well so far except for the fact that I now find that HBase doesn't > adapt if I try to run the same data on different machines. > > e.g. > 1) I work from home and generated our seeded test data. > 2) Run the test suite and all tests pass > 3) I go to the office and re-run the tests. > > Result: HMaster fails because the .ROOT data has the wrong ip address for > locating the .META. At least that is my understanding from the stacktrace > below. Note that the 192.168.1.102 IP address in that trace is the IP from > my home network and is incorrect. > > This wasn't an issue with previous versions of HBase as far as I've noticed. > And this seems to be a big data portability fail. > Surely the HMaster should be able to absorb stale metadata and wait for new > region-servers to check in. > Instead it just keels over and dies. > But before logging a case I wanted to know if there was something I'm > obviously missing or doing wrong. > > The seeded test data is on HDFS. > > Thoughts? > > > [13/01/11 10:58:42] 5939 [ main] INFO > ion.service.HBaseRegionService - troove> Starting region server thread. > [13/01/11 11:00:15] 98699 [ HMaster] FATAL > he.hadoop.hbase.master.HMaster - Unhandled exception. Starting shutdown. > java.net.SocketTimeoutException: 20000 millis timeout while waiting for > channel to be ready for connect. ch : > java.nio.channels.SocketChannel[connection-pending > remote=192.168.1.102/192.168.1.102:60020] > at > org.apache.hadoop.net.SocketIOWithTimeout.connect(SocketIOWithTimeout.java:213) > at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:404) > at > org.apache.hadoop.hbase.ipc.HBaseClient$Connection.setupIOstreams(HBaseClient.java:311) > at > org.apache.hadoop.hbase.ipc.HBaseClient.getConnection(HBaseClient.java:865) > at org.apache.hadoop.hbase.ipc.HBaseClient.call(HBaseClient.java:732) > at org.apache.hadoop.hbase.ipc.HBaseRPC$Invoker.invoke(HBaseRPC.java:258) > at $Proxy15.getProtocolVersion(Unknown Source) > at org.apache.hadoop.hbase.ipc.HBaseRPC.getProxy(HBaseRPC.java:419) > at org.apache.hadoop.hbase.ipc.HBaseRPC.getProxy(HBaseRPC.java:393) > at org.apache.hadoop.hbase.ipc.HBaseRPC.getProxy(HBaseRPC.java:444) > at org.apache.hadoop.hbase.ipc.HBaseRPC.waitForProxy(HBaseRPC.java:349) > at > org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation.getHRegionConnection(HConnectionManager.java:954) > at > org.apache.hadoop.hbase.catalog.CatalogTracker.getCachedConnection(CatalogTracker.java:384) > at > org.apache.hadoop.hbase.catalog.CatalogTracker.getMetaServerConnection(CatalogTracker.java:283) > at > org.apache.hadoop.hbase.catalog.CatalogTracker.verifyMetaRegionLocation(CatalogTracker.java:478) > at > org.apache.hadoop.hbase.master.HMaster.assignRootAndMeta(HMaster.java:435) > at > org.apache.hadoop.hbase.master.HMaster.finishInitialization(HMaster.java:382) > at org.apache.hadoop.hbase.master.HMaster.run(HMaster.java:277) > at java.lang.Thread.run(Thread.java:680) > > > James Kennedy > Troove Inc. > >