hi, We have a 4-node cluster Hadoop 0.19.0 and HBase 0.19.0. We run NameNode and RegionServer on the same server and created a bunch of tables on HBase.
Now we want to use another (more powerful) machine to replace the old master. Here is what we did: 1) Shutdown HBase and Hadoop 2) Copy all the Hadoop related files from the old master to the new master. 3) Re-configure the Hadoop and HBase so all (including the master and clients) are now pointing to the new master. 4) Start the Hadoop cluster. (This seems fine). 5) Start the HBase cluster. (This seems fine too). Then when we try to do a "count" in HBase shell, (e.g. count 'table_name'), we hit the following problem: 09/02/27 21:53:04 INFO ipc.HBaseClass: Retrying connect to server: / 10.249.190.85:60020. Already tried 0 time(s). 09/02/27 21:53:05 INFO ipc.HBaseClass: Retrying connect to server: / 10.249.190.85:60020. Already tried 1 time(s). 09/02/27 21:53:06 INFO ipc.HBaseClass: Retrying connect to server: / 10.249.190.85:60020. Already tried 2 time(s). 09/02/27 21:53:06 INFO ipc.HbaseRPC: Server at /10.249.190.85:60020 not available yet, Zzzzz... 09/02/27 21:53:06 INFO ipc.HbaseRPC: Server at /10.249.190.85:60020 could not be reached after 1 tries, giving up. 09/02/27 21:53:09 INFO ipc.HBaseClass: Retrying connect to server: / 10.249.190.85:60020. Already tried 0 time(s). 09/02/27 21:53:10 INFO ipc.HBaseClass: Retrying connect to server: / 10.249.190.85:60020. Already tried 1 time(s). 09/02/27 21:53:11 INFO ipc.HBaseClass: Retrying connect to server: / 10.249.190.85:60020. Already tried 2 time(s). The IP address showing here is actually the old master's IP address instead of the new one's. We tried "list" and "scan" commands in the HBase shell, both of them are working good. Just the "count" reported the above error. What's the problem here? Thanks, Yan
