hi,

We have a 4-node cluster Hadoop 0.19.0 and HBase 0.19.0. We run NameNode and
RegionServer on the same server and created a bunch of tables on HBase.

Now we want to use another (more powerful) machine to replace the old
master. Here is what we did:

1) Shutdown HBase and Hadoop
2) Copy all the Hadoop related files from the old master to the new master.
3) Re-configure the Hadoop and HBase so all (including the master and
clients) are now pointing to the new master.
4) Start the Hadoop cluster. (This seems fine).
5) Start the HBase cluster. (This seems fine too).

Then when we try to do a "count" in HBase shell, (e.g. count 'table_name'),
we hit the following problem:

09/02/27 21:53:04 INFO ipc.HBaseClass: Retrying connect to server: /
10.249.190.85:60020. Already tried 0 time(s).
09/02/27 21:53:05 INFO ipc.HBaseClass: Retrying connect to server: /
10.249.190.85:60020. Already tried 1 time(s).
09/02/27 21:53:06 INFO ipc.HBaseClass: Retrying connect to server: /
10.249.190.85:60020. Already tried 2 time(s).
09/02/27 21:53:06 INFO ipc.HbaseRPC: Server at /10.249.190.85:60020 not
available yet, Zzzzz...
09/02/27 21:53:06 INFO ipc.HbaseRPC: Server at /10.249.190.85:60020 could
not be reached after 1 tries, giving up.
09/02/27 21:53:09 INFO ipc.HBaseClass: Retrying connect to server: /
10.249.190.85:60020. Already tried 0 time(s).
09/02/27 21:53:10 INFO ipc.HBaseClass: Retrying connect to server: /
10.249.190.85:60020. Already tried 1 time(s).
09/02/27 21:53:11 INFO ipc.HBaseClass: Retrying connect to server: /
10.249.190.85:60020. Already tried 2 time(s).

The IP address showing here is actually the old master's IP address instead
of the new one's.

We tried "list" and "scan" commands in the HBase shell, both of them are
working good. Just the "count" reported the above error.

What's the problem here?

Thanks,
Yan

Reply via email to