Hi,

We are experiencing an issue in our HBase cluster wherein some gets are 
timing out with:

java.io.IOException: Giving up trying to get region server: thread is interrupted.
                at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation.getRegionServerWithRetries(HConnectionManager.java:1016)
                at org.apache.hadoop.hbase.client.HTable.get(HTable.java:546)


When we look at the logs of the master, ZooKeeper, and the region servers, 
nothing indicates anything abnormal.

I tried reading the functions below, but so far could not make much out of 
them.
https://github.com/apache/hbase/blob/trunk/src/main/java/org/apache/hadoop/hbase/client/HConnectionManager.java
  - getRegionServerWithRetries starts at line 1233
https://github.com/apache/hbase/blob/trunk/src/main/java/org/apache/hadoop/hbase/client/HTable.java
  - HTable.get starts at line 611


Could you please suggest the scenarios in which all retries can be 
exhausted, resulting in thread interruption?
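
To make the question concrete, here is a minimal sketch of how I understand 
a retry loop of this kind could produce such a message. This is not the 
HBase source: the class RetrySketch, the method callWithRetries, and the 
retry and pause values are all hypothetical. It assumes the loop sleeps 
between attempts and turns an interrupt during that sleep into an 
IOException. If that assumption holds, the error would come from the client 
thread being interrupted while waiting, not from the retries running out.

```java
import java.io.IOException;
import java.util.concurrent.Callable;

public class RetrySketch {
    // Hypothetical helper, not an HBase API: retry a call, pausing between attempts.
    static <T> T callWithRetries(Callable<T> call, int numRetries, long pauseMs)
            throws IOException {
        Exception last = null;
        for (int tries = 0; tries < numRetries; tries++) {
            try {
                return call.call();
            } catch (Exception e) {
                last = e; // remember the failure and try again
            }
            if (tries == numRetries - 1) {
                break; // no pause after the final attempt
            }
            try {
                Thread.sleep(pauseMs);
            } catch (InterruptedException e) {
                // Restore the interrupt flag and give up immediately.
                Thread.currentThread().interrupt();
                throw new IOException("Giving up: thread is interrupted.");
            }
        }
        throw new IOException("Retries exhausted after " + numRetries + " attempts", last);
    }

    public static void main(String[] args) {
        // Simulate an interrupt arriving while the client is between retries.
        Thread.currentThread().interrupt();
        try {
            RetrySketch.<String>callWithRetries(() -> {
                throw new IllegalStateException("simulated failure");
            }, 3, 100L);
        } catch (IOException e) {
            System.out.println(e.getMessage());
        } finally {
            Thread.interrupted(); // clear the flag set for this demo
        }
    }
}
```

In this sketch the two failure modes give different messages, so which one 
appears tells you whether the thread was interrupted or the attempts ran out.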

We have seen this issue in two of our HBase clusters, where load is quite low. 
We have 20 reads per minute, and we run 1 ZooKeeper node and 4 region servers 
in fully distributed mode (Hadoop). We are using CDH3.

Thanks,
Srikanth
