[ 
https://issues.apache.org/jira/browse/HBASE-3755?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13018505#comment-13018505
 ] 

Jean-Daniel Cryans commented on HBASE-3755:
-------------------------------------------

>From the mailing list:

{quote}
11/04/08 12:20:06 WARN zookeeper.ClientCnxn: Session 0x0 for server null, 
unexpected error, closing socket connection and attempting reconnect
java.net.ConnectException: Connection refused
       at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
       at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:567)
       at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1078)
11/04/08 12:20:06 WARN zookeeper.ZooKeeperWrapper: Problem getting stats for 
/hbase/rs
org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode = 
ConnectionLoss for /hbase/rs
       at org.apache.zookeeper.KeeperException.create(KeeperException.java:90)
       at org.apache.zookeeper.KeeperException.create(KeeperException.java:42)
       at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:809)
       at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:837)
       at 
org.apache.hadoop.hbase.zookeeper.ZooKeeperWrapper.getRSDirectoryCount(ZooKeeperWrapper.java:754)
       at org.apache.hadoop.hbase.client.HTable.getCurrentNrHRS(HTable.java:173)
       at org.apache.hadoop.hbase.client.HTable.<init>(HTable.java:147)
       at 
org.apache.hadoop.hbase.client.MetaScanner.metaScan(MetaScanner.java:102)
       at 
org.apache.hadoop.hbase.client.HConnectionManager$TableServers.prefetchRegionCache(HConnectionManager.java:732)
       at 
org.apache.hadoop.hbase.client.HConnectionManager$TableServers.locateRegionInMeta(HConnectionManager.java:783)
       at 
org.apache.hadoop.hbase.client.HConnectionManager$TableServers.locateRegion(HConnectionManager.java:677)
       at 
org.apache.hadoop.hbase.client.HConnectionManager$TableServers.relocateRegion(HConnectionManager.java:650)
       at 
org.apache.hadoop.hbase.client.HConnectionManager$TableServers.getRegionLocation(HConnectionManager.java:470)
       at 
org.apache.hadoop.hbase.client.ServerCallable.instantiateServer(ServerCallable.java:57)
       at 
org.apache.hadoop.hbase.client.HConnectionManager$TableServers.getRegionServerWithRetries(HConnectionManager.java:1145)
       at org.apache.hadoop.hbase.client.HTable.get(HTable.java:503)
{quote}

This is really ugly, I think this is what you were pointing out Gary?

> Catch zk's ConnectionLossException and augment error message with more help
> ---------------------------------------------------------------------------
>
>                 Key: HBASE-3755
>                 URL: https://issues.apache.org/jira/browse/HBASE-3755
>             Project: HBase
>          Issue Type: Bug
>    Affects Versions: 0.90.1
>            Reporter: Jean-Daniel Cryans
>            Assignee: Jean-Daniel Cryans
>             Fix For: 0.90.3
>
>         Attachments: HBASE-3755.patch
>
>
> 0.90 has a different behavior regarding ZK connections, it tends to create 
> too many of them and it's not obvious to users what they should do to fix. I 
> think I've helped at least 5 different users this week with this error.
> By catching ConnectionLossException and augmenting its message, we could say 
> something like "it's possible that the ZooKeeper server has too many 
> connections from this IP, see doc at blah" since the ZK server isn't nice 
> enough to let us know what's going on.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to