In case anyone was wondering, the issue was resolved by copying
zoo.cfg into the Hadoop conf directory (which is on the classpath) on
every node of the cluster. Thanks.
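For anyone hitting the same thing, the fix amounts to something like the following dry run (the node names and paths are placeholders, not my actual cluster layout):

```shell
#!/bin/sh
# Dry run of the fix: put zoo.cfg into Hadoop's conf directory
# (which is on the task classpath) on every node.
# NODES and DEST are placeholders -- substitute your own cluster.
NODES="node01 node02 node03"
DEST=/opt/hadoop/conf
for h in $NODES; do
    # echo only, so nothing is actually copied in this sketch
    echo "scp zoo.cfg $h:$DEST/zoo.cfg"
done
```

Run it, inspect the printed commands, then drop the echo once the paths match your installation.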
On 02/22/2013 12:31 PM, kaveh minooie wrote:
Hi everyone,
I have been having this problem for a couple of days now and would
appreciate any ideas or suggestions anyone might have. I am using
Nutch 2.x with HBase. Due to a Nutch requirement I need to use an
older version of HBase (0.90.6, on Hadoop 1.1.1 with 10 nodes and
ZooKeeper 3.5.0 [trunk]).
HBase seems to be running fine, though I would appreciate it if
someone could show me how to actually test it systematically. I am
able to create and read data from HBase, but when I run any Nutch
command, something very similar to this happens as soon as the job
starts running. This, for example, is the output of nutch inject:
13/02/22 12:07:30 INFO mapred.JobClient: map 0% reduce 0%
13/02/22 12:07:52 INFO mapred.JobClient: Task Id : attempt_201302191325_0013_m_000000_0, Status : FAILED
org.apache.gora.util.GoraException: org.apache.hadoop.hbase.ZooKeeperConnectionException: HBase is able to connect to ZooKeeper but the connection closes immediately. This could be a sign that the server has too many connections (30 is the default). Consider inspecting your ZK server logs for that error and then make sure you are reusing HBaseConfiguration as often as you can. See HTable's javadoc for more information.
        at org.apache.gora.store.DataStoreFactory.createDataStore(DataStoreFactory.java:167)
        at org.apache.gora.store.DataStoreFactory.createDataStore(DataStoreFactory.java:118)
        at org.apache.gora.mapreduce.GoraOutputFormat.getRecordWriter(GoraOutputFormat.java:88)
        at org.apache.hadoop.mapred.MapTask$NewDirectOutputCollector.<init>(MapTask.java:628)
        at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:753)
        at org.apache.hadoop.mapred.MapTask.run(MapTask.java:370)
        at org.apache.hadoop.mapred.Child$4.run(Child.java:255)
        at java.security.AccessController.doPrivileged(Native Method)
        at javax.security.auth.Subject.doAs(Unknown Source)
        at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1136)
        at org.apache.hadoop.mapred.Child.main(Child.java:249)
Caused by: org.apache.hadoop.hbase.ZooKeeperConnectionException: HBase is able to connect to ZooKeeper but the connection closes immediately. This could be a sign that the server has too many connections (30 is the default). Consider inspecting your ZK server logs for that error and then make sure you are reusing HBaseConfiguration as often as you can. See HTable's javadoc for more information.
        at org.apache.hadoop.hbase.zookeeper.ZooKeeperWatcher.<init>(ZooKeeperWatcher.java:156)
        at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation.getZooKeeperWatcher(HConnectionManager.java:1265)
        at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation.setupZookeeperTrackers(HConnectionManager.java:526)
        at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation.<init>(HConnectionManager.java:516)
        at org.apache.hadoop.hbase.client.HConnectionManager.getConnection(HConnectionManager.java:173)
        at org.apache.hadoop.hbase.client.HBaseAdmin.<init>(HBaseAdmin.java:93)
        at org.apache.gora.hbase.store.HBaseStore.initialize(HBaseStore.java:108)
        at org.apache.gora.store.DataStoreFactory.initializeDataStore(DataStoreFactory.java:102)
        at org.apache.gora.store.DataStoreFactory.createDataStore(DataStoreFactory.java:161)
        ... 10 more
Caused by: org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode = ConnectionLoss for /hbase
        at org.apache.zookeeper.KeeperException.create(KeeperException.java:99)
        at org.apache.zookeeper.KeeperException.create(KeeperException.java:51)
        at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1237)
        at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1265)
        at org.apache.hadoop.hbase.zookeeper.ZKUtil.createAndFailSilent(ZKUtil.java:931)
        at org.apache.hadoop.hbase.zookeeper.ZooKeeperWatcher.<init>(ZooKeeperWatcher.java:134)
        ... 18 more
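For what it's worth, the create/read testing I mentioned is just a round trip in the HBase shell, along these lines (the table name and values are made up):

```
hbase shell
> status                              # quick cluster liveness check
> create 'smoketest', 'cf'            # DDL exercises the master
> put 'smoketest', 'r1', 'cf:a', 'v1'
> get 'smoketest', 'r1'               # should come back with cf:a = v1
> scan 'smoketest'
> disable 'smoketest'
> drop 'smoketest'
```

That round trip succeeds on my cluster, which is part of why this failure is so confusing.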
Now, I know that I am not running out of connections. For one thing, I
have increased the connection limit to 200 in zoo.cfg, and here is
what is in the ZooKeeper log file around that time:
2013-02-22 12:07:27,704 [myid:] - INFO  [NIOServerCxnFactory.AcceptThread:0.0.0.0/0.0.0.0:2181:NIOServerCnxnFactory$AcceptThread@289] - Accepted socket connection from /127.0.0.1:55073
2013-02-22 12:07:27,707 [myid:] - INFO  [NIOWorkerThread-3:ZooKeeperServer@810] - Client attempting to establish new session at /127.0.0.1:55073
2013-02-22 12:07:27,720 [myid:] - INFO  [SyncThread:0:ZooKeeperServer@566] - Established session 0x13d037b8e6b0016 with negotiated timeout 40000 for client /127.0.0.1:55073
2013-02-22 12:07:27,945 [myid:] - INFO  [NIOServerCxnFactory.AcceptThread:0.0.0.0/0.0.0.0:2181:NIOServerCnxnFactory$AcceptThread@289] - Accepted socket connection from /127.0.0.1:55075
2013-02-22 12:07:27,946 [myid:] - INFO  [NIOWorkerThread-2:ZooKeeperServer@810] - Client attempting to establish new session at /127.0.0.1:55075
2013-02-22 12:07:27,953 [myid:] - INFO  [SyncThread:0:ZooKeeperServer@566] - Established session 0x13d037b8e6b0017 with negotiated timeout 40000 for client /127.0.0.1:55075
2013-02-22 12:07:28,010 [myid:] - INFO  [ProcessThread(sid:0 cport:-1)::PrepRequestProcessor@533] - Processed session termination for sessionid: 0x13d037b8e6b0017
2013-02-22 12:07:28,011 [myid:] - INFO  [NIOWorkerThread-6:NIOServerCnxn@1000] - Closed socket connection for client /127.0.0.1:55075 which had sessionid 0x13d037b8e6b0017
2013-02-22 12:08:14,005 [myid:] - WARN  [NIOWorkerThread-7:NIOServerCnxn@362] - Unable to read additional data from client sessionid 0x13d037b8e6b0016, likely client has closed socket
2013-02-22 12:08:14,005 [myid:] - INFO  [NIOWorkerThread-7:NIOServerCnxn@1000] - Closed socket connection for client /127.0.0.1:55073 which had sessionid 0x13d037b8e6b0016
2013-02-22 12:08:48,000 [myid:] - INFO  [SessionTracker:ZooKeeperServer@304] - Expiring session 0x13d037b8e6b0016, timeout of 40000ms exceeded
2013-02-22 12:08:48,001 [myid:] - INFO  [ProcessThread(sid:0 cport:-1)::PrepRequestProcessor@533] - Processed session termination for sessionid: 0x13d037b8e6b0016
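To be concrete, the connection limit I raised is maxClientCnxns; my zoo.cfg looks roughly like this (the 200 is my actual setting, the other values are just illustrative defaults):

```
# zoo.cfg (fragment) -- only maxClientCnxns=200 is my real setting
tickTime=2000
dataDir=/var/lib/zookeeper
clientPort=2181
# per-client-IP connection limit; the default is what the HBase
# error message complains about
maxClientCnxns=200
```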
I also don't think it is a heartbeat or GC related issue, since there
is really no load at all on these servers right now. I know this is a
hybrid problem involving three separate products (Nutch, HBase,
ZooKeeper), which is why I am asking on all of their mailing lists.
Also, to avoid confusion with the similar problems in older versions
that were supposedly solved, let me say it again: I am using HBase
0.90.6 and ZooKeeper 3.5.0 (commit 46b565e6) with Nutch 2.x (commit
f02dcf625); both are either the latest or very recent versions. If
anyone has any idea what is happening here, I would very much like to
hear it.
Thanks,