The whole error message also contains Unable to read additional data from server
14/07/24 23:36:06 INFO zookeeper.ClientCnxn: Unable to read additional data from server sessionid 0x1476a43ae540009, likely server has closed socket, closing socket connection and attempting reconnect 14/07/24 23:36:07 WARN zk.ZooKeeperExt: exists: Connection loss on attempt 0, waiting 5000 msecs before retrying. org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode = ConnectionLoss for /_hadoopBsp/job_201407242252_0004/_workerProgresses at org.apache.zookeeper.KeeperException.create(KeeperException.java:99) at org.apache.zookeeper.KeeperException.create(KeeperException.java:51) at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1041) at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1069) at org.apache.giraph.zk.ZooKeeperExt.exists(ZooKeeperExt.java:360) at org.apache.giraph.job.JobProgressTracker$2.run(JobProgressTracker.java:87) at java.lang.Thread.run(Thread.java:744) On Thu, Jul 24, 2014 at 4:20 PM, Jing Fan <[email protected]> wrote: > I changed the hostname but it doesn't work. > > The error seems to be related to zookeeper :( > > > On Thu, Jul 24, 2014 at 9:49 AM, Young Han <[email protected]> wrote: > >> Hi, >> >> Try making your hostname all lower case (sudo hostname xxxx, and change >> /etc/hostname). I think this may be an issue caused by/related to the >> GIRAPH-904 patch. >> >> Young >> >> >> On Thu, Jul 24, 2014 at 10:25 AM, Jing Fan <[email protected]> wrote: >> >>> Does anyone know the reason of this error and the solution? >>> >>> Thank you very much! >>> >>> >>> On Wed, Jul 23, 2014 at 4:51 PM, Jing Fan <[email protected]> wrote: >>> >>>> Hi Vivek, >>>> >>>> Giraph comes wtih Zookeeper itself so I didn't install it separately. I >>>> run the same program several times. Sometimes it can succeed but most of >>>> the time it will come with the Zookeeper null error. >>>> >>>> Thanks. >>>> >>>> >>>> On Wed, Jul 23, 2014 at 4:35 PM, Sardeshmukh, Vivek < >>>> [email protected]> wrote: >>>> >>>>> Hi Jing, >>>>> >>>>> >>>>> Did you install Zookeeper? Is it running? If yes, then on >>>>> which port? I'm not sure whether it is necessary to install Zookeeper >>>>> separately but it's not that complicated to install so installed it >>>>> and I didn't had any problem. >>>>> >>>>> >>>>> To install zookeeper : >>>>> >>>>> wget --no-check-certificate >>>>> http://apache.mesi.com.ar/zookeeper/zookeeper-3.4.5/zookeeper-3.4.5.tar.gz >>>>> >>>>> >>>>> tar zxvf zookeeper-3.4.5.tar.gz >>>>> >>>>> cd zookeeper-3.4.5 >>>>> >>>>> mv conf/zoo_sample.cfg conf/zoo.cfg >>>>> >>>>> sudo bin/zkServer.sh start >>>>> >>>>> >>>>> And when running a giraph job specify the zookeeper port- >>>>> -Dgiraph.zkList=<hostname where u installed zookeeper>:2181 >>>>> >>>>> >>>>> >>>>> Vivek >>>>> ------------------------------ >>>>> *From:* Jing Fan <[email protected]> >>>>> *Sent:* Wednesday, July 23, 2014 4:14 PM >>>>> *To:* [email protected] >>>>> *Subject:* Zookeeper server null error when running giraph >>>>> >>>>> Hi, >>>>> >>>>> When I run giraph, I always have following error message and the >>>>> task will fail >>>>> >>>>> 14/07/23 23:10:21 INFO zookeeper.ClientCnxn: Opening socket connection >>>>> to server localhost/127.0.0.1:22181. Will not attempt to authenticate >>>>> using SASL (unknown error) >>>>> 14/07/23 23:10:21 WARN zookeeper.ClientCnxn: Session 0x1476505b6cd0009 >>>>> for server null, unexpected error, closing socket connection and >>>>> attempting reconnect >>>>> java.net.ConnectException: Connection refused >>>>> at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method) >>>>> at >>>>> sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:739) >>>>> at >>>>> org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350) >>>>> at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1068) >>>>> >>>>> >>>>> Does anyone know the reason and how to solve it? >>>>> >>>>> >>>>> Thanks! >>>>> >>>>> >>>> >>> >> >
