Can you check the zk logs at /var/log/zookeeper/zookeeper.out and see if you find something?
Also, see if you can use the zkCli.sh client to query in a loop for few minutes (with few secs of interval between queries) and see if you get similar intermittent connection issues? -Gour On 2/25/15, 6:53 AM, "Jon Maron" <[email protected]> wrote: >I¹ve noticed that I¹m having intermittent issues accessing the zookeeper >quorum during ³destroy² attempts: > >2015-02-25 09:48:02,345 [main] WARN client.SliderClient >(SliderClient.java:getZkClient(523)) - Unable to connect to zookeeper >quorum >c6402.ambari.apache.org:2181,c6404.ambari.apache.org:2181,c6403.ambari.apa >che.org:2181,c6405.ambari.apache.org:2181 >java.net.ConnectException: Unable to connect to ZK quorum > at >org.apache.slider.core.zk.BlockingZKWatcher.waitForZKConnection(BlockingZK >Watcher.java:63) > at >org.apache.slider.client.SliderClient.getZkClient(SliderClient.java:518) > at >org.apache.slider.client.SliderClient.deleteZookeeperNode(SliderClient.jav >a:458) > at >org.apache.slider.client.SliderClient.actionDestroy(SliderClient.java:550) > at org.apache.slider.client.SliderClient.exec(SliderClient.java:383) > at >org.apache.slider.client.SliderClient.runService(SliderClient.java:348) > at >org.apache.slider.core.main.ServiceLauncher.launchService(ServiceLauncher. >java:188) > at >org.apache.slider.core.main.ServiceLauncher.launchServiceRobustly(ServiceL >auncher.java:475) > at >org.apache.slider.core.main.ServiceLauncher.launchServiceAndExit(ServiceLa >uncher.java:403) > at >org.apache.slider.core.main.ServiceLauncher.serviceMain(ServiceLauncher.ja >va:630) > at org.apache.slider.Slider.main(Slider.java:49) >2015-02-25 09:48:02,656 [main] DEBUG client.SliderClient >(SliderClient.java:deleteZookeeperNode(474)) - Unable to recursively >delete zk node /services/slider/users/jmaron/hbase-test >2015-02-25 09:48:02,656 [main] DEBUG client.SliderClient >(SliderClient.java:deleteZookeeperNode(475)) - Reason: >org.apache.zookeeper.KeeperException$ConnectionLossException: >KeeperErrorCode = ConnectionLoss for >/services/slider/users/jmaron/hbase-test > at org.apache.zookeeper.KeeperException.create(KeeperException.java:99) > at org.apache.zookeeper.KeeperException.create(KeeperException.java:51) > at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1045) > at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1073) > at org.apache.slider.core.zk.ZKIntegration.stat(ZKIntegration.java:164) > at > org.apache.slider.core.zk.ZKIntegration.exists(ZKIntegration.java:160) > at >org.apache.slider.client.SliderClient.deleteZookeeperNode(SliderClient.jav >a:460) > at >org.apache.slider.client.SliderClient.actionDestroy(SliderClient.java:550) > at org.apache.slider.client.SliderClient.exec(SliderClient.java:383) > at >org.apache.slider.client.SliderClient.runService(SliderClient.java:348) > at >org.apache.slider.core.main.ServiceLauncher.launchService(ServiceLauncher. >java:188) > at >org.apache.slider.core.main.ServiceLauncher.launchServiceRobustly(ServiceL >auncher.java:475) > at >org.apache.slider.core.main.ServiceLauncher.launchServiceAndExit(ServiceLa >uncher.java:403) > at >org.apache.slider.core.main.ServiceLauncher.serviceMain(ServiceLauncher.ja >va:630) > at org.apache.slider.Slider.main(Slider.java:49) > >Any ideas on why that may occur? My cluster is running on a set of VMs >on my development box. These failed ZK interactions will subsequently >yield issues in trying to recreate the given application (in this case >HBase) > >‹ Jon
