Ramana Inukonda Nagaraj created DRILL-2120:
----------------------------------------------
Summary: Bringing up multiple drillbits at same time results in
synchronization failure
Key: DRILL-2120
URL: https://issues.apache.org/jira/browse/DRILL-2120
Project: Apache Drill
Issue Type: Bug
Components: Tools, Build & Test
Affects Versions: 0.8.0
Reporter: Ramana Inukonda Nagaraj
Assignee: Jacques Nadeau
Repro:
With a fresh ZK install bring up 4 drillbits at the same time using something
like clush
clush -g ats /opt/drill/bin/drillbit.sh start
Looks like all 4 nodes try to query the ZK to see if the node exists and all of
them try to create it at the same time. Some succeed, Others don't. The ones
which fail have incorrect information about the state of the ZK and that would
explain the below stacktrace.
{code}
log4j:WARN See http://logging.apache.org/log4j/1.2/faq.html#noconfig for more
info.
Exception in thread "main"
org.apache.drill.exec.exception.DrillbitStartupException: Failure during
initial startup of Drillbit.
at org.apache.drill.exec.server.Drillbit.start(Drillbit.java:76)
at org.apache.drill.exec.server.Drillbit.start(Drillbit.java:60)
at org.apache.drill.exec.server.Drillbit.main(Drillbit.java:83)
Caused by: java.lang.RuntimeException: Failure while accessing Zookeeper
at
org.apache.drill.exec.store.sys.zk.ZkAbstractStore.putIfAbsent(ZkAbstractStore.java:135)
at
org.apache.drill.exec.store.StoragePluginRegistry.createPlugins(StoragePluginRegistry.java:150)
at
org.apache.drill.exec.store.StoragePluginRegistry.init(StoragePluginRegistry.java:130)
at org.apache.drill.exec.server.Drillbit.run(Drillbit.java:155)
at org.apache.drill.exec.server.Drillbit.start(Drillbit.java:73)
... 2 more
Caused by: java.lang.RuntimeException: Failure while accessing Zookeeper
at
org.apache.drill.exec.store.sys.zk.ZkPStore.createNodeInZK(ZkPStore.java:53)
at
org.apache.drill.exec.store.sys.zk.ZkAbstractStore.putIfAbsent(ZkAbstractStore.java:129)
... 6 more
Caused by: org.apache.zookeeper.KeeperException$NodeExistsException:
KeeperErrorCode = NodeExists for /drill-ats-build/sys.storage_plugins/cp
at org.apache.zookeeper.KeeperException.create(KeeperException.java:119)
at org.apache.zookeeper.KeeperException.create(KeeperException.java:51)
at org.apache.zookeeper.ZooKeeper.create(ZooKeeper.java:783)
at
org.apache.curator.framework.imps.CreateBuilderImpl$11.call(CreateBuilderImpl.java:676)
at
org.apache.curator.framework.imps.CreateBuilderImpl$11.call(CreateBuilderImpl.java:660)
at org.apache.curator.RetryLoop.callWithRetry(RetryLoop.java:107)
at
org.apache.curator.framework.imps.CreateBuilderImpl.pathInForeground(CreateBuilderImpl.java:656)
at
org.apache.curator.framework.imps.CreateBuilderImpl.protectedPathInForeground(CreateBuilderImpl.java:441)
at
org.apache.curator.framework.imps.CreateBuilderImpl.forPath(CreateBuilderImpl.java:431)
at
org.apache.curator.framework.imps.CreateBuilderImpl.forPath(CreateBuilderImpl.java:44)
at
org.apache.drill.exec.store.sys.zk.ZkPStore.createNodeInZK(ZkPStore.java:51)
... 7 more
{code}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)