Mona Chitnis created OOZIE-1921:
-----------------------------------

             Summary: Curator client reports connection loss to ZK under high load
                 Key: OOZIE-1921
                 URL: https://issues.apache.org/jira/browse/OOZIE-1921
             Project: Oozie
          Issue Type: Bug
          Components: HA
    Affects Versions: trunk
            Reporter: Mona Chitnis
             Fix For: trunk


We are seeing two types of ConnectionLoss exceptions from Curator when running 
Oozie under high load:

h5. [1] (znode transaction type: delete)
{code}
org.apache.curator.CuratorConnectionLossException: KeeperErrorCode = ConnectionLoss
        at org.apache.curator.ConnectionState.checkTimeouts(ConnectionState.java:198)
        at org.apache.curator.ConnectionState.getZooKeeper(ConnectionState.java:88)
        at org.apache.curator.CuratorZookeeperClient.getZooKeeper(CuratorZookeeperClient.java:115)
{code}

h5. [2]
{code}
org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode = ConnectionLoss for /oozie/locks/0037706-140704041907-oozie-oozi-W
        at org.apache.zookeeper.KeeperException.create(KeeperException.java:99)
        at org.apache.zookeeper.KeeperException.create(KeeperException.java:51)
        at org.apache.zookeeper.ZooKeeper.getChildren(ZooKeeper.java:1472)
{code}

We should probably implement a fallback in Oozie around its calls into the 
Curator library, to work around any inherent limitations of the library. 
However, I have not been able to find much documentation on Curator benchmarks.
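
One possible shape for such a fallback is an application-level retry with exponential backoff, layered on top of whatever RetryPolicy the Curator client is already configured with. The sketch below is illustrative only and is not existing Oozie or Curator code: the class ConnectionLossRetry, the method callWithRetry, its parameters, and the nested ConnectionLossException (a stand-in for the ZooKeeper exception in trace [2], so that the example compiles without ZooKeeper on the classpath) are all hypothetical names.

```java
import java.util.concurrent.Callable;

/**
 * Sketch only: retries an operation when it fails with a connection-loss error.
 * Every name in this file is hypothetical and is not part of Oozie or Curator.
 */
final class ConnectionLossRetry {

    /** Hypothetical stand-in for org.apache.zookeeper.KeeperException.ConnectionLossException. */
    static final class ConnectionLossException extends Exception {
        ConnectionLossException(String message) {
            super(message);
        }
    }

    private ConnectionLossRetry() {
    }

    /**
     * Runs the operation, retrying only on ConnectionLossException.
     * Sleeps baseSleepMs before the first retry and doubles the sleep each time.
     * Any other exception is rethrown immediately without a retry.
     */
    static <T> T callWithRetry(Callable<T> operation, int maxAttempts, long baseSleepMs)
            throws Exception {
        if (maxAttempts < 1) {
            throw new IllegalArgumentException("maxAttempts must be at least 1");
        }
        long sleepMs = baseSleepMs;
        for (int attempt = 1; ; attempt++) {
            try {
                return operation.call();
            } catch (ConnectionLossException e) {
                if (attempt >= maxAttempts) {
                    throw e; // out of attempts: surface the last failure
                }
                Thread.sleep(sleepMs);
                sleepMs *= 2; // exponential backoff between attempts
            }
        }
    }
}
```

In Oozie the Callable would wrap the actual Curator call (for example the lock acquisition or the getChildren read). A blind retry is only safe for idempotent operations: a ConnectionLoss does not say whether the request reached the server, so a delete as in [1] may already have succeeded and the retry would then fail with NoNode.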




--
This message was sent by Atlassian JIRA
(v6.2#6252)
