brucemen711 opened a new issue #4516:
URL: https://github.com/apache/incubator-dolphinscheduler/issues/4516


   I set spark task (spark streaming) to 100 retries if job failed. But i got 
task hanging when zookeeper OOM. The log is below:
   `java.lang.OutOfMemoryError: Java heap space
        at java.nio.HeapByteBuffer.<init>(HeapByteBuffer.java:57)
        at java.nio.ByteBuffer.allocate(ByteBuffer.java:335)
        at 
org.apache.jute.BinaryOutputArchive.<init>(BinaryOutputArchive.java:33)
        at 
org.apache.jute.BinaryOutputArchive.getArchive(BinaryOutputArchive.java:38)
        at org.apache.zookeeper.ClientCnxn$Packet.createBB(ClientCnxn.java:298)
        at 
org.apache.zookeeper.ClientCnxnSocketNIO.doIO(ClientCnxnSocketNIO.java:115)
        at 
org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:366)
        at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1141)
   [WARN] 2021-01-21 03:04:12.958 org.apache.zookeeper.ClientCnxn:[1108] - 
Client session timed out, have not heard from server in 27345ms for sessionid 
0x1b7b1c3b7000d
   [INFO] 2021-01-21 03:04:12.959 
org.apache.dolphinscheduler.server.worker.runner.TaskExecuteThread:[164] - 
develop mode is: false
   [INFO] 2021-01-21 03:04:12.960 org.apache.zookeeper.ClientCnxn:[1156] - 
Client session timed out, have not heard from server in 27345ms for sessionid 
0x1b7b1c3b7000d, closing socket connection and attempting reconnect
   [INFO] 2021-01-21 03:04:12.961 
org.apache.dolphinscheduler.server.worker.runner.TaskExecuteThread:[182] - exec 
local path: /tmp/dolphinscheduler/exec/process/3/7/283/870 cleared.
   [INFO] 2021-01-21 03:04:13.060 
org.apache.curator.framework.state.ConnectionStateManager:[251] - State change: 
SUSPENDED
   [INFO] 2021-01-21 03:04:13.061 
org.apache.curator.framework.state.ConnectionStateManager:[251] - State change: 
SUSPENDED`
   
   **Which version of DolphinScheduler:**
    -[1.3.4]
   
   **Additional context**
   Add any other context about the problem here.
   
   
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to