[ 
https://issues.apache.org/jira/browse/HDDS-10750?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17841300#comment-17841300
 ] 

Attila Doroszlai commented on HDDS-10750:
-----------------------------------------

Affects Ratis server in datanode, OM and SCM, too:

* 
https://github.com/adoroszlai/ozone-build-results/blob/master/2024/01/24/28749/it-om/2024-01-24T20-36-21_537-jvmRun1.dump
* 
https://github.com/adoroszlai/ozone-build-results/blob/master/2024/01/26/28794/it-flaky/2024-01-26T06-04-59_796-jvmRun1.dump
* 
https://github.com/adoroszlai/ozone-build-results/blob/master/2024/01/26/28826/it-flaky/2024-01-26T21-20-16_332-jvmRun1.dump
* 
https://github.com/adoroszlai/ozone-build-results/blob/master/2024/01/27/28836/it-flaky/2024-01-27T12-33-29_143-jvmRun1.dump
* 
https://github.com/adoroszlai/ozone-build-results/blob/master/2024/01/30/28939/it-om/2024-01-31T00-10-34_625-jvmRun1.dump
* 
https://github.com/adoroszlai/ozone-build-results/blob/master/2024/02/02/29010/it-flaky/2024-02-02T08-48-20_084-jvmRun1.dump
* 
https://github.com/adoroszlai/ozone-build-results/blob/master/2024/02/09/29222/it-flaky/2024-02-09T18-22-22_464-jvmRun1.dump
* 
https://github.com/adoroszlai/ozone-build-results/blob/master/2024/02/10/29239/it-flaky/2024-02-10T22-32-58_069-jvmRun1.dump
* 
https://github.com/adoroszlai/ozone-build-results/blob/master/2024/02/15/29348/it-flaky/2024-02-15T17-33-11_436-jvmRun1.dump
* 
https://github.com/adoroszlai/ozone-build-results/blob/master/2024/03/23/30175/it-flaky/2024-03-23T14-54-03_236-jvmRun1.dump
* 
https://github.com/adoroszlai/ozone-build-results/blob/master/2024/04/16/30689/it-container/2024-04-16T15-35-52_036-jvmRun1.dump
* 
https://github.com/adoroszlai/ozone-build-results/blob/master/2024/04/19/30765/it-hdds/2024-04-19T09-12-12_663-jvmRun1.dump
* 
https://github.com/adoroszlai/ozone-build-results/blob/master/2024/04/21/30803/it-client/2024-04-21T16-53-06_683-jvmRun1.dump
* 
https://github.com/adoroszlai/ozone-build-results/blob/master/2024/04/26/30910/it-hdds/2024-04-26T09-26-21_192-jvmRun1.dump
* 
https://github.com/adoroszlai/ozone-build-results/blob/master/2024/04/26/30910/it-om/2024-04-26T09-12-15_808-jvmRun1.dump


> Intermittent fork timeout while stopping Ratis server
> -----------------------------------------------------
>
>                 Key: HDDS-10750
>                 URL: https://issues.apache.org/jira/browse/HDDS-10750
>             Project: Apache Ozone
>          Issue Type: Sub-task
>            Reporter: Attila Doroszlai
>            Priority: Major
>         Attachments: 2024-04-21T16-53-06_683-jvmRun1.dump, 
> org.apache.hadoop.ozone.client.rpc.TestECKeyOutputStreamWithZeroCopy-output.txt
>
>
> {code:title=https://github.com/adoroszlai/ozone-build-results/blob/master/2024/04/21/30803/it-client/output.log}
> [INFO] Running 
> org.apache.hadoop.ozone.client.rpc.TestECKeyOutputStreamWithZeroCopy
> [INFO] 
> [INFO] Results:
> ...
> ... There was a timeout or other error in the fork
> {code}
> {code}
> "main" 
>    java.lang.Thread.State: WAITING
>         at java.lang.Object.wait(Native Method)
>         at 
> java.util.concurrent.ForkJoinTask.externalAwaitDone(ForkJoinTask.java:334)
>         at java.util.concurrent.ForkJoinTask.doInvoke(ForkJoinTask.java:405)
>         at java.util.concurrent.ForkJoinTask.invoke(ForkJoinTask.java:734)
>         at 
> java.util.stream.ForEachOps$ForEachOp.evaluateParallel(ForEachOps.java:159)
>         at 
> java.util.stream.ForEachOps$ForEachOp$OfRef.evaluateParallel(ForEachOps.java:173)
>         at 
> java.util.stream.AbstractPipeline.evaluate(AbstractPipeline.java:233)
>         at 
> java.util.stream.ReferencePipeline.forEach(ReferencePipeline.java:485)
>         at 
> java.util.stream.ReferencePipeline$Head.forEach(ReferencePipeline.java:650)
>         at 
> org.apache.hadoop.ozone.MiniOzoneClusterImpl.stopDatanodes(MiniOzoneClusterImpl.java:473)
>         at 
> org.apache.hadoop.ozone.MiniOzoneClusterImpl.stop(MiniOzoneClusterImpl.java:414)
>         at 
> org.apache.hadoop.ozone.MiniOzoneClusterImpl.shutdown(MiniOzoneClusterImpl.java:400)
>         at 
> org.apache.hadoop.ozone.client.rpc.AbstractTestECKeyOutputStream.shutdown(AbstractTestECKeyOutputStream.java:160)
> "ForkJoinPool.commonPool-worker-7" 
>    java.lang.Thread.State: TIMED_WAITING
> ...
>         at 
> java.util.concurrent.ThreadPoolExecutor.awaitTermination(ThreadPoolExecutor.java:1475)
>         at 
> org.apache.ratis.util.ConcurrentUtils.shutdownAndWait(ConcurrentUtils.java:144)
>         at 
> org.apache.ratis.util.ConcurrentUtils.shutdownAndWait(ConcurrentUtils.java:136)
>         at 
> org.apache.ratis.server.impl.RaftServerProxy.lambda$close$9(RaftServerProxy.java:438)
>         at 
> org.apache.ratis.server.impl.RaftServerProxy$$Lambda$1923/977291773.run(Unknown
>  Source)
>         at 
> org.apache.ratis.util.LifeCycle.lambda$checkStateAndClose$7(LifeCycle.java:306)
>         at org.apache.ratis.util.LifeCycle$$Lambda$1204/655954062.get(Unknown 
> Source)
>         at 
> org.apache.ratis.util.LifeCycle.checkStateAndClose(LifeCycle.java:326)
>         at 
> org.apache.ratis.util.LifeCycle.checkStateAndClose(LifeCycle.java:304)
>         at 
> org.apache.ratis.server.impl.RaftServerProxy.close(RaftServerProxy.java:415)
>         at 
> org.apache.hadoop.ozone.container.common.transport.server.ratis.XceiverServerRatis.stop(XceiverServerRatis.java:603)
>         at 
> org.apache.hadoop.ozone.container.ozoneimpl.OzoneContainer.stop(OzoneContainer.java:484)
>         at 
> org.apache.hadoop.ozone.container.common.statemachine.DatanodeStateMachine.close(DatanodeStateMachine.java:447)
>         at 
> org.apache.hadoop.ozone.container.common.statemachine.DatanodeStateMachine.stopDaemon(DatanodeStateMachine.java:637)
>         at 
> org.apache.hadoop.ozone.HddsDatanodeService.stop(HddsDatanodeService.java:550)
>         at 
> org.apache.hadoop.ozone.MiniOzoneClusterImpl.stopDatanode(MiniOzoneClusterImpl.java:479)
>         at 
> org.apache.hadoop.ozone.MiniOzoneClusterImpl$$Lambda$2077/645273703.accept(Unknown
>  Source)
> {code}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to