Attila Doroszlai created HDDS-10176:
---------------------------------------

             Summary: Intermittent timeout shutting down MiniOzoneCluster
                 Key: HDDS-10176
                 URL: https://issues.apache.org/jira/browse/HDDS-10176
             Project: Apache Ozone
          Issue Type: Bug
            Reporter: Attila Doroszlai


Cluster shutdown is blocked in the main thread, waiting for the Ratis server executor to terminate:

{code}
"main"
   java.lang.Thread.State: TIMED_WAITING
        at sun.misc.Unsafe.park(Native Method)
        at java.util.concurrent.locks.LockSupport.parkNanos(LockSupport.java:215)
        at java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.awaitNanos(AbstractQueuedSynchronizer.java:2078)
        at java.util.concurrent.ThreadPoolExecutor.awaitTermination(ThreadPoolExecutor.java:1475)
        at org.apache.ratis.util.ConcurrentUtils.shutdownAndWait(ConcurrentUtils.java:144)
        at org.apache.ratis.util.ConcurrentUtils.shutdownAndWait(ConcurrentUtils.java:136)
        at org.apache.ratis.server.impl.RaftServerProxy.lambda$close$9(RaftServerProxy.java:438)
        at org.apache.ratis.server.impl.RaftServerProxy$$Lambda$1625/1179968371.run(Unknown Source)
        at org.apache.ratis.util.LifeCycle.lambda$checkStateAndClose$7(LifeCycle.java:306)
        at org.apache.ratis.util.LifeCycle$$Lambda$1127/1377650693.get(Unknown Source)
        at org.apache.ratis.util.LifeCycle.checkStateAndClose(LifeCycle.java:326)
        at org.apache.ratis.util.LifeCycle.checkStateAndClose(LifeCycle.java:304)
        at org.apache.ratis.server.impl.RaftServerProxy.close(RaftServerProxy.java:415)
        at org.apache.hadoop.ozone.om.ratis.OzoneManagerRatisServer.stop(OzoneManagerRatisServer.java:597)
        at org.apache.hadoop.ozone.om.OzoneManager.stop(OzoneManager.java:2226)
        at org.apache.hadoop.ozone.MiniOzoneClusterImpl.stopOM(MiniOzoneClusterImpl.java:557)
        at org.apache.hadoop.ozone.MiniOzoneHAClusterImpl.stop(MiniOzoneHAClusterImpl.java:311)
        at org.apache.hadoop.ozone.MiniOzoneClusterImpl.shutdown(MiniOzoneClusterImpl.java:453)
        at org.apache.hadoop.ozone.om.TestOzoneManagerHA.shutdown(TestOzoneManagerHA.java:209)
{code}

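Ratis might wait up to 1 day here, because the single-argument {{shutdownAndWait}} called from {{RaftServerProxy.close}} uses a {{TimeDuration.ONE_DAY}} timeout: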

{code:title=https://github.com/apache/ratis/blob/e6f2554610941d8ca18aee6df1d5c0e78275a003/ratis-server/src/main/java/org/apache/ratis/server/impl/RaftServerProxy.java#L438}
        ConcurrentUtils.shutdownAndWait(executor.get());
{code}

{code:title=https://github.com/apache/ratis/blob/561cd6d48ab95ff72bbfd4a89c9074581d2ad1bf/ratis-common/src/main/java/org/apache/ratis/util/ConcurrentUtils.java#L135-L136}
  static void shutdownAndWait(ExecutorService executor) {
    shutdownAndWait(TimeDuration.ONE_DAY, executor, timeout -> {
{code}
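
For illustration only, a bounded variant of the shutdown-and-wait pattern could look like the sketch below. This is an assumption about one possible mitigation, not the actual Ratis or Ozone fix: the class {{BoundedShutdown}} and its method are hypothetical, and only standard {{java.util.concurrent}} calls are used. It waits a caller-supplied timeout instead of one day, then falls back to {{shutdownNow()}} to interrupt tasks that are still running.

```java
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;

// Hypothetical helper, not part of Ratis or Ozone.
final class BoundedShutdown {
  private BoundedShutdown() {
  }

  /**
   * Shuts the executor down and waits at most {@code timeout} for running
   * tasks to finish. If they do not, interrupts them with shutdownNow()
   * and waits once more for the same duration.
   *
   * @return true if the executor terminated within the bounded waits
   */
  static boolean shutdownAndWait(ExecutorService executor,
      long timeout, TimeUnit unit) {
    executor.shutdown(); // stop accepting new tasks
    try {
      if (executor.awaitTermination(timeout, unit)) {
        return true;
      }
      executor.shutdownNow(); // interrupt tasks that are still running
      return executor.awaitTermination(timeout, unit);
    } catch (InterruptedException e) {
      executor.shutdownNow();
      Thread.currentThread().interrupt(); // preserve the interrupt status
      return false;
    }
  }

  public static void main(String[] args) {
    ExecutorService executor = Executors.newSingleThreadExecutor();
    // Simulate a task that would otherwise keep shutdown waiting.
    executor.submit(() -> {
      try {
        Thread.sleep(60_000);
      } catch (InterruptedException e) {
        Thread.currentThread().interrupt();
      }
    });
    boolean terminated =
        shutdownAndWait(executor, 200, TimeUnit.MILLISECONDS);
    System.out.println("terminated=" + terminated);
  }
}
```

Whether forcing termination is acceptable depends on what the executor's tasks are doing at shutdown. In a test cluster a short bound is likely fine, but the right timeout and fallback for {{RaftServerProxy}} would need to be decided in Ratis.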


