[ 
https://issues.apache.org/jira/browse/HIVE-29009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17966273#comment-17966273
 ] 

László Bodor edited comment on HIVE-29009 at 6/12/25 4:26 PM:
--------------------------------------------------------------

thanks [~zratkai] for the headsup and [~zabetak] for the jira!

I created 2 consecutive jstacks of a surefire process for later reference (I 
checked a hanging PR)
 [^jstack.txt]   [^jstack2.txt] 

analyzing soon

found the same stack in another hanging POD
definitely this is the one that hangs while creating a container:
{code}
        at 
org.apache.hive.service.auth.saml.TestHttpSamlAuthentication.setupIDP(TestHttpSamlAuthentication.java:178)
{code}

{code}
"main" #1 prio=5 os_prio=0 cpu=10515.69ms elapsed=14800.79s 
tid=0x00007e17fc026fe0 nid=0x2452 waiting on condition  [0x00007e180171e000]
   java.lang.Thread.State: WAITING (parking)
        at jdk.internal.misc.Unsafe.park(java.base@17.0.9/Native Method)
        - parking to wait for  <0x0000000088200000> (a 
java.util.concurrent.locks.ReentrantLock$NonfairSync)
        at 
java.util.concurrent.locks.LockSupport.park(java.base@17.0.9/LockSupport.java:211)
        at 
java.util.concurrent.locks.AbstractQueuedSynchronizer.acquire(java.base@17.0.9/AbstractQueuedSynchronizer.java:715)
        at 
java.util.concurrent.locks.AbstractQueuedSynchronizer.acquire(java.base@17.0.9/AbstractQueuedSynchronizer.java:938)
        at 
java.util.concurrent.locks.ReentrantLock$Sync.lock(java.base@17.0.9/ReentrantLock.java:153)
        at 
java.util.concurrent.locks.ReentrantLock.lock(java.base@17.0.9/ReentrantLock.java:322)
        at 
sun.security.ssl.SSLSocketImpl$AppInputStream.read(java.base@17.0.9/SSLSocketImpl.java:1042)
        at org.testcontainers.shaded.okio.Okio$2.read(Okio.java:140)
        at 
org.testcontainers.shaded.okio.AsyncTimeout$2.read(AsyncTimeout.java:237)
        at 
org.testcontainers.shaded.okio.RealBufferedSource.indexOf(RealBufferedSource.java:358)
        at 
org.testcontainers.shaded.okio.RealBufferedSource.readUtf8LineStrict(RealBufferedSource.java:230)
        at 
org.testcontainers.shaded.okio.RealBufferedSource.readUtf8LineStrict(RealBufferedSource.java:224)
        at 
org.testcontainers.shaded.okhttp3.internal.http1.Http1ExchangeCodec$ChunkedSource.readChunkSize(Http1ExchangeCodec.java:489)
        at 
org.testcontainers.shaded.okhttp3.internal.http1.Http1ExchangeCodec$ChunkedSource.read(Http1ExchangeCodec.java:471)
        at 
org.testcontainers.shaded.okhttp3.internal.Util.skipAll(Util.java:204)
        at 
org.testcontainers.shaded.okhttp3.internal.Util.discard(Util.java:186)
        at 
org.testcontainers.shaded.okhttp3.internal.http1.Http1ExchangeCodec$ChunkedSource.close(Http1ExchangeCodec.java:511)
        at 
org.testcontainers.shaded.okio.ForwardingSource.close(ForwardingSource.java:43)
        at 
org.testcontainers.shaded.okhttp3.internal.connection.Exchange$ResponseBodySource.close(Exchange.java:313)
        at 
org.testcontainers.shaded.okio.RealBufferedSource.close(RealBufferedSource.java:476)
        at 
org.testcontainers.shaded.okhttp3.internal.Util.closeQuietly(Util.java:139)
        at 
org.testcontainers.shaded.okhttp3.ResponseBody.close(ResponseBody.java:192)
        at org.testcontainers.shaded.okhttp3.Response.close(Response.java:290)
        at 
org.testcontainers.shaded.com.github.dockerjava.okhttp.OkDockerHttpClient$OkResponse.close(OkDockerHttpClient.java:285)
        at 
org.testcontainers.shaded.com.github.dockerjava.core.DefaultInvocationBuilder.lambda$null$0(DefaultInvocationBuilder.java:272)
        at 
org.testcontainers.shaded.com.github.dockerjava.core.DefaultInvocationBuilder$$Lambda$508/0x0000000100c15750.close(Unknown
 Source)
        at 
com.github.dockerjava.api.async.ResultCallbackTemplate.close(ResultCallbackTemplate.java:77)
        at 
org.testcontainers.utility.ResourceReaper.start(ResourceReaper.java:205)
        at 
org.testcontainers.DockerClientFactory.client(DockerClientFactory.java:205)
        - locked <0x0000000088202cf8> (a [Ljava.lang.Object;)
        at 
org.testcontainers.LazyDockerClient.getDockerClient(LazyDockerClient.java:14)
        at 
org.testcontainers.LazyDockerClient.authConfig(LazyDockerClient.java:12)
        at 
org.testcontainers.containers.GenericContainer.start(GenericContainer.java:310)
        at 
org.apache.hive.service.auth.saml.TestHttpSamlAuthentication.setupIDP(TestHttpSamlAuthentication.java:178)
        at 
org.apache.hive.service.auth.saml.TestHttpSamlAuthentication.testGroupNameFiltering2(TestHttpSamlAuthentication.java:490)
{code}


was (Author: abstractdog):
thanks [~zratkai] for the headsup and [~zabetak] for the jira!

I created 2 consecutive jstacks of a surefire process for later reference (I 
checked a hanging PR)
 [^jstack.txt]   [^jstack2.txt] 

analyzing soon

> Intermittent CI timeouts while running tests
> --------------------------------------------
>
>                 Key: HIVE-29009
>                 URL: https://issues.apache.org/jira/browse/HIVE-29009
>             Project: Hive
>          Issue Type: Bug
>          Components: Build Infrastructure, Testing Infrastructure
>            Reporter: Stamatis Zampetakis
>            Priority: Major
>         Attachments: jstack.txt, jstack2.txt
>
>
> Recently various CI runs in master and PRs are timing out while executing 
> tests. The problem is intermittent but rather frequent. The first and last 
> (at the time of logging this ticket) timeout failure in master are outlined 
> below:
> First: https://ci.hive.apache.org/job/hive-precommit/job/master/2532/
> Last: https://ci.hive.apache.org/job/hive-precommit/job/master/2546/
> Unfortunately due to HIVE-29008 the CI logs do not contain enough information 
> to easily determine which test is hanging and if it is the same everytime.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to