[ 
https://issues.apache.org/jira/browse/FLINK-23409?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17382038#comment-17382038
 ] 

Till Rohrmann edited comment on FLINK-23409 at 7/16/21, 12:28 PM:
------------------------------------------------------------------

Hmm, locally everything passes. Also the build for FLINK-23093 passed.

What I found in the logs is the following:

{code}
09:16:36,416 [flink-akka.actor.default-dispatcher-6] INFO  
org.apache.flink.runtime.executiongraph.ExecutionGraph       [] - DataSink 
(CsvOutputFormat (path: 
file:/tmp/junit7235602407204969305/junit837890199545668434.tmp, delimiter: ,)) 
(2/4) (61b9f281e369b96551f4f42d7bc6b156) switched from CREATED to SCHEDULED.
09:16:36,416 [flink-akka.actor.default-dispatcher-6] INFO  
org.apache.flink.runtime.executiongraph.ExecutionGraph       [] - DataSink 
(CsvOutputFormat (path: 
file:/tmp/junit7235602407204969305/junit837890199545668434.tmp, delimiter: ,)) 
(3/4) (4d49ea1e8acbc3f8368ece58928e6c5f) switched from CREATED to SCHEDULED.
09:16:36,416 [flink-akka.actor.default-dispatcher-6] INFO  
org.apache.flink.runtime.executiongraph.ExecutionGraph       [] - DataSink 
(CsvOutputFormat (path: 
file:/tmp/junit7235602407204969305/junit837890199545668434.tmp, delimiter: ,)) 
(4/4) (e0e327a62ffe2733022a92cf4c605921) switched from CREATED to SCHEDULED.
09:16:36,419 [flink-akka.actor.default-dispatcher-6] INFO  
org.apache.flink.runtime.jobmaster.JobMaster                 [] - Connecting to 
ResourceManager 
akka://flink/user/rpc/resourcemanager_2(b2ae4decd3b74394548861ada12c402f)
09:16:36,420 [flink-akka.actor.default-dispatcher-6] INFO  
org.apache.flink.runtime.jobmaster.JobMaster                 [] - Resolved 
ResourceManager address, beginning registration
09:16:36,421 [flink-akka.actor.default-dispatcher-4] INFO  
org.apache.flink.runtime.resourcemanager.StandaloneResourceManager [] - 
Registering job manager 
ba78e82641830967d1cf39d623934f97@akka://flink/user/rpc/jobmanager_7 for job 
72e507bcdef499061e756475aa153b19.
09:16:36,421 [flink-akka.actor.default-dispatcher-6] INFO  
org.apache.flink.runtime.resourcemanager.StandaloneResourceManager [] - 
Registered job manager 
ba78e82641830967d1cf39d623934f97@akka://flink/user/rpc/jobmanager_7 for job 
72e507bcdef499061e756475aa153b19.
09:16:36,422 [flink-akka.actor.default-dispatcher-4] INFO  
org.apache.flink.runtime.jobmaster.JobMaster                 [] - JobManager 
successfully registered at ResourceManager, leader id: 
b2ae4decd3b74394548861ada12c402f.
09:16:36,422 [flink-akka.actor.default-dispatcher-6] INFO  
org.apache.flink.runtime.resourcemanager.slotmanager.FineGrainedSlotManager [] 
- Received resource requirements from job 72e507bcdef499061e756475aa153b19: 
[ResourceRequirement{resourceProfile=ResourceProfile{UNKNOWN}, 
numberOfRequiredSlots=4}]
09:17:33,474 [flink-akka.actor.default-dispatcher-11] INFO  
org.apache.flink.runtime.resourcemanager.slotmanager.FineGrainedSlotManager [] 
- Release TaskManager a61dbdd440a00d4a7dd5d7e16944f57f because it exceeded the 
idle timeout.
09:18:03,493 [flink-akka.actor.default-dispatcher-22] INFO  
org.apache.flink.runtime.resourcemanager.slotmanager.FineGrainedSlotManager [] 
- Release TaskManager a61dbdd440a00d4a7dd5d7e16944f57f because it exceeded the 
idle timeout.
09:18:33,512 [flink-akka.actor.default-dispatcher-28] INFO  
org.apache.flink.runtime.resourcemanager.slotmanager.FineGrainedSlotManager [] 
- Release TaskManager a61dbdd440a00d4a7dd5d7e16944f57f because it exceeded the 
idle timeout.
09:19:03,532 [flink-akka.actor.default-dispatcher-34] INFO  
org.apache.flink.runtime.resourcemanager.slotmanager.FineGrainedSlotManager [] 
- Release TaskManager a61dbdd440a00d4a7dd5d7e16944f57f because it exceeded the 
idle timeout.
09:19:33,552 [flink-akka.actor.default-dispatcher-37] INFO  
org.apache.flink.runtime.resourcemanager.slotmanager.FineGrainedSlotManager [] 
- Release TaskManager a61dbdd440a00d4a7dd5d7e16944f57f because it exceeded the 
idle timeout.
09:20:03,587 [flink-akka.actor.default-dispatcher-45] INFO  
org.apache.flink.runtime.resourcemanager.slotmanager.FineGrainedSlotManager [] 
- Release TaskManager a61dbdd440a00d4a7dd5d7e16944f57f because it exceeded the 
idle timeout.
09:20:33,601 [flink-akka.actor.default-dispatcher-51] INFO  
org.apache.flink.runtime.resourcemanager.slotmanager.FineGrainedSlotManager [] 
- Release TaskManager a61dbdd440a00d4a7dd5d7e16944f57f because it exceeded the 
idle timeout.
09:21:03,613 [flink-akka.actor.default-dispatcher-56] INFO  
org.apache.flink.runtime.resourcemanager.slotmanager.FineGrainedSlotManager [] 
- Release TaskManager a61dbdd440a00d4a7dd5d7e16944f57f because it exceeded the 
idle timeout.
09:21:33,632 [flink-akka.actor.default-dispatcher-64] INFO  
org.apache.flink.runtime.resourcemanager.slotmanager.FineGrainedSlotManager [] 
- Release TaskManager a61dbdd440a00d4a7dd5d7e16944f57f because it exceeded the 
idle timeout.
09:21:36,443 [flink-akka.actor.default-dispatcher-65] INFO  
org.apache.flink.runtime.executiongraph.ExecutionGraph       [] - DataSource 
(at 
org.apache.flink.api.scala.util.CollectionDataSets$.getSmall5TupleDataSet(CollectionDataSets.scala:94)
 (org.apache.flink.api.java.io.CollectionInpu) (1/1) 
(9e7f450b1c1d7ef21005a7f237f8aae7) switched from SCHEDULED to FAILED on 
[unassigned resource].
java.util.concurrent.CompletionException: 
org.apache.flink.runtime.jobmanager.scheduler.NoResourceAvailableException: 
Slot request bulk is not fulfillable! Could not allocate the required slot 
within slot request timeout
{code}

Note that there are sizable gaps in the timestamps (for example, nothing is 
logged between 09:16:36 and 09:17:33). This could point to infrastructure 
problems.

Also, the build runs with fine-grained resource management enabled. Maybe this 
is affecting the test case. cc [~xintongsong].
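
To make the timestamp gaps mentioned above easier to spot in a longer log, here is a minimal sketch in Python. It assumes every log entry starts with an `HH:MM:SS,mmm` timestamp as in the excerpt and that the log does not cross midnight. The name `find_gaps` and the 10 second threshold are hypothetical and not part of Flink or its test tooling.

```python
import re
from datetime import datetime, timedelta

# Matches the leading "HH:MM:SS,mmm" timestamp of a log entry.
TIMESTAMP = re.compile(r"^(\d{2}:\d{2}:\d{2},\d{3}) ")


def find_gaps(lines, threshold=timedelta(seconds=10)):
    """Return (previous, current, delta) for consecutive timestamped
    entries that are further apart than `threshold`.

    Lines without a leading timestamp (wrapped continuations, stack
    traces) are skipped. A log that crosses midnight is not handled.
    """
    gaps = []
    prev = None
    for line in lines:
        match = TIMESTAMP.match(line)
        if not match:
            continue
        cur = datetime.strptime(match.group(1), "%H:%M:%S,%f")
        if prev is not None and cur - prev > threshold:
            gaps.append((prev.time(), cur.time(), cur - prev))
        prev = cur
    return gaps


# First lines of four entries taken from the excerpt above.
sample = [
    "09:16:36,422 [flink-akka.actor.default-dispatcher-6] INFO  ",
    "09:17:33,474 [flink-akka.actor.default-dispatcher-11] INFO  ",
    "09:18:03,493 [flink-akka.actor.default-dispatcher-22] INFO  ",
    "09:21:36,443 [flink-akka.actor.default-dispatcher-65] INFO  ",
]

for start, end, delta in find_gaps(sample):
    print(start, "->", end, "gap:", delta)
```

Running this against the full log of the failed build, rather than the four sample lines, should show whether the gaps are limited to this test or spread over the whole run.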


> CrossITCase fails with "NoResourceAvailableException: Slot request bulk is 
> not fulfillable! Could not allocate the required slot within slot request 
> timeout"
> -------------------------------------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: FLINK-23409
>                 URL: https://issues.apache.org/jira/browse/FLINK-23409
>             Project: Flink
>          Issue Type: Bug
>          Components: Runtime / Coordination, Table SQL / Planner
>    Affects Versions: 1.14.0
>            Reporter: Dawid Wysakowicz
>            Priority: Major
>              Labels: test-stability
>
> https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=20548&view=logs&j=a57e0635-3fad-5b08-57c7-a4142d7d6fa9&t=5360d54c-8d94-5d85-304e-a89267eb785a&l=10074
> {code}
> Jul 16 09:21:37       at 
> scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:171)
> Jul 16 09:21:37       at akka.actor.Actor$class.aroundReceive(Actor.scala:517)
> Jul 16 09:21:37       at 
> akka.actor.AbstractActor.aroundReceive(AbstractActor.scala:225)
> Jul 16 09:21:37       at 
> akka.actor.ActorCell.receiveMessage(ActorCell.scala:592)
> Jul 16 09:21:37       at akka.actor.ActorCell.invoke(ActorCell.scala:561)
> Jul 16 09:21:37       at 
> akka.dispatch.Mailbox.processMailbox(Mailbox.scala:258)
> Jul 16 09:21:37       at akka.dispatch.Mailbox.run(Mailbox.scala:225)
> Jul 16 09:21:37       at akka.dispatch.Mailbox.exec(Mailbox.scala:235)
> Jul 16 09:21:37       ... 4 more
> Jul 16 09:21:37 Caused by: java.util.concurrent.CompletionException: 
> org.apache.flink.runtime.jobmanager.scheduler.NoResourceAvailableException: 
> Slot request bulk is not fulfillable! Could not allocate the required slot 
> within slot request timeout
> Jul 16 09:21:37       at 
> java.util.concurrent.CompletableFuture.encodeThrowable(CompletableFuture.java:292)
> Jul 16 09:21:37       at 
> java.util.concurrent.CompletableFuture.completeThrowable(CompletableFuture.java:308)
> Jul 16 09:21:37       at 
> java.util.concurrent.CompletableFuture.uniApply(CompletableFuture.java:607)
> Jul 16 09:21:37       at 
> java.util.concurrent.CompletableFuture$UniApply.tryFire(CompletableFuture.java:591)
> Jul 16 09:21:37       ... 31 more
> Jul 16 09:21:37 Caused by: 
> org.apache.flink.runtime.jobmanager.scheduler.NoResourceAvailableException: 
> Slot request bulk is not fulfillable! Could not allocate the required slot 
> within slot request timeout
> Jul 16 09:21:37       at 
> org.apache.flink.runtime.jobmaster.slotpool.PhysicalSlotRequestBulkCheckerImpl.lambda$schedulePendingRequestBulkWithTimestampCheck$0(PhysicalSlotRequestBulkCheckerImpl.java:86)
> Jul 16 09:21:37       ... 24 more
> Jul 16 09:21:37 Caused by: java.util.concurrent.TimeoutException: Timeout has 
> occurred: 300000 ms
> Jul 16 09:21:37       ... 25 more
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
