MartijnVisser commented on PR #21128:
URL: https://github.com/apache/flink/pull/21128#issuecomment-1323541403

   @gaborgsomogyi I did some debugging locally. Even after correcting the 
location to point to the directory, the test still fails. For example, 
`perJobYarnCluster` fails because the job is cancelled instead of finishing 
successfully. In the log below, the cluster entrypoint receives a SIGINT and 
the job switches from RUNNING to SUSPENDED:
   
   ```
   2022-11-22 12:30:04,606 INFO  org.apache.flink.runtime.jobmaster.JobMaster   
              [] - Starting execution of job 'WordCount Example' 
(56e7231d71adead5c4c3409b1e8f295c) under job master id 
00000000000000000000000000000000.
   2022-11-22 12:30:04,612 INFO  org.apache.flink.runtime.jobmaster.JobMaster   
              [] - Starting scheduling with scheduling strategy 
[org.apache.flink.runtime.scheduler.strategy.PipelinedRegionSchedulingStrategy]
   2022-11-22 12:30:04,612 INFO  
org.apache.flink.runtime.executiongraph.ExecutionGraph       [] - Job WordCount 
Example (56e7231d71adead5c4c3409b1e8f295c) switched from state CREATED to 
RUNNING.
   2022-11-22 12:30:04,616 INFO  
org.apache.flink.runtime.executiongraph.ExecutionGraph       [] - CHAIN 
DataSource (at main(WordCount.java:69) 
(org.apache.flink.api.java.io.TextInputFormat)) -> FlatMap (FlatMap at 
main(WordCount.java:84)) -> Combine (SUM(1), at main(WordCount.java:87) (1/2) 
(d70d4be96afa4bcefa3275ef9b89a439_1b58034f691643610d089711d8321d03_0_0) 
switched from CREATED to SCHEDULED.
   2022-11-22 12:30:04,616 INFO  
org.apache.flink.runtime.executiongraph.ExecutionGraph       [] - CHAIN 
DataSource (at main(WordCount.java:69) 
(org.apache.flink.api.java.io.TextInputFormat)) -> FlatMap (FlatMap at 
main(WordCount.java:84)) -> Combine (SUM(1), at main(WordCount.java:87) (2/2) 
(d70d4be96afa4bcefa3275ef9b89a439_1b58034f691643610d089711d8321d03_1_0) 
switched from CREATED to SCHEDULED.
   2022-11-22 12:30:04,616 INFO  
org.apache.flink.runtime.executiongraph.ExecutionGraph       [] - Reduce 
(SUM(1), at main(WordCount.java:87) (1/2) 
(d70d4be96afa4bcefa3275ef9b89a439_64a7a9e5ab13db3bb7c7ac4cec59879e_0_0) 
switched from CREATED to SCHEDULED.
   2022-11-22 12:30:04,616 INFO  
org.apache.flink.runtime.executiongraph.ExecutionGraph       [] - Reduce 
(SUM(1), at main(WordCount.java:87) (2/2) 
(d70d4be96afa4bcefa3275ef9b89a439_64a7a9e5ab13db3bb7c7ac4cec59879e_1_0) 
switched from CREATED to SCHEDULED.
   2022-11-22 12:30:04,616 INFO  
org.apache.flink.runtime.executiongraph.ExecutionGraph       [] - DataSink 
(CsvOutputFormat (path: 
/var/folders/5d/k7d0ltfn3hgbwlvyc5l9_mvc0000gn/T/junit14650925366352419605, 
delimiter:  )) (1/2) 
(d70d4be96afa4bcefa3275ef9b89a439_3040bb2235154f828371bc8c04c5a973_0_0) 
switched from CREATED to SCHEDULED.
   2022-11-22 12:30:04,616 INFO  
org.apache.flink.runtime.executiongraph.ExecutionGraph       [] - DataSink 
(CsvOutputFormat (path: 
/var/folders/5d/k7d0ltfn3hgbwlvyc5l9_mvc0000gn/T/junit14650925366352419605, 
delimiter:  )) (2/2) 
(d70d4be96afa4bcefa3275ef9b89a439_3040bb2235154f828371bc8c04c5a973_1_0) 
switched from CREATED to SCHEDULED.
   2022-11-22 12:30:04,631 INFO  org.apache.flink.runtime.jobmaster.JobMaster   
              [] - Connecting to ResourceManager 
akka.tcp://flink@martijnsmbpm15:58267/user/rpc/resourcemanager_*(00000000000000000000000000000000)
   2022-11-22 12:30:04,871 INFO  
org.apache.flink.yarn.YarnResourceManagerDriver              [] - Recovered 0 
containers from previous attempts ([]).
   2022-11-22 12:30:04,872 INFO  
org.apache.flink.runtime.resourcemanager.active.ActiveResourceManager [] - 
Recovered 0 workers from previous attempt.
   2022-11-22 12:30:04,921 INFO  org.apache.hadoop.conf.Configuration           
              [] - resource-types.xml not found
   2022-11-22 12:30:04,921 INFO  
org.apache.hadoop.yarn.util.resource.ResourceUtils           [] - Unable to 
find 'resource-types.xml'.
   2022-11-22 12:30:04,928 INFO  
org.apache.hadoop.yarn.util.resource.ResourceUtils           [] - Adding 
resource type - name = memory-mb, units = Mi, type = COUNTABLE
   2022-11-22 12:30:04,928 INFO  
org.apache.hadoop.yarn.util.resource.ResourceUtils           [] - Adding 
resource type - name = vcores, units = , type = COUNTABLE
   2022-11-22 12:30:04,933 INFO  
org.apache.flink.runtime.externalresource.ExternalResourceUtils [] - Enabled 
external resources: []
   2022-11-22 12:30:04,938 INFO  
org.apache.hadoop.yarn.client.api.async.impl.NMClientAsyncImpl [] - Upper bound 
of the thread pool size is 500
   2022-11-22 12:30:04,946 INFO  org.apache.flink.runtime.jobmaster.JobMaster   
              [] - Resolved ResourceManager address, beginning registration
   2022-11-22 12:30:04,964 INFO  
org.apache.flink.runtime.resourcemanager.active.ActiveResourceManager [] - 
Registering job manager 
[email protected]://flink@martijnsmbpm15:58267/user/rpc/jobmanager_1
 for job 56e7231d71adead5c4c3409b1e8f295c.
   2022-11-22 12:30:04,983 INFO  
org.apache.flink.runtime.resourcemanager.active.ActiveResourceManager [] - 
Registered job manager 
[email protected]://flink@martijnsmbpm15:58267/user/rpc/jobmanager_1
 for job 56e7231d71adead5c4c3409b1e8f295c.
   2022-11-22 12:30:04,991 INFO  org.apache.flink.runtime.jobmaster.JobMaster   
              [] - JobManager successfully registered at ResourceManager, 
leader id: 00000000000000000000000000000000.
   2022-11-22 12:30:04,994 INFO  
org.apache.flink.runtime.resourcemanager.slotmanager.DeclarativeSlotManager [] 
- Received resource requirements from job 56e7231d71adead5c4c3409b1e8f295c: 
[ResourceRequirement{resourceProfile=ResourceProfile{UNKNOWN}, 
numberOfRequiredSlots=2}]
   2022-11-22 12:30:05,062 INFO  
org.apache.flink.runtime.resourcemanager.active.ActiveResourceManager [] - 
Requesting new worker with resource spec WorkerResourceSpec {cpuCores=2.0, 
taskHeapSize=25.600mb (26843542 bytes), taskOffHeapSize=0 bytes, 
networkMemSize=64.000mb (67108864 bytes), managedMemSize=230.400mb (241591914 
bytes), numSlots=2}, current pending count: 1.
   2022-11-22 12:30:05,074 INFO  
org.apache.flink.yarn.YarnResourceManagerDriver              [] - Requesting 
new TaskExecutor container with resource TaskExecutorProcessSpec {cpuCores=2.0, 
frameworkHeapSize=128.000mb (134217728 bytes), frameworkOffHeapSize=128.000mb 
(134217728 bytes), taskHeapSize=25.600mb (26843542 bytes), taskOffHeapSize=0 
bytes, networkMemSize=64.000mb (67108864 bytes), managedMemorySize=230.400mb 
(241591914 bytes), jvmMetaspaceSize=256.000mb (268435456 bytes), 
jvmOverheadSize=192.000mb (201326592 bytes), numSlots=2}, priority 1.
   2022-11-22 12:30:10,496 INFO  
org.apache.flink.yarn.YarnResourceManagerDriver              [] - Received 1 
containers.
   2022-11-22 12:30:10,497 INFO  
org.apache.flink.yarn.YarnResourceManagerDriver              [] - Received 1 
containers with priority 1, 1 pending container requests.
   2022-11-22 12:30:10,502 INFO  
org.apache.flink.yarn.YarnResourceManagerDriver              [] - Removing 
container request Capability[<memory:1024, 
vCores:2>]Priority[1]AllocationRequestId[0]ExecutionTypeRequest[{Execution 
Type: GUARANTEED, Enforce Execution Type: false}].
   2022-11-22 12:30:10,503 INFO  
org.apache.flink.yarn.YarnResourceManagerDriver              [] - Accepted 1 
requested containers, returned 0 excess containers, 0 pending container 
requests of resource <memory:1024, vCores:2>.
   2022-11-22 12:30:10,503 INFO  
org.apache.flink.yarn.YarnResourceManagerDriver              [] - TaskExecutor 
container_1669116565071_0002_01_000002(martijnsmbpm15:58229) will be started on 
martijnsmbpm15 with TaskExecutorProcessSpec {cpuCores=2.0, 
frameworkHeapSize=128.000mb (134217728 bytes), frameworkOffHeapSize=128.000mb 
(134217728 bytes), taskHeapSize=25.600mb (26843542 bytes), taskOffHeapSize=0 
bytes, networkMemSize=64.000mb (67108864 bytes), managedMemorySize=230.400mb 
(241591914 bytes), jvmMetaspaceSize=256.000mb (268435456 bytes), 
jvmOverheadSize=192.000mb (201326592 bytes), numSlots=2}.
   2022-11-22 12:30:10,507 INFO  
org.apache.flink.yarn.YarnResourceManagerDriver              [] - TM:Adding 
remoteYarnConfPath 
file:/var/folders/5d/k7d0ltfn3hgbwlvyc5l9_mvc0000gn/T/junit15361768760757807396/.flink/application_1669116565071_0002/yarn-site.xml
 to the container local resource bucket
   2022-11-22 12:30:10,747 INFO  
org.apache.flink.yarn.YarnResourceManagerDriver              [] - Creating 
container launch context for TaskManagers
   2022-11-22 12:30:10,748 INFO  
org.apache.flink.yarn.YarnResourceManagerDriver              [] - Starting 
TaskManagers
   2022-11-22 12:30:10,757 WARN  org.apache.flink.runtime.util.HadoopUtils      
              [] - Could not find Hadoop configuration via any of the supported 
methods (Flink configuration, environment variables).
   2022-11-22 12:30:10,768 INFO  
org.apache.hadoop.yarn.client.api.async.impl.NMClientAsyncImpl [] - Processing 
Event EventType: START_CONTAINER for Container 
container_1669116565071_0002_01_000002
   2022-11-22 12:30:10,768 INFO  
org.apache.flink.runtime.resourcemanager.active.ActiveResourceManager [] - 
Requested worker container_1669116565071_0002_01_000002(martijnsmbpm15:58229) 
with resource spec WorkerResourceSpec {cpuCores=2.0, taskHeapSize=25.600mb 
(26843542 bytes), taskOffHeapSize=0 bytes, networkMemSize=64.000mb (67108864 
bytes), managedMemSize=230.400mb (241591914 bytes), numSlots=2}.
   2022-11-22 12:30:14,006 INFO  
org.apache.flink.runtime.entrypoint.ClusterEntrypoint        [] - RECEIVED 
SIGNAL 2: SIGINT. Shutting down as requested.
   2022-11-22 12:30:14,008 INFO  
org.apache.flink.runtime.entrypoint.ClusterEntrypoint        [] - Shutting 
YarnJobClusterEntrypoint down with application status UNKNOWN. Diagnostics 
Cluster entrypoint has been closed externally..
   2022-11-22 12:30:14,015 INFO  org.apache.flink.runtime.blob.BlobServer       
              [] - Stopped BLOB server at 0.0.0.0:58268
   2022-11-22 12:30:14,023 INFO  
org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint [] - Shutting 
down rest endpoint.
   2022-11-22 12:30:14,084 INFO  
org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint [] - Removing 
cache directory 
/var/folders/5d/k7d0ltfn3hgbwlvyc5l9_mvc0000gn/T/flink-web-9ae99cf9-dc8f-4eb1-85cc-dc57cd6b50f0/flink-web-ui
   2022-11-22 12:30:14,087 INFO  
org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint [] - 
http://192.168.2.217:58270 lost leadership
   2022-11-22 12:30:14,087 INFO  
org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint [] - Shut down 
complete.
   2022-11-22 12:30:14,087 INFO  
org.apache.flink.runtime.entrypoint.component.DispatcherResourceManagerComponent
 [] - Closing components.
   2022-11-22 12:30:14,087 INFO  
org.apache.flink.runtime.dispatcher.runner.DefaultDispatcherRunner [] - 
DefaultDispatcherRunner was revoked the leadership with leader id 
00000000-0000-0000-0000-000000000000. Stopping the DispatcherLeaderProcess.
   2022-11-22 12:30:14,088 INFO  
org.apache.flink.runtime.dispatcher.runner.JobDispatcherLeaderProcess [] - 
Stopping JobDispatcherLeaderProcess.
   2022-11-22 12:30:14,089 INFO  
org.apache.flink.runtime.resourcemanager.ResourceManagerServiceImpl [] - 
Stopping resource manager service.
   2022-11-22 12:30:14,090 INFO  
org.apache.flink.runtime.resourcemanager.ResourceManagerServiceImpl [] - 
Resource manager service is not running. Ignore revoking leadership.
   2022-11-22 12:30:14,090 INFO  
org.apache.flink.runtime.dispatcher.MiniDispatcher           [] - Stopping 
dispatcher akka.tcp://flink@martijnsmbpm15:58267/user/rpc/dispatcher_0.
   2022-11-22 12:30:14,092 INFO  
org.apache.flink.runtime.dispatcher.MiniDispatcher           [] - Stopping all 
currently running jobs of dispatcher 
akka.tcp://flink@martijnsmbpm15:58267/user/rpc/dispatcher_0.
   2022-11-22 12:30:14,097 INFO  org.apache.flink.runtime.jobmaster.JobMaster   
              [] - Stopping the JobMaster for job 'WordCount Example' 
(56e7231d71adead5c4c3409b1e8f295c).
   2022-11-22 12:30:14,100 INFO  
org.apache.flink.runtime.dispatcher.MiniDispatcher           [] - Job 
56e7231d71adead5c4c3409b1e8f295c reached terminal state SUSPENDED.
   2022-11-22 12:30:14,104 INFO  
org.apache.flink.runtime.executiongraph.ExecutionGraph       [] - Job WordCount 
Example (56e7231d71adead5c4c3409b1e8f295c) switched from state RUNNING to 
SUSPENDED.
   org.apache.flink.util.FlinkException: Scheduler is being stopped.
        at 
org.apache.flink.runtime.scheduler.SchedulerBase.closeAsync(SchedulerBase.java:637)
 ~[flink-dist-1.17-SNAPSHOT.jar:1.17-SNAPSHOT]
        at 
org.apache.flink.runtime.jobmaster.JobMaster.stopScheduling(JobMaster.java:1056)
 ~[flink-dist-1.17-SNAPSHOT.jar:1.17-SNAPSHOT]
        at 
org.apache.flink.runtime.jobmaster.JobMaster.stopJobExecution(JobMaster.java:1019)
 ~[flink-dist-1.17-SNAPSHOT.jar:1.17-SNAPSHOT]
        at 
org.apache.flink.runtime.jobmaster.JobMaster.onStop(JobMaster.java:442) 
~[flink-dist-1.17-SNAPSHOT.jar:1.17-SNAPSHOT]
        at 
org.apache.flink.runtime.rpc.RpcEndpoint.internalCallOnStop(RpcEndpoint.java:239)
 ~[flink-dist-1.17-SNAPSHOT.jar:1.17-SNAPSHOT]
        at 
org.apache.flink.runtime.rpc.akka.AkkaRpcActor$StartedState.lambda$terminate$0(AkkaRpcActor.java:578)
 ~[flink-rpc-akka_3f8b2750-5919-429e-97ab-a0077948520e.jar:1.17-SNAPSHOT]
        at 
org.apache.flink.runtime.concurrent.akka.ClassLoadingUtils.runWithContextClassLoader(ClassLoadingUtils.java:83)
 ~[flink-rpc-akka_3f8b2750-5919-429e-97ab-a0077948520e.jar:1.17-SNAPSHOT]
        at 
org.apache.flink.runtime.rpc.akka.AkkaRpcActor$StartedState.terminate(AkkaRpcActor.java:577)
 ~[flink-rpc-akka_3f8b2750-5919-429e-97ab-a0077948520e.jar:1.17-SNAPSHOT]
        at 
org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleControlMessage(AkkaRpcActor.java:196)
 ~[flink-rpc-akka_3f8b2750-5919-429e-97ab-a0077948520e.jar:1.17-SNAPSHOT]
        at akka.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:24) 
[flink-rpc-akka_3f8b2750-5919-429e-97ab-a0077948520e.jar:1.17-SNAPSHOT]
        at akka.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:20) 
[flink-rpc-akka_3f8b2750-5919-429e-97ab-a0077948520e.jar:1.17-SNAPSHOT]
        at scala.PartialFunction.applyOrElse(PartialFunction.scala:127) 
[flink-rpc-akka_3f8b2750-5919-429e-97ab-a0077948520e.jar:1.17-SNAPSHOT]
        at scala.PartialFunction.applyOrElse$(PartialFunction.scala:126) 
[flink-rpc-akka_3f8b2750-5919-429e-97ab-a0077948520e.jar:1.17-SNAPSHOT]
        at akka.japi.pf.UnitCaseStatement.applyOrElse(CaseStatements.scala:20) 
[flink-rpc-akka_3f8b2750-5919-429e-97ab-a0077948520e.jar:1.17-SNAPSHOT]
        at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:175) 
[flink-rpc-akka_3f8b2750-5919-429e-97ab-a0077948520e.jar:1.17-SNAPSHOT]
        at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:176) 
[flink-rpc-akka_3f8b2750-5919-429e-97ab-a0077948520e.jar:1.17-SNAPSHOT]
        at akka.actor.Actor.aroundReceive(Actor.scala:537) 
[flink-rpc-akka_3f8b2750-5919-429e-97ab-a0077948520e.jar:1.17-SNAPSHOT]
        at akka.actor.Actor.aroundReceive$(Actor.scala:535) 
[flink-rpc-akka_3f8b2750-5919-429e-97ab-a0077948520e.jar:1.17-SNAPSHOT]
        at akka.actor.AbstractActor.aroundReceive(AbstractActor.scala:220) 
[flink-rpc-akka_3f8b2750-5919-429e-97ab-a0077948520e.jar:1.17-SNAPSHOT]
        at akka.actor.ActorCell.receiveMessage(ActorCell.scala:579) 
[flink-rpc-akka_3f8b2750-5919-429e-97ab-a0077948520e.jar:1.17-SNAPSHOT]
        at akka.actor.ActorCell.invoke(ActorCell.scala:547) 
[flink-rpc-akka_3f8b2750-5919-429e-97ab-a0077948520e.jar:1.17-SNAPSHOT]
        at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:270) 
[flink-rpc-akka_3f8b2750-5919-429e-97ab-a0077948520e.jar:1.17-SNAPSHOT]
        at akka.dispatch.Mailbox.run(Mailbox.scala:231) 
[flink-rpc-akka_3f8b2750-5919-429e-97ab-a0077948520e.jar:1.17-SNAPSHOT]
        at akka.dispatch.Mailbox.exec(Mailbox.scala:243) 
[flink-rpc-akka_3f8b2750-5919-429e-97ab-a0077948520e.jar:1.17-SNAPSHOT]
        at java.util.concurrent.ForkJoinTask.doExec(ForkJoinTask.java:290) [?:?]
        at 
java.util.concurrent.ForkJoinPool$WorkQueue.topLevelExec(ForkJoinPool.java:1020)
 [?:?]
        at java.util.concurrent.ForkJoinPool.scan(ForkJoinPool.java:1656) [?:?]
        at java.util.concurrent.ForkJoinPool.runWorker(ForkJoinPool.java:1594) 
[?:?]
        at 
java.util.concurrent.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:183) 
[?:?]
   2022-11-22 12:30:14,120 INFO  
org.apache.flink.runtime.executiongraph.ExecutionGraph       [] - CHAIN 
DataSource (at main(WordCount.java:69) 
(org.apache.flink.api.java.io.TextInputFormat)) -> FlatMap (FlatMap at 
main(WordCount.java:84)) -> Combine (SUM(1), at main(WordCount.java:87) (1/2) 
(d70d4be96afa4bcefa3275ef9b89a439_1b58034f691643610d089711d8321d03_0_0) 
switched from SCHEDULED to CANCELING.
   2022-11-22 12:30:14,121 INFO  
org.apache.flink.runtime.executiongraph.ExecutionGraph       [] - CHAIN 
DataSource (at main(WordCount.java:69) 
(org.apache.flink.api.java.io.TextInputFormat)) -> FlatMap (FlatMap at 
main(WordCount.java:84)) -> Combine (SUM(1), at main(WordCount.java:87) (1/2) 
(d70d4be96afa4bcefa3275ef9b89a439_1b58034f691643610d089711d8321d03_0_0) 
switched from CANCELING to CANCELED.
   2022-11-22 12:30:14,123 INFO  
org.apache.flink.runtime.executiongraph.ExecutionGraph       [] - Discarding 
the results produced by task execution 
d70d4be96afa4bcefa3275ef9b89a439_1b58034f691643610d089711d8321d03_0_0.
   2022-11-22 12:30:14,125 INFO  
org.apache.flink.runtime.executiongraph.ExecutionGraph       [] - CHAIN 
DataSource (at main(WordCount.java:69) 
(org.apache.flink.api.java.io.TextInputFormat)) -> FlatMap (FlatMap at 
main(WordCount.java:84)) -> Combine (SUM(1), at main(WordCount.java:87) (2/2) 
(d70d4be96afa4bcefa3275ef9b89a439_1b58034f691643610d089711d8321d03_1_0) 
switched from SCHEDULED to CANCELING.
   2022-11-22 12:30:14,125 INFO  
org.apache.flink.runtime.executiongraph.ExecutionGraph       [] - CHAIN 
DataSource (at main(WordCount.java:69) 
(org.apache.flink.api.java.io.TextInputFormat)) -> FlatMap (FlatMap at 
main(WordCount.java:84)) -> Combine (SUM(1), at main(WordCount.java:87) (2/2) 
(d70d4be96afa4bcefa3275ef9b89a439_1b58034f691643610d089711d8321d03_1_0) 
switched from CANCELING to CANCELED.
   2022-11-22 12:30:14,125 INFO  
org.apache.flink.runtime.executiongraph.ExecutionGraph       [] - Discarding 
the results produced by task execution 
d70d4be96afa4bcefa3275ef9b89a439_1b58034f691643610d089711d8321d03_1_0.
   2022-11-22 12:30:14,125 INFO  
org.apache.flink.runtime.executiongraph.ExecutionGraph       [] - Reduce 
(SUM(1), at main(WordCount.java:87) (1/2) 
(d70d4be96afa4bcefa3275ef9b89a439_64a7a9e5ab13db3bb7c7ac4cec59879e_0_0) 
switched from SCHEDULED to CANCELING.
   2022-11-22 12:30:14,125 INFO  
org.apache.flink.runtime.executiongraph.ExecutionGraph       [] - Reduce 
(SUM(1), at main(WordCount.java:87) (1/2) 
(d70d4be96afa4bcefa3275ef9b89a439_64a7a9e5ab13db3bb7c7ac4cec59879e_0_0) 
switched from CANCELING to CANCELED.
   2022-11-22 12:30:14,125 INFO  
org.apache.flink.runtime.executiongraph.ExecutionGraph       [] - Discarding 
the results produced by task execution 
d70d4be96afa4bcefa3275ef9b89a439_64a7a9e5ab13db3bb7c7ac4cec59879e_0_0.
   2022-11-22 12:30:14,125 INFO  
org.apache.flink.runtime.executiongraph.ExecutionGraph       [] - Reduce 
(SUM(1), at main(WordCount.java:87) (2/2) 
(d70d4be96afa4bcefa3275ef9b89a439_64a7a9e5ab13db3bb7c7ac4cec59879e_1_0) 
switched from SCHEDULED to CANCELING.
   2022-11-22 12:30:14,125 INFO  
org.apache.flink.runtime.executiongraph.ExecutionGraph       [] - Reduce 
(SUM(1), at main(WordCount.java:87) (2/2) 
(d70d4be96afa4bcefa3275ef9b89a439_64a7a9e5ab13db3bb7c7ac4cec59879e_1_0) 
switched from CANCELING to CANCELED.
   2022-11-22 12:30:14,125 INFO  
org.apache.flink.runtime.executiongraph.ExecutionGraph       [] - Discarding 
the results produced by task execution 
d70d4be96afa4bcefa3275ef9b89a439_64a7a9e5ab13db3bb7c7ac4cec59879e_1_0.
   2022-11-22 12:30:14,125 INFO  
org.apache.flink.runtime.executiongraph.ExecutionGraph       [] - DataSink 
(CsvOutputFormat (path: 
/var/folders/5d/k7d0ltfn3hgbwlvyc5l9_mvc0000gn/T/junit14650925366352419605, 
delimiter:  )) (1/2) 
(d70d4be96afa4bcefa3275ef9b89a439_3040bb2235154f828371bc8c04c5a973_0_0) 
switched from SCHEDULED to CANCELING.
   2022-11-22 12:30:14,125 INFO  
org.apache.flink.runtime.executiongraph.ExecutionGraph       [] - DataSink 
(CsvOutputFormat (path: 
/var/folders/5d/k7d0ltfn3hgbwlvyc5l9_mvc0000gn/T/junit14650925366352419605, 
delimiter:  )) (1/2) 
(d70d4be96afa4bcefa3275ef9b89a439_3040bb2235154f828371bc8c04c5a973_0_0) 
switched from CANCELING to CANCELED.
   2022-11-22 12:30:14,125 INFO  
org.apache.flink.runtime.executiongraph.ExecutionGraph       [] - Discarding 
the results produced by task execution 
d70d4be96afa4bcefa3275ef9b89a439_3040bb2235154f828371bc8c04c5a973_0_0.
   2022-11-22 12:30:14,126 INFO  
org.apache.flink.runtime.executiongraph.ExecutionGraph       [] - DataSink 
(CsvOutputFormat (path: 
/var/folders/5d/k7d0ltfn3hgbwlvyc5l9_mvc0000gn/T/junit14650925366352419605, 
delimiter:  )) (2/2) 
(d70d4be96afa4bcefa3275ef9b89a439_3040bb2235154f828371bc8c04c5a973_1_0) 
switched from SCHEDULED to CANCELING.
   2022-11-22 12:30:14,127 INFO  
org.apache.flink.runtime.executiongraph.ExecutionGraph       [] - DataSink 
(CsvOutputFormat (path: 
/var/folders/5d/k7d0ltfn3hgbwlvyc5l9_mvc0000gn/T/junit14650925366352419605, 
delimiter:  )) (2/2) 
(d70d4be96afa4bcefa3275ef9b89a439_3040bb2235154f828371bc8c04c5a973_1_0) 
switched from CANCELING to CANCELED.
   2022-11-22 12:30:14,129 INFO  
org.apache.flink.runtime.executiongraph.ExecutionGraph       [] - Discarding 
the results produced by task execution 
d70d4be96afa4bcefa3275ef9b89a439_3040bb2235154f828371bc8c04c5a973_1_0.
   2022-11-22 12:30:14,130 INFO  
org.apache.flink.runtime.executiongraph.ExecutionGraph       [] - Job 
56e7231d71adead5c4c3409b1e8f295c has been suspended.
   2022-11-22 12:30:14,131 INFO  org.apache.flink.runtime.jobmaster.JobMaster   
              [] - Close ResourceManager connection 
bf303538bb378c3126b9d7396fb542f8: Stopping JobMaster for job 'WordCount 
Example' (56e7231d71adead5c4c3409b1e8f295c).
   2022-11-22 12:30:14,146 INFO  
org.apache.flink.runtime.dispatcher.MiniDispatcher           [] - Stopped 
dispatcher akka.tcp://flink@martijnsmbpm15:58267/user/rpc/dispatcher_0.
   2022-11-22 12:30:14,449 WARN  org.apache.hadoop.ipc.Client                   
              [] - Exception encountered while connecting to the server 
   java.io.IOException: Connection reset by peer
        at sun.nio.ch.FileDispatcherImpl.read0(Native Method) ~[?:?]
        at sun.nio.ch.SocketDispatcher.read(SocketDispatcher.java:39) ~[?:?]
        at sun.nio.ch.IOUtil.readIntoNativeBuffer(IOUtil.java:276) ~[?:?]
        at sun.nio.ch.IOUtil.read(IOUtil.java:245) ~[?:?]
        at sun.nio.ch.IOUtil.read(IOUtil.java:223) ~[?:?]
        at sun.nio.ch.SocketChannelImpl.read(SocketChannelImpl.java:356) ~[?:?]
        at 
org.apache.hadoop.net.SocketInputStream$Reader.performIO(SocketInputStream.java:57)
 ~[hadoop-common-2.10.2.jar:?]
        at 
org.apache.hadoop.net.SocketIOWithTimeout.doIO(SocketIOWithTimeout.java:142) 
~[hadoop-common-2.10.2.jar:?]
        at 
org.apache.hadoop.net.SocketInputStream.read(SocketInputStream.java:161) 
~[hadoop-common-2.10.2.jar:?]
        at 
org.apache.hadoop.net.SocketInputStream.read(SocketInputStream.java:131) 
~[hadoop-common-2.10.2.jar:?]
        at java.io.FilterInputStream.read(FilterInputStream.java:133) ~[?:?]
        at java.io.BufferedInputStream.fill(BufferedInputStream.java:252) ~[?:?]
        at java.io.BufferedInputStream.read(BufferedInputStream.java:271) ~[?:?]
        at java.io.DataInputStream.readInt(DataInputStream.java:392) ~[?:?]
        at 
org.apache.hadoop.ipc.Client$IpcStreams.readResponse(Client.java:1865) 
~[hadoop-common-2.10.2.jar:?]
        at 
org.apache.hadoop.security.SaslRpcClient.saslConnect(SaslRpcClient.java:365) 
~[hadoop-common-2.10.2.jar:?]
        at 
org.apache.hadoop.ipc.Client$Connection.setupSaslConnection(Client.java:629) 
~[hadoop-common-2.10.2.jar:?]
        at org.apache.hadoop.ipc.Client$Connection.access$2200(Client.java:423) 
~[hadoop-common-2.10.2.jar:?]
        at org.apache.hadoop.ipc.Client$Connection$2.run(Client.java:833) 
~[hadoop-common-2.10.2.jar:?]
        at org.apache.hadoop.ipc.Client$Connection$2.run(Client.java:829) 
~[hadoop-common-2.10.2.jar:?]
        at java.security.AccessController.doPrivileged(Native Method) ~[?:?]
        at javax.security.auth.Subject.doAs(Subject.java:423) ~[?:?]
        at 
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1938)
 ~[hadoop-common-2.10.2.jar:?]
        at 
org.apache.hadoop.ipc.Client$Connection.setupIOstreams(Client.java:828) 
~[hadoop-common-2.10.2.jar:?]
        at org.apache.hadoop.ipc.Client$Connection.access$3700(Client.java:423) 
~[hadoop-common-2.10.2.jar:?]
        at org.apache.hadoop.ipc.Client.getConnection(Client.java:1621) 
~[hadoop-common-2.10.2.jar:?]
        at org.apache.hadoop.ipc.Client.call(Client.java:1450) 
~[hadoop-common-2.10.2.jar:?]
        at org.apache.hadoop.ipc.Client.call(Client.java:1403) 
~[hadoop-common-2.10.2.jar:?]
        at 
org.apache.hadoop.ipc.ProtobufRpcEngine$Invoker.invoke(ProtobufRpcEngine.java:230)
 ~[hadoop-common-2.10.2.jar:?]
        at 
org.apache.hadoop.ipc.ProtobufRpcEngine$Invoker.invoke(ProtobufRpcEngine.java:118)
 ~[hadoop-common-2.10.2.jar:?]
        at com.sun.proxy.$Proxy39.stopContainers(Unknown Source) ~[?:?]
        at 
org.apache.hadoop.yarn.api.impl.pb.client.ContainerManagementProtocolPBClientImpl.stopContainers(ContainerManagementProtocolPBClientImpl.java:142)
 ~[hadoop-yarn-common-2.10.2.jar:?]
        at jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native Method) 
~[?:?]
        at 
jdk.internal.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
 ~[?:?]
        at 
jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
 ~[?:?]
        at java.lang.reflect.Method.invoke(Method.java:566) ~[?:?]
        at 
org.apache.hadoop.io.retry.RetryInvocationHandler.invokeMethod(RetryInvocationHandler.java:433)
 ~[hadoop-common-2.10.2.jar:?]
        at 
org.apache.hadoop.io.retry.RetryInvocationHandler$Call.invokeMethod(RetryInvocationHandler.java:166)
 ~[hadoop-common-2.10.2.jar:?]
        at 
org.apache.hadoop.io.retry.RetryInvocationHandler$Call.invoke(RetryInvocationHandler.java:158)
 ~[hadoop-common-2.10.2.jar:?]
        at 
org.apache.hadoop.io.retry.RetryInvocationHandler$Call.invokeOnce(RetryInvocationHandler.java:96)
 ~[hadoop-common-2.10.2.jar:?]
        at 
org.apache.hadoop.io.retry.RetryInvocationHandler.invoke(RetryInvocationHandler.java:362)
 ~[hadoop-common-2.10.2.jar:?]
        at com.sun.proxy.$Proxy40.stopContainers(Unknown Source) ~[?:?]
        at 
org.apache.hadoop.yarn.client.api.impl.NMClientImpl.stopContainerInternal(NMClientImpl.java:426)
 ~[hadoop-yarn-client-2.10.2.jar:?]
        at 
org.apache.hadoop.yarn.client.api.impl.NMClientImpl.stopContainer(NMClientImpl.java:303)
 ~[hadoop-yarn-client-2.10.2.jar:?]
        at 
org.apache.hadoop.yarn.client.api.impl.NMClientImpl.cleanupRunningContainers(NMClientImpl.java:121)
 ~[hadoop-yarn-client-2.10.2.jar:?]
        at 
org.apache.hadoop.yarn.client.api.impl.NMClientImpl.serviceStop(NMClientImpl.java:112)
 ~[hadoop-yarn-client-2.10.2.jar:?]
        at 
org.apache.hadoop.service.AbstractService.stop(AbstractService.java:222) 
~[hadoop-common-2.10.2.jar:?]
        at 
org.apache.hadoop.yarn.client.api.async.impl.NMClientAsyncImpl.serviceStop(NMClientAsyncImpl.java:240)
 ~[hadoop-yarn-client-2.10.2.jar:?]
        at 
org.apache.hadoop.service.AbstractService.stop(AbstractService.java:222) 
~[hadoop-common-2.10.2.jar:?]
        at 
org.apache.flink.yarn.YarnResourceManagerDriver.terminate(YarnResourceManagerDriver.java:223)
 ~[flink-dist-1.17-SNAPSHOT.jar:1.17-SNAPSHOT]
        at 
org.apache.flink.runtime.resourcemanager.active.ActiveResourceManager.terminate(ActiveResourceManager.java:184)
 ~[flink-dist-1.17-SNAPSHOT.jar:1.17-SNAPSHOT]
        at 
org.apache.flink.runtime.resourcemanager.ResourceManager.stopResourceManagerServices(ResourceManager.java:315)
 ~[flink-dist-1.17-SNAPSHOT.jar:1.17-SNAPSHOT]
        at 
org.apache.flink.runtime.resourcemanager.ResourceManager.onStop(ResourceManager.java:301)
 ~[flink-dist-1.17-SNAPSHOT.jar:1.17-SNAPSHOT]
        at 
org.apache.flink.runtime.rpc.RpcEndpoint.internalCallOnStop(RpcEndpoint.java:239)
 ~[flink-dist-1.17-SNAPSHOT.jar:1.17-SNAPSHOT]
        at 
org.apache.flink.runtime.rpc.akka.AkkaRpcActor$StartedState.lambda$terminate$0(AkkaRpcActor.java:578)
 ~[flink-rpc-akka_3f8b2750-5919-429e-97ab-a0077948520e.jar:1.17-SNAPSHOT]
        at 
org.apache.flink.runtime.concurrent.akka.ClassLoadingUtils.runWithContextClassLoader(ClassLoadingUtils.java:83)
 ~[flink-rpc-akka_3f8b2750-5919-429e-97ab-a0077948520e.jar:1.17-SNAPSHOT]
        at 
org.apache.flink.runtime.rpc.akka.AkkaRpcActor$StartedState.terminate(AkkaRpcActor.java:577)
 ~[flink-rpc-akka_3f8b2750-5919-429e-97ab-a0077948520e.jar:1.17-SNAPSHOT]
        at 
org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleControlMessage(AkkaRpcActor.java:196)
 ~[flink-rpc-akka_3f8b2750-5919-429e-97ab-a0077948520e.jar:1.17-SNAPSHOT]
        at akka.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:24) 
[flink-rpc-akka_3f8b2750-5919-429e-97ab-a0077948520e.jar:1.17-SNAPSHOT]
        at akka.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:20) 
[flink-rpc-akka_3f8b2750-5919-429e-97ab-a0077948520e.jar:1.17-SNAPSHOT]
        at scala.PartialFunction.applyOrElse(PartialFunction.scala:127) 
[flink-rpc-akka_3f8b2750-5919-429e-97ab-a0077948520e.jar:1.17-SNAPSHOT]
        at scala.PartialFunction.applyOrElse$(PartialFunction.scala:126) 
[flink-rpc-akka_3f8b2750-5919-429e-97ab-a0077948520e.jar:1.17-SNAPSHOT]
        at akka.japi.pf.UnitCaseStatement.applyOrElse(CaseStatements.scala:20) 
[flink-rpc-akka_3f8b2750-5919-429e-97ab-a0077948520e.jar:1.17-SNAPSHOT]
        at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:175) 
[flink-rpc-akka_3f8b2750-5919-429e-97ab-a0077948520e.jar:1.17-SNAPSHOT]
        at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:176) 
[flink-rpc-akka_3f8b2750-5919-429e-97ab-a0077948520e.jar:1.17-SNAPSHOT]
        at akka.actor.Actor.aroundReceive(Actor.scala:537) 
[flink-rpc-akka_3f8b2750-5919-429e-97ab-a0077948520e.jar:1.17-SNAPSHOT]
        at akka.actor.Actor.aroundReceive$(Actor.scala:535) 
[flink-rpc-akka_3f8b2750-5919-429e-97ab-a0077948520e.jar:1.17-SNAPSHOT]
        at akka.actor.AbstractActor.aroundReceive(AbstractActor.scala:220) 
[flink-rpc-akka_3f8b2750-5919-429e-97ab-a0077948520e.jar:1.17-SNAPSHOT]
        at akka.actor.ActorCell.receiveMessage(ActorCell.scala:579) 
[flink-rpc-akka_3f8b2750-5919-429e-97ab-a0077948520e.jar:1.17-SNAPSHOT]
        at akka.actor.ActorCell.invoke(ActorCell.scala:547) 
[flink-rpc-akka_3f8b2750-5919-429e-97ab-a0077948520e.jar:1.17-SNAPSHOT]
        at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:270) 
[flink-rpc-akka_3f8b2750-5919-429e-97ab-a0077948520e.jar:1.17-SNAPSHOT]
        at akka.dispatch.Mailbox.run(Mailbox.scala:231) 
[flink-rpc-akka_3f8b2750-5919-429e-97ab-a0077948520e.jar:1.17-SNAPSHOT]
        at akka.dispatch.Mailbox.exec(Mailbox.scala:243) 
[flink-rpc-akka_3f8b2750-5919-429e-97ab-a0077948520e.jar:1.17-SNAPSHOT]
        at java.util.concurrent.ForkJoinTask.doExec(ForkJoinTask.java:290) [?:?]
        at 
java.util.concurrent.ForkJoinPool$WorkQueue.topLevelExec(ForkJoinPool.java:1020)
 [?:?]
        at java.util.concurrent.ForkJoinPool.scan(ForkJoinPool.java:1656) [?:?]
        at java.util.concurrent.ForkJoinPool.runWorker(ForkJoinPool.java:1594) 
[?:?]
        at 
java.util.concurrent.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:183) 
[?:?]
   2022-11-22 12:30:14,463 ERROR 
org.apache.hadoop.yarn.client.api.impl.NMClientImpl          [] - Failed to 
stop Container container_1669116565071_0002_01_000002when stopping NMClientImpl
   
   ```
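
   In the log above the job only goes CREATED -> RUNNING -> SUSPENDED and never 
reaches FINISHED. A quick way to pull those job-level transitions out of a log 
like this is sketched below. It is only an illustration: the helper name 
`job_transitions` is made up, it is not part of Flink or of the test, and it 
assumes the wrapped log lines can be rejoined with single spaces.

```python
import re

# Matches ExecutionGraph messages of the form
# "Job <name> (<32 hex chars>) switched from state CREATED to RUNNING."
# Task-level messages ("switched from CREATED to SCHEDULED") lack the word
# "state" and are deliberately not matched.
JOB_TRANSITION = re.compile(
    r"Job (?P<name>.+?) \((?P<job_id>[0-9a-f]{32})\) "
    r"switched from state (?P<old>[A-Z]+) to (?P<new>[A-Z]+)"
)


def job_transitions(log_text: str) -> list[tuple[str, str, str]]:
    """Hypothetical helper: return (job_id, old_state, new_state) tuples."""
    # The mail hard-wrapped long log lines, so collapse whitespace runs first.
    flat = " ".join(log_text.split())
    return [
        (m.group("job_id"), m.group("old"), m.group("new"))
        for m in JOB_TRANSITION.finditer(flat)
    ]


# Two entries copied from the log above, wrapped the same way as in the mail.
sample = """
   2022-11-22 12:30:04,612 INFO
org.apache.flink.runtime.executiongraph.ExecutionGraph       [] - Job WordCount
Example (56e7231d71adead5c4c3409b1e8f295c) switched from state CREATED to
RUNNING.
   2022-11-22 12:30:14,104 INFO
org.apache.flink.runtime.executiongraph.ExecutionGraph       [] - Job WordCount
Example (56e7231d71adead5c4c3409b1e8f295c) switched from state RUNNING to
SUSPENDED.
"""

for job_id, old, new in job_transitions(sample):
    print(f"{job_id}: {old} -> {new}")
```

   Running the same helper over the full log should make it easy to check 
whether a FINISHED transition ever shows up once the test is fixed.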


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.