Ah yeah, after sending the email, I saw that the exit code is in the subject line :)
Can you post the entire log? What I find confusing is this log statement:
"Stopped BLOB server at 0.0.0.0:6124". The BLOB server is usually only
stopped during shutdown. For some reason, the JobManager is in the process
of shutting down.

On Wed, Jul 29, 2020 at 7:38 AM Robert Metzger <rmetz...@apache.org> wrote:

> Hey Alexey,
>
> What is the exit code of the JobManager? Can you check if it has been
> killed by the OOM killer?
> You could also try to run the job with DEBUG log level, it might give us
> an additional indication why the JVM dies.
> What kind of job are you submitting? Is it complicated?
>
> On Sat, Jul 25, 2020 at 6:43 AM Alexey Trenikhun <yen...@msn.com> wrote:
>
>> Hello,
>>
>> I've Flink 1.11.1 session cluster running via docker compose, I upload
>> job jar, when I submit job jobmanager exits without any errors in log:
>>
>> ...
>> {"@timestamp":"2020-07-25T04:32:54.007Z","@version":"1","message":"Starting
>> execution of job katana-fsp (64ff3943fdc5024c5beef1612518c627) under job
>> master id
>> 00000000000000000000000000000000.","logger_name":"org.apache.flink.runtime.jobmaster.JobMaster","thread_name":"flink-akka.actor.default-dispatcher-18","level":"INFO","level_value":20000}
>>
>> {"@timestamp":"2020-07-25T04:32:54.011Z","@version":"1","message":"Stopped
>> BLOB server at
>> 0.0.0.0:6124","logger_name":"org.apache.flink.runtime.blob.BlobServer","thread_name":"BlobServer
>> shutdown hook","level":"INFO","level_value":20000}
>> {"@timestamp":"2020-07-25T04:32:54.015Z","@version":"1","message":"Starting
>> scheduling with scheduling strategy
>> [org.apache.flink.runtime.scheduler.strategy.EagerSchedulingStrategy]","logger_name":"org.apache.flink.runtime.jobmaster.JobMaster","thread_name":"flink-akka.actor.default-dispatcher-18","level":"INFO","level_value":20000}
>> {"@timestamp":"2020-07-25T04:32:54.016Z","@version":"1","message":"Job
>> katana-fsp (64ff3943fdc5024c5beef1612518c627) switched from state CREATED
>> to
>> RUNNING.","logger_name":"org.apache.flink.runtime.executiongraph.ExecutionGraph","thread_name":"flink-akka.actor.default-dispatcher-18","level":"INFO","level_value":20000}
>>
>> Any ideas how to diagnose it?
>>
>> Thanks,
>> Alexey
>>
>
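
To find where the shutdown starts in a longer log, something like the sketch below might help. It assumes the JobManager log holds one JSON object per line with the same fields as the excerpt above ("@timestamp", "thread_name", "message"), unwrapped from the mail line breaks. The function name find_shutdown_hook_entries and the sample list are made up for illustration and are not part of Flink.

```python
import json


def find_shutdown_hook_entries(lines):
    """Return (timestamp, thread_name, message) for entries logged by shutdown-hook threads.

    Assumes one JSON object per line, with the field names seen in the
    log excerpt quoted in this thread.
    """
    hits = []
    for line in lines:
        line = line.strip()
        if not line.startswith("{"):
            continue  # skip blank lines and non-JSON output
        try:
            entry = json.loads(line)
        except json.JSONDecodeError:
            continue  # skip truncated or wrapped lines
        thread = str(entry.get("thread_name", ""))
        if "shutdown hook" in thread.lower():
            hits.append((entry.get("@timestamp"), thread, entry.get("message")))
    return hits


# Two entries from the excerpt, with the mail line wrapping removed (assumption).
sample = [
    '{"@timestamp":"2020-07-25T04:32:54.011Z","@version":"1",'
    '"message":"Stopped BLOB server at 0.0.0.0:6124",'
    '"logger_name":"org.apache.flink.runtime.blob.BlobServer",'
    '"thread_name":"BlobServer shutdown hook","level":"INFO","level_value":20000}',
    '{"@timestamp":"2020-07-25T04:32:54.015Z","@version":"1",'
    '"message":"Starting scheduling with scheduling strategy '
    '[org.apache.flink.runtime.scheduler.strategy.EagerSchedulingStrategy]",'
    '"logger_name":"org.apache.flink.runtime.jobmaster.JobMaster",'
    '"thread_name":"flink-akka.actor.default-dispatcher-18",'
    '"level":"INFO","level_value":20000}',
]

for timestamp, thread, message in find_shutdown_hook_entries(sample):
    print(timestamp, thread, message)
```

The timestamp of the first hit roughly marks when the JVM began shutting down, so the entries just before it are the ones worth reading at DEBUG level. A shutdown-hook entry also hints at an orderly JVM exit (for example System.exit or SIGTERM) rather than a SIGKILL, since shutdown hooks do not run on SIGKILL.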