Léopold Boudard created BEAM-9645: ------------------------------------- Summary: Python flinkrunner cannot inspect container Key: BEAM-9645 URL: https://issues.apache.org/jira/browse/BEAM-9645 Project: Beam Issue Type: Bug Components: runner-flink Affects Versions: 2.19.0 Environment: dataproc cluster, running flink version 1.9 Reporter: Léopold Boudard
Hi, I'm trying to submit a python pipeline job as portable runner with FlinkRunner, though I can't see error logs since it fails retrieving logs/state from underlying container: {code:java} Caused by: java.io.IOException: Received exit code 1 for command 'docker inspect -f {{.State.Running}} 248d660be908ef58385eb962658cee831c4c3b1be9ea1f835e4563f53016363b'. stderr: Error: No such object: 248d660be908ef58385eb962658cee831c4c3b1be9ea1f835e4563f53016363bCaused by: java.io.IOException: Received exit code 1 for command 'docker inspect -f {{.State.Running}} 248d660be908ef58385eb962658cee831c4c3b1be9ea1f835e4563f53016363b'. stderr: Error: No such object: 248d660be908ef58385eb962658cee831c4c3b1be9ea1f835e4563f53016363b at org.apache.beam.runners.fnexecution.environment.DockerCommand.runShortCommand(DockerCommand.java:234) at org.apache.beam.runners.fnexecution.environment.DockerCommand.runShortCommand(DockerCommand.java:168) at org.apache.beam.runners.fnexecution.environment.DockerCommand.isContainerRunning(DockerCommand.java:112) at org.apache.beam.runners.fnexecution.environment.DockerEnvironmentFactory.createEnvironment(DockerEnvironmentFactory.java:165) at org.apache.beam.runners.fnexecution.control.DefaultJobBundleFactory$1.load(DefaultJobBundleFactory.java:200) at org.apache.beam.runners.fnexecution.control.DefaultJobBundleFactory$1.load(DefaultJobBundleFactory.java:184) at org.apache.beam.vendor.guava.v26_0_jre.com.google.common.cache.LocalCache$LoadingValueReference.loadFuture(LocalCache.java:3528) at org.apache.beam.vendor.guava.v26_0_jre.com.google.common.cache.LocalCache$Segment.loadSync(LocalCache.java:2277) at org.apache.beam.vendor.guava.v26_0_jre.com.google.common.cache.LocalCache$Segment.lockedGetOrLoad(LocalCache.java:2154) at org.apache.beam.vendor.guava.v26_0_jre.com.google.common.cache.LocalCache$Segment.get(LocalCache.java:2044) at org.apache.beam.vendor.guava.v26_0_jre.com.google.common.cache.LocalCache.get(LocalCache.java:3952) at org.apache.beam.vendor.guava.v26_0_jre.com.google.common.cache.LocalCache.getOrLoad(LocalCache.java:3974) at org.apache.beam.vendor.guava.v26_0_jre.com.google.common.cache.LocalCache$LocalLoadingCache.get(LocalCache.java:4958) at org.apache.beam.vendor.guava.v26_0_jre.com.google.common.cache.LocalCache$LocalLoadingCache.getUnchecked(LocalCache.java:4964) ... 12 more Suppressed: java.io.IOException: Received exit code 1 for command 'docker kill 248d660be908ef58385eb962658cee831c4c3b1be9ea1f835e4563f53016363b'. stderr: Error response from daemon: Cannot kill container: 248d660be908ef58385eb962658cee831c4c3b1be9ea1f835e4563f53016363b: No such container: 248d660be908ef58385eb962658cee831c4c3b1be9ea1f835e4563f53016363b at org.apache.beam.runners.fnexecution.environment.DockerCommand.runShortCommand(DockerCommand.java:234) at org.apache.beam.runners.fnexecution.environment.DockerCommand.runShortCommand(DockerCommand.java:168) at org.apache.beam.runners.fnexecution.environment.DockerCommand.killContainer(DockerCommand.java:148) at org.apache.beam.runners.fnexecution.environment.DockerEnvironmentFactory.createEnvironment(DockerEnvironmentFactory.java:192) ... 22 moreERROR:root:java.io.IOException: Received exit code 1 for command 'docker inspect -f {{.State.Running}} 248d660be908ef58385eb962658cee831c4c3b1be9ea1f835e4563f53016363b'. stderr: Error: No such object: 248d660be908ef58385eb962658cee831c4c3b1be9ea1f835e4563f53016363b[flink-runner-job-invoker] INFO org.apache.beam.runners.fnexecution.artifact.AbstractArtifactRetrievalService - Manifest at /var/folders/s3/29yl1s8125j33_9vb5tczdxw0000gn/T/beam-tempiwx1szcz/artifacts6ptzmo6u/job_a340349c-cc95-4e32-9cbd-7f915c2d0407/MANIFEST has 1 artifact locations[flink-runner-job-invoker] INFO org.apache.beam.runners.fnexecution.artifact.BeamFileSystemArtifactStagingService - Removed dir /var/folders/s3/29yl1s8125j33_9vb5tczdxw0000gn/T/beam-tempiwx1szcz/artifacts6ptzmo6u/job_a340349c-cc95-4e32-9cbd-7f915c2d0407/Traceback (most recent call last): File "importer/test_runner.py", line 46, in <module> run() File "importer/test_runner.py", line 41, in run | 'write to file' >> WriteToText(known_args.output) File "/Users/leopold/.pyenv/versions/BenchmarkListingStreaming/lib/python3.6/site-packages/apache_beam/pipeline.py", line 481, in __exit__ self.run().wait_until_finish() File "/Users/leopold/.pyenv/versions/BenchmarkListingStreaming/lib/python3.6/site-packages/apache_beam/runners/portability/portable_runner.py", line 455, in wait_until_finish self._job_id, self._state, self._last_error_message())) {code} Job args --runner FlinkRunner --flink_master=\{flink_master} --flink_version 1.9 Could you advise on this issue please? Thanks! -- This message was sent by Atlassian Jira (v8.3.4#803005)