Léopold Boudard created BEAM-9645:
-------------------------------------

             Summary: Python flinkrunner cannot inspect container
                 Key: BEAM-9645
                 URL: https://issues.apache.org/jira/browse/BEAM-9645
             Project: Beam
          Issue Type: Bug
          Components: runner-flink
    Affects Versions: 2.19.0
         Environment: dataproc cluster, running flink version 1.9
            Reporter: Léopold Boudard


Hi,

I'm trying to submit a Python pipeline job through the portable runner with 
FlinkRunner, but I can't see the actual error logs because the runner fails to 
retrieve logs/state from the underlying container:
{code:java}
Caused by: java.io.IOException: Received exit code 1 for command 'docker inspect -f {{.State.Running}} 248d660be908ef58385eb962658cee831c4c3b1be9ea1f835e4563f53016363b'. stderr: Error: No such object: 248d660be908ef58385eb962658cee831c4c3b1be9ea1f835e4563f53016363b
Caused by: java.io.IOException: Received exit code 1 for command 'docker inspect -f {{.State.Running}} 248d660be908ef58385eb962658cee831c4c3b1be9ea1f835e4563f53016363b'. stderr: Error: No such object: 248d660be908ef58385eb962658cee831c4c3b1be9ea1f835e4563f53016363b
    at org.apache.beam.runners.fnexecution.environment.DockerCommand.runShortCommand(DockerCommand.java:234)
    at org.apache.beam.runners.fnexecution.environment.DockerCommand.runShortCommand(DockerCommand.java:168)
    at org.apache.beam.runners.fnexecution.environment.DockerCommand.isContainerRunning(DockerCommand.java:112)
    at org.apache.beam.runners.fnexecution.environment.DockerEnvironmentFactory.createEnvironment(DockerEnvironmentFactory.java:165)
    at org.apache.beam.runners.fnexecution.control.DefaultJobBundleFactory$1.load(DefaultJobBundleFactory.java:200)
    at org.apache.beam.runners.fnexecution.control.DefaultJobBundleFactory$1.load(DefaultJobBundleFactory.java:184)
    at org.apache.beam.vendor.guava.v26_0_jre.com.google.common.cache.LocalCache$LoadingValueReference.loadFuture(LocalCache.java:3528)
    at org.apache.beam.vendor.guava.v26_0_jre.com.google.common.cache.LocalCache$Segment.loadSync(LocalCache.java:2277)
    at org.apache.beam.vendor.guava.v26_0_jre.com.google.common.cache.LocalCache$Segment.lockedGetOrLoad(LocalCache.java:2154)
    at org.apache.beam.vendor.guava.v26_0_jre.com.google.common.cache.LocalCache$Segment.get(LocalCache.java:2044)
    at org.apache.beam.vendor.guava.v26_0_jre.com.google.common.cache.LocalCache.get(LocalCache.java:3952)
    at org.apache.beam.vendor.guava.v26_0_jre.com.google.common.cache.LocalCache.getOrLoad(LocalCache.java:3974)
    at org.apache.beam.vendor.guava.v26_0_jre.com.google.common.cache.LocalCache$LocalLoadingCache.get(LocalCache.java:4958)
    at org.apache.beam.vendor.guava.v26_0_jre.com.google.common.cache.LocalCache$LocalLoadingCache.getUnchecked(LocalCache.java:4964)
    ... 12 more
    Suppressed: java.io.IOException: Received exit code 1 for command 'docker kill 248d660be908ef58385eb962658cee831c4c3b1be9ea1f835e4563f53016363b'. stderr: Error response from daemon: Cannot kill container: 248d660be908ef58385eb962658cee831c4c3b1be9ea1f835e4563f53016363b: No such container: 248d660be908ef58385eb962658cee831c4c3b1be9ea1f835e4563f53016363b
        at org.apache.beam.runners.fnexecution.environment.DockerCommand.runShortCommand(DockerCommand.java:234)
        at org.apache.beam.runners.fnexecution.environment.DockerCommand.runShortCommand(DockerCommand.java:168)
        at org.apache.beam.runners.fnexecution.environment.DockerCommand.killContainer(DockerCommand.java:148)
        at org.apache.beam.runners.fnexecution.environment.DockerEnvironmentFactory.createEnvironment(DockerEnvironmentFactory.java:192)
        ... 22 more
ERROR:root:java.io.IOException: Received exit code 1 for command 'docker inspect -f {{.State.Running}} 248d660be908ef58385eb962658cee831c4c3b1be9ea1f835e4563f53016363b'. stderr: Error: No such object: 248d660be908ef58385eb962658cee831c4c3b1be9ea1f835e4563f53016363b
[flink-runner-job-invoker] INFO org.apache.beam.runners.fnexecution.artifact.AbstractArtifactRetrievalService - Manifest at /var/folders/s3/29yl1s8125j33_9vb5tczdxw0000gn/T/beam-tempiwx1szcz/artifacts6ptzmo6u/job_a340349c-cc95-4e32-9cbd-7f915c2d0407/MANIFEST has 1 artifact locations
[flink-runner-job-invoker] INFO org.apache.beam.runners.fnexecution.artifact.BeamFileSystemArtifactStagingService - Removed dir /var/folders/s3/29yl1s8125j33_9vb5tczdxw0000gn/T/beam-tempiwx1szcz/artifacts6ptzmo6u/job_a340349c-cc95-4e32-9cbd-7f915c2d0407/
Traceback (most recent call last):
  File "importer/test_runner.py", line 46, in <module>
    run()
  File "importer/test_runner.py", line 41, in run
    | 'write to file' >> WriteToText(known_args.output)
  File "/Users/leopold/.pyenv/versions/BenchmarkListingStreaming/lib/python3.6/site-packages/apache_beam/pipeline.py", line 481, in __exit__
    self.run().wait_until_finish()
  File "/Users/leopold/.pyenv/versions/BenchmarkListingStreaming/lib/python3.6/site-packages/apache_beam/runners/portability/portable_runner.py", line 455, in wait_until_finish
    self._job_id, self._state, self._last_error_message()))
{code}
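
For reference, the check that fails in the trace above can be repeated by hand. The following is only a minimal sketch, not Beam's implementation: it assumes the docker CLI is on the PATH of the machine where it is run, and the helper names inspect_command and is_container_running are hypothetical. It issues the same 'docker inspect -f {{.State.Running}} <id>' command and reports a non-zero exit code instead of raising.

```python
import subprocess


def inspect_command(container_id):
    """Build the same 'docker inspect' invocation that appears in the trace."""
    return ["docker", "inspect", "-f", "{{.State.Running}}", container_id]


def is_container_running(container_id, run=subprocess.run):
    """Return True/False for a known container, or None if docker exits non-zero.

    'run' is injectable (an assumption of this sketch) so the function can be
    exercised without a Docker daemon.
    """
    result = run(
        inspect_command(container_id),
        stdout=subprocess.PIPE,
        stderr=subprocess.PIPE,
        universal_newlines=True,  # text output; also works on Python 3.6
    )
    if result.returncode != 0:
        # In the trace, exit code 1 came with "No such object" on stderr,
        # i.e. the container no longer existed when it was inspected.
        print("docker inspect failed:", result.stderr.strip())
        return None
    return result.stdout.strip() == "true"
```

Calling is_container_running with the container id from the trace, on the host where the container was started, should show whether the container still exists at that point.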
 

Job args:

--runner FlinkRunner --flink_master={flink_master} --flink_version 1.9

Could you advise on this issue please?

Thanks!



