Alan Myrvold created BEAM-3585:
----------------------------------

             Summary: Python dataflow job fails with 2.3.0 RC1, due to missing worker image
                 Key: BEAM-3585
                 URL: https://issues.apache.org/jira/browse/BEAM-3585
             Project: Beam
          Issue Type: Bug
          Components: examples-python
    Affects Versions: 2.3.0
            Reporter: Alan Myrvold
            Assignee: Alan Myrvold

Python Dataflow jobs currently fail with 2.3.0 RC1 because the worker Docker image is missing. This is not a bug in the SDK; the worker image needs to be published by Google. I will coordinate the worker image publication.

# Update to your own project and bucket.
GCS_BUCKET=my-cloud-storage-bucket
GCP_PROJECT=my-cloud-project

virtualenv env
. env/bin/activate
wget https://dist.apache.org/repos/dist/dev/beam/2.3.0/apache-beam-2.3.0-python.zip
pip install apache-beam-2.3.0-python.zip[gcp]
python -m apache_beam.examples.wordcount \
  --input gs://dataflow-samples/shakespeare/kinglear.txt \
  --output gs://${GCS_BUCKET}/counts \
  --runner DataflowRunner \
  --project ${GCP_PROJECT} \
  --temp_location gs://${GCS_BUCKET}/tmp \
  --sdk_location apache-beam-2.3.0-python.zip

Dataflow logs contain:

{{E Handler for GET /v1.27/images/dataflow.gcr.io/v1beta3/python:2.3.0/json returned error: No such image: dataflow.gcr.io/v1beta3/python:2.3.0}}
{{E container start failed: ImagePullBackOff: Back-off pulling image "dataflow.gcr.io/v1beta3/python:2.3.0"}}
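
To confirm that a failed job hit this same problem, one option is to scan an exported copy of the Dataflow logs for the two messages quoted above and pull out the image reference. The following is only a minimal sketch: it assumes the logs are available as plain text lines, the patterns are modeled solely on the two lines in this report (other Docker or Dataflow versions may word them differently), and `find_missing_images` is a hypothetical helper name, not part of Beam or any Dataflow tooling.

```python
import re

# Patterns modeled on the two log lines quoted in this report (assumption:
# other Docker or Dataflow versions may phrase these messages differently).
_NO_SUCH_IMAGE = re.compile(r"No such image: (\S+)")
_PULL_BACKOFF = re.compile(r'ImagePullBackOff: Back-off pulling image "([^"]+)"')


def find_missing_images(log_lines):
    """Return the sorted, de-duplicated image references that failed to pull.

    Hypothetical helper for illustration; not part of Beam or Dataflow.
    """
    images = set()
    for line in log_lines:
        for pattern in (_NO_SUCH_IMAGE, _PULL_BACKOFF):
            match = pattern.search(line)
            if match:
                images.add(match.group(1))
    return sorted(images)


if __name__ == "__main__":
    sample = [
        "E Handler for GET /v1.27/images/dataflow.gcr.io/v1beta3/python:2.3.0/json "
        "returned error: No such image: dataflow.gcr.io/v1beta3/python:2.3.0",
        "E container start failed: ImagePullBackOff: Back-off pulling image "
        '"dataflow.gcr.io/v1beta3/python:2.3.0"',
    ]
    print(find_missing_images(sample))
```

If the only image reported is the `python:2.3.0` worker image, the failure matches this issue and should clear once the image is published.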