Alan Myrvold created BEAM-3585:
----------------------------------

             Summary: Python dataflow job fails with 2.3.0 RC1, due to missing 
worker image
                 Key: BEAM-3585
                 URL: https://issues.apache.org/jira/browse/BEAM-3585
             Project: Beam
          Issue Type: Bug
          Components: examples-python
    Affects Versions: 2.3.0
            Reporter: Alan Myrvold
            Assignee: Alan Myrvold


The dataflow python jobs currently fail due to a missing docker image when 
using 2.3.0 RC1. This is not a bug in the SDK, the worker image needs to be 
published by google. I will be coordinating the worker image publication.

# Update to your own project and bucket.

GCS_BUCKET=my-cloud-storage-bucket

GCP_PROJECT=my-cloud-project

virtualenv env

. env/bin/activate

wget 
https://dist.apache.org/repos/dist/dev/beam/2.3.0/apache-beam-2.3.0-python.zip

pip install apache-beam-2.3.0-python.zip[gcp]

python -m apache_beam.examples.wordcount --input 
gs://dataflow-samples/shakespeare/kinglear.txt --output 
gs://${GCS_BUCKET}/counts  --runner DataflowRunner --project ${GCP_PROJECT} 
--temp_location gs://${GCS_BUCKET}/tmp --sdk_location 
apache-beam-2.3.0-python.zip

 

Dataflow logs contain:

{{E  Handler for GET /v1.27/images/dataflow.gcr.io/v1beta3/python:2.3.0/json 
returned error: No such image: dataflow.gcr.io/v1beta3/python:2.3.0 }}
{{E  container start failed: ImagePullBackOff: Back-off pulling image 
"dataflow.gcr.io/v1beta3/python:2.3.0"}}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to