Chris Chow created AIRFLOW-2469:
-----------------------------------

             Summary: example task in documentation causes dataflow operator to 
fail
                 Key: AIRFLOW-2469
                 URL: https://issues.apache.org/jira/browse/AIRFLOW-2469
             Project: Apache Airflow
          Issue Type: Bug
          Components: Dataflow
    Affects Versions: 1.9.0
            Reporter: Chris Chow


https://github.com/apache/incubator-airflow/blob/c7a472ed6b0d8a4720f57ba1140c8cf665757167/airflow/contrib/operators/dataflow_operator.py#L176
{noformat}
t1 = DataflowTemplateOperator(
task_id='datapflow_example',
template='{{var.value.gcp_dataflow_base}}',
parameters={
'inputFile': "gs://bucket/input/my_input.txt",
'outputFile': "gs://bucket/output/my_output.txt"
},
gcp_conn_id='gcp-airflow-service-account',
dag=my-dag){noformat}
If you actually name a dataflow task 'datapflow_example', the Google dataflow 
service will not accept the job because it is not named correctly. Dataflow job 
names can't have '_' in them. Strictly speaking, apache beam jobnames must 
adhere to the regex 

[a-z]([-a-z0-9]*[a-z0-9])?.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to