[ 
https://issues.apache.org/jira/browse/AIRFLOW-2773?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aizhamal Nurmamat kyzy updated AIRFLOW-2773:
--------------------------------------------
    Component/s: gcp

Moving to gcp component for component refactor.

> DataFlowPythonOperator does not handle correctly task_id containing 
> underscores
> -------------------------------------------------------------------------------
>
>                 Key: AIRFLOW-2773
>                 URL: https://issues.apache.org/jira/browse/AIRFLOW-2773
>             Project: Apache Airflow
>          Issue Type: Bug
>          Components: Dataflow, gcp
>    Affects Versions: 1.9.0
>            Reporter: Evgeny Podlepaev
>            Assignee: Kaxil Naik
>            Priority: Minor
>             Fix For: 2.0.0
>
>
> DataFlowPythonOperator generates a job name that does not get accepted by 
> Dataflow API when task_id contains underscores. Example: 
> DataFlowPythonOperator(task_id='analyze_search_results', ...)
> will lead to the following error:
> ValueError: Pipeline has validations errors: Invalid job_name 
> (analyze_search_results-02e17268); the name must consist of only the 
> characters [-a-z0-9], starting with a letter and ending with a letter or 
> number.
> The fix seems to be as simple as changing 
> DataFlowHook.start_python_dataflow() to do
> name = task_id.replace('_', '-')



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to