[ https://issues.apache.org/jira/browse/AIRFLOW-2773?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Aizhamal Nurmamat kyzy updated AIRFLOW-2773:
--------------------------------------------
Component/s: (was: Dataflow)
Removing 'dataflow' component tag for component refactor.
> DataFlowPythonOperator does not handle correctly task_id containing
> underscores
> -------------------------------------------------------------------------------
>
> Key: AIRFLOW-2773
> URL: https://issues.apache.org/jira/browse/AIRFLOW-2773
> Project: Apache Airflow
> Issue Type: Bug
> Components: gcp
> Affects Versions: 1.9.0
> Reporter: Evgeny Podlepaev
> Assignee: Kaxil Naik
> Priority: Minor
> Fix For: 2.0.0
>
>
> DataFlowPythonOperator generates a job name that the Dataflow API rejects
> when task_id contains underscores. For example,
> DataFlowPythonOperator(task_id='analyze_search_results', ...)
> fails with the following error:
> ValueError: Pipeline has validations errors: Invalid job_name
> (analyze_search_results-02e17268); the name must consist of only the
> characters [-a-z0-9], starting with a letter and ending with a letter or
> number.
> The fix seems to be as simple as having
> DataFlowHook.start_python_dataflow() replace underscores with hyphens when
> it builds the job name:
> name = task_id.replace('_', '-')
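A minimal sketch of the proposed fix, for illustration only. The helper name build_job_name and its suffix parameter are hypothetical and are not the actual Airflow code; the validation pattern is derived from the error message quoted above, and the 8-character random suffix is an assumption based on the job name shown in that message.

```python
import re
import uuid

# Pattern derived from the quoted error message: only [-a-z0-9],
# starting with a letter and ending with a letter or number.
_VALID_JOB_NAME = re.compile(r"^[a-z]([-a-z0-9]*[a-z0-9])?$")


def build_job_name(task_id, suffix=None):
    """Hypothetical helper: derive a job name from an Airflow task_id."""
    if suffix is None:
        # Assumption: a short random suffix, like "02e17268" in the report.
        suffix = uuid.uuid4().hex[:8]
    # The substitution proposed in the issue.
    base = task_id.replace("_", "-")
    name = "{}-{}".format(base, suffix)
    # Fail early with a clear message instead of waiting for the API to reject it.
    if not _VALID_JOB_NAME.match(name):
        raise ValueError("Invalid job_name ({})".format(name))
    return name


print(build_job_name("analyze_search_results", suffix="02e17268"))
```

Note that the replacement alone does not cover every invalid task_id (uppercase letters or a leading digit would still be rejected), which is why the sketch validates the result instead of assuming it is valid.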
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)