[ 
https://issues.apache.org/jira/browse/AIRFLOW-4717?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ash Berlin-Taylor updated AIRFLOW-4717:
---------------------------------------
    Summary: The spark_binary arg to SparkSubmitOperator has no effect when 
connection exists.  (was: The spark_binary option does not apply in 
sparkSubmitOperator.)

> The spark_binary arg to SparkSubmitOperator has no effect when connection 
> exists.
> ---------------------------------------------------------------------------------
>
>                 Key: AIRFLOW-4717
>                 URL: https://issues.apache.org/jira/browse/AIRFLOW-4717
>             Project: Apache Airflow
>          Issue Type: Bug
>          Components: hooks
>    Affects Versions: 1.10.3
>            Reporter: yunsu lee
>            Assignee: yunsu lee
>            Priority: Major
>
> Depending on the Apache Spark distribution, the name of the spark-submit 
> binary may differ (e.g. spark2-submit).
> For this reason, the spark_binary option was added to SparkSubmitOperator
> (Reference: [https://github.com/apache/airflow/pull/4360/files])
>  
> However, this option has no effect when a connection exists.
> This is because spark_submit_hook.py contains logic that overrides the 
> spark_binary value with the connection's spark-binary extra, falling back to 
> the hard-coded default "spark-submit"
> (Full path: airflow/contrib/hooks/spark_submit_hook.py)
>  
> {code:python}
> ...
> conn_data['spark_binary'] = extra.get('spark-binary', "spark-submit")
> ...{code}
> To fix this, the line above needs to be deleted.
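
The override described above can be illustrated with a minimal, standalone
sketch. It does not import Airflow; the function names resolve_as_reported and
resolve_with_fallback are hypothetical and exist only for this illustration.
The first function mimics the quoted line, the second shows one possible
alternative to deleting it, under the assumption that a value passed to the
operator should take precedence over the connection extra.

```python
DEFAULT_BINARY = "spark-submit"


def resolve_as_reported(extra, spark_binary=None):
    """Mimic the behaviour described in the report (hypothetical helper).

    `extra` stands in for the connection's extra dict, `spark_binary` for
    the value passed to the operator.
    """
    conn_data = {"spark_binary": spark_binary or DEFAULT_BINARY}
    # Same expression as the quoted line: the operator-supplied value is
    # overwritten whenever the connection is found.
    conn_data["spark_binary"] = extra.get("spark-binary", DEFAULT_BINARY)
    return conn_data["spark_binary"]


def resolve_with_fallback(extra, spark_binary=None):
    """One possible alternative (an assumption, not the project's fix).

    Prefer the operator argument, then the connection extra, then the
    default binary name.
    """
    return spark_binary or extra.get("spark-binary", DEFAULT_BINARY)


if __name__ == "__main__":
    # The operator asks for spark2-submit; the connection has no extra.
    print(resolve_as_reported({}, "spark2-submit"))
    print(resolve_with_fallback({}, "spark2-submit"))
```

With an empty connection extra, the first call yields the default name even
though spark2-submit was requested, which matches the reported symptom. The
second keeps the requested name while still honouring a spark-binary extra
when no argument is given.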



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
