yunsu lee created AIRFLOW-4717:
----------------------------------

             Summary: The spark_binary option does not apply in 
sparkSubmitOperator.
                 Key: AIRFLOW-4717
                 URL: https://issues.apache.org/jira/browse/AIRFLOW-4717
             Project: Apache Airflow
          Issue Type: Bug
          Components: hooks
    Affects Versions: 1.10.3
            Reporter: yunsu lee


Apache spark depending on the desto, the spark binary name may be different. 
(ex. spark2-submit)
For this reason, the spark_binary option has been added to sparkSubmitOperator

(Reference : [https://github.com/apache/airflow/pull/4360/files])

 

However, this option does not work.

This is because there is logic to hard-code and override the spark-binary 
option value in spark_submit_hook.py

(Full path : airflow/contrib/hooks/spark_submit_hook.py)

 
{code:java}
...
conn_data['spark_binary'] = extra.get('spark-binary', "spark-submit")
...{code}
It is necessary to delete the corresponding line.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to