Marcelo Vanzin created SPARK-21728:
--------------------------------------

             Summary: Allow SparkSubmit to use logging
                 Key: SPARK-21728
                 URL: https://issues.apache.org/jira/browse/SPARK-21728
             Project: Spark
          Issue Type: Improvement
          Components: Spark Core
    Affects Versions: 2.3.0
            Reporter: Marcelo Vanzin
            Priority: Minor


Currently, code in {{SparkSubmit}} cannot call classes or methods that 
initialize the Spark {{Logging}} framework. That is because at that time 
{{SparkSubmit}} doesn't yet know which application will run, and logging is 
initialized differently for certain special applications (notably, the shells).

It would be better if either {{SparkSubmit}} did logging initialization earlier 
based on the application to be run, or did it in a way that could be overridden 
later when the app initializes.

Without this, there are currently a few parts of {{SparkSubmit}} that 
duplicates code from other parts of Spark just to avoid logging. For example:

* 
[downloadFiles|https://github.com/apache/spark/blob/master/core/src/main/scala/org/apache/spark/deploy/SparkSubmit.scala#L860]
 replicates code from Utils.scala
* 
[createTempDir|https://github.com/apache/spark/blob/master/core/src/main/scala/org/apache/spark/deploy/DependencyUtils.scala#L54]
 replicates code from Utils.scala and installs its own shutdown hook
* a few parts of the code could use {{SparkConf}} but can't right now because 
of the logging issue.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to