Marcelo Vanzin created SPARK-21728:
--------------------------------------
Summary: Allow SparkSubmit to use logging
Key: SPARK-21728
URL: https://issues.apache.org/jira/browse/SPARK-21728
Project: Spark
Issue Type: Improvement
Components: Spark Core
Affects Versions: 2.3.0
Reporter: Marcelo Vanzin
Priority: Minor
Currently, code in {{SparkSubmit}} cannot call classes or methods that
initialize the Spark {{Logging}} framework. That is because, at that point,
{{SparkSubmit}} doesn't yet know which application will run, and logging is
initialized differently for certain special applications (notably, the shells).
It would be better if either {{SparkSubmit}} did logging initialization earlier
based on the application to be run, or did it in a way that could be overridden
later when the app initializes.
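
The second option (initialize early, but let the app override later) could be sketched roughly as below. This is only an illustration under stated assumptions: the class {{OverridableLogging}} and its methods are hypothetical names, it uses {{java.util.logging}} rather than Spark's actual {{Logging}} trait, and the quieter level for shells is an arbitrary choice for the example.

```java
import java.util.logging.Level;
import java.util.logging.Logger;

/** Hypothetical sketch; this is not Spark's Logging trait or its API. */
final class OverridableLogging {
    // True once any initialization has run.
    private static boolean initialized = false;
    // True while only the provisional (launcher) defaults are in effect.
    private static boolean defaultOnly = false;

    private OverridableLogging() {}

    /** Provisional setup, callable by a launcher before the application is known. */
    static synchronized void initializeDefault() {
        if (!initialized) {
            Logger.getLogger("").setLevel(Level.INFO); // root logger
            initialized = true;
            defaultOnly = true;
        }
    }

    /**
     * App-specific setup. It replaces provisional defaults, but it does not
     * replace an earlier app-specific initialization.
     */
    static synchronized void initializeForApp(boolean isShell) {
        if (!initialized || defaultOnly) {
            // Assumption for illustration: shells get a quieter level.
            Logger.getLogger("").setLevel(isShell ? Level.WARNING : Level.INFO);
            initialized = true;
            defaultOnly = false;
        }
    }

    /** Level currently set on the root logger. */
    static synchronized Level currentLevel() {
        return Logger.getLogger("").getLevel();
    }
}
```

With something like this, launcher code could call {{initializeDefault()}} and log freely, and a shell could still call {{initializeForApp(true)}} afterwards to apply its own settings.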
Without this, there are currently a few parts of {{SparkSubmit}} that
duplicate code from other parts of Spark just to avoid logging. For example:
* [downloadFiles|https://github.com/apache/spark/blob/master/core/src/main/scala/org/apache/spark/deploy/SparkSubmit.scala#L860] replicates code from {{Utils.scala}}
* [createTempDir|https://github.com/apache/spark/blob/master/core/src/main/scala/org/apache/spark/deploy/DependencyUtils.scala#L54] replicates code from {{Utils.scala}} and installs its own shutdown hook
* a few parts of the code could use {{SparkConf}} but can't right now because of the logging issue.