[ 
https://issues.apache.org/jira/browse/SPARK-16784?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16776633#comment-16776633
 ] 

Narcis Andrei Moga edited comment on SPARK-16784 at 2/25/19 8:33 AM:
---------------------------------------------------------------------

I have the same problem after migrating from Spark 2.2.1 to 2.4.0, using 
cluster deploy mode with the standalone cluster manager (it does not happen in 
client deploy mode).

I tested in Docker, and all required files are present in all containers (1 
master and 2 workers; Spark has no extra configuration for this test, it is 
just an untarred distribution).

*1) Executor command observed in the stderr file*

Spark Executor Command: "/srv/java/jdk/bin/java" "-cp" 
"/usr/lib/spark/conf/:/usr/lib/spark/jars/*" "-Xmx1024M" 
"-Dspark.driver.port=45431" "-Dspark.cassandra.connection.port=9042" 
"-Dspark.rpc.askTimeout=10s" "-Dspark.application.ldap.port=55389" 
_*"-Duser.timezone=UTC"*_ 
_*"-Dlog4j.configuration=file:///log4j.properties.executor"*_ 
"-Dcom.sun.management.jmxremote" 
"-Dcom.sun.management.jmxremote.authenticate=false" 
"-Dcom.sun.management.jmxremote.local.only=false" 
"-Dcom.sun.management.jmxremote.ssl=false" "-Djava.net.preferIPv4Stack=true" 
"-Dcom.sun.management.jmxremote.port=0" 
"-Djava.util.logging.config.file=/jmx-logging.properties" 
"org.apache.spark.executor.CoarseGrainedExecutorBackend" "--driver-url" 
"spark://CoarseGrainedScheduler@c1-spark-executor2:45431" "--executor-id" "1" 
"--hostname" "172.18.0.22" "--cores" "1" "--app-id" "app-20190224171936-0010" 
"--worker-url" 
"spark://[email protected]:36555"

*2) Partial command of the Driver observed in the stderr file*

Launch Command: "/srv/java/jdk/bin/java" "-cp" 
"/usr/lib/spark/conf/:/usr/lib/spark/jars/*" "-Xmx1024M" 
_*"-Dspark.driver.extraJavaOptions=-Duser.timezone=UTC 
-Dlog4j.configuration=file:///log4j.properties.driver"*_ 
"-Dspark.kafka.ppu.topic.name=..." 
....
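
A minimal sketch for comparing the two launch commands: it tokenizes a command line as printed in stderr and reports whether an option such as -Dlog4j.configuration appears as its own JVM argument or only inside another argument (as in the part of the driver command shown above). This is a diagnostic aid, not part of Spark; the function names and the "jvm-flag" / "embedded" / "absent" labels are made up for illustration, and because the driver command is truncated, an "embedded" result applies only to the portion pasted in.

```python
from __future__ import annotations

import shlex


def split_launch_command(command: str) -> list[str]:
    """Split a quoted launch command (as printed in stderr) into arguments."""
    return shlex.split(command)


def classify_option(args: list[str], option: str) -> str:
    """Report how `option` (e.g. '-Dlog4j.configuration') appears in `args`.

    'jvm-flag': some argument starts with '<option>=' (a standalone JVM flag)
    'embedded': the option only occurs inside another argument's value
    'absent':   the option does not occur at all
    """
    prefix = option + "="
    if any(arg.startswith(prefix) for arg in args):
        return "jvm-flag"
    if any(prefix in arg for arg in args):
        return "embedded"
    return "absent"


# Shortened copies of the two commands quoted in this comment.
executor_cmd = (
    '"/srv/java/jdk/bin/java" "-Xmx1024M" "-Duser.timezone=UTC" '
    '"-Dlog4j.configuration=file:///log4j.properties.executor"'
)
driver_cmd = (
    '"/srv/java/jdk/bin/java" "-Xmx1024M" '
    '"-Dspark.driver.extraJavaOptions=-Duser.timezone=UTC '
    '-Dlog4j.configuration=file:///log4j.properties.driver"'
)

for name, cmd in [("executor", executor_cmd), ("driver", driver_cmd)]:
    result = classify_option(split_launch_command(cmd), "-Dlog4j.configuration")
    print(name, result)
```

With the shortened commands above, the executor is reported as "jvm-flag" and the driver as "embedded", which matches the difference highlighted in sections 1) and 2).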

*3) Submit command*

spark-submit \
 --deploy-mode cluster \
 --master spark://172.18.0.20:7077 \
 --properties-file /application.properties \
 --class com....... \
 /logs-correlation-2.4.1-1.noarch.jar

*4) application.properties contains*

spark.driver.extraJavaOptions=-Duser.timezone=UTC -Dlog4j.configuration=file:///log4j.properties.driver

spark.executor.extraJavaOptions=-Duser.timezone=UTC -Dlog4j.configuration=file:///log4j.properties.executor
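
To double-check that the files named in these properties really exist inside a given container, something like the following could be run there. It is only a sketch: the helper name log4j_config_path is hypothetical, and it handles only plain file: URLs such as the ones above.

```python
import os
import re
from urllib.parse import urlparse


def log4j_config_path(java_options):
    """Return the local path named by -Dlog4j.configuration=file:..., or None."""
    match = re.search(r"-Dlog4j\.configuration=(\S+)", java_options)
    if match is None:
        return None
    url = urlparse(match.group(1))
    if url.scheme != "file":
        return None
    return url.path


# Values copied from application.properties above.
options = {
    "spark.driver.extraJavaOptions":
        "-Duser.timezone=UTC -Dlog4j.configuration=file:///log4j.properties.driver",
    "spark.executor.extraJavaOptions":
        "-Duser.timezone=UTC -Dlog4j.configuration=file:///log4j.properties.executor",
}

for key, value in options.items():
    path = log4j_config_path(value)
    exists = path is not None and os.path.isfile(path)
    print(key, path, "found" if exists else "missing")
```

The check only confirms that the file is present on the machine where it runs; it says nothing about whether the JVM actually receives the option.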

 

 


> Configurable log4j settings
> ---------------------------
>
>                 Key: SPARK-16784
>                 URL: https://issues.apache.org/jira/browse/SPARK-16784
>             Project: Spark
>          Issue Type: Improvement
>    Affects Versions: 2.0.0, 2.1.0
>            Reporter: Michael Gummelt
>            Priority: Major
>
> I often want to change the logging configuration on a single spark job.  This 
> is easy in client mode.  I just modify log4j.properties.  It's difficult in 
> cluster mode, because I need to modify the log4j.properties in the 
> distribution in which the driver runs.  I'd like a way of setting this 
> dynamically, such as a java system property.  Some brief searching showed 
> that log4j doesn't seem to accept such a property, but I'd like to open up 
> this idea for further comment.  Maybe we can find a solution.


