Marcelo Vanzin created SPARK-15891:
--------------------------------------
Summary: Make YARN logs less noisy
Key: SPARK-15891
URL: https://issues.apache.org/jira/browse/SPARK-15891
Project: Spark
Issue Type: Bug
Components: YARN
Affects Versions: 2.0.0
Reporter: Marcelo Vanzin
Priority: Minor
Spark can generate a lot of logs when running in YARN mode. The problem is
already a little bad with normal configuration, but is even worse with dynamic
allocation on.
The first problem is that for every executor Spark launches, it will print the
whole command and all the env variables it's setting, even though those are
exactly the same for every executor. That's not too bad with a handful of
executors, but get annoying pretty soon after that. Dynamic allocation makes
that problem worse since there executors constantly being started and then
going away.
Also, there's a lot of logging generated by the dynamic allocation backend code
in the YARN module. We should audit those and make sure they all make sense,
and whether / how to reduce the amount of logging.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]