For each container being activated, the Slider agent knows the log
locations. If the component instance (within the container) fails to
activate properly or goes down, the user may need access to those log
locations for debugging. *This assumes that the appropriate YARN
configuration is in place to keep the log files around for a while.*
What do you think about having the agent report this information (log
locations, etc.) back so that it can be published? There needs to be a
limit on how long the information stays available, since the number of
failed containers may grow over time. An obvious limit would be the YARN
setting for how long the logs are kept on the host.
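To make the retention idea concrete, here is a minimal sketch of pruning published entries once they are older than the log retention window. Everything in it is an assumption for illustration: the `reported_at` timestamp field, the `prune_expired` helper, and the idea that the retention value is read from whichever YARN setting governs local log cleanup on the cluster. None of this is existing Slider code.

```python
import time


def prune_expired(entries, retention_seconds, now=None):
    """Return only the entries reported within the retention window.

    `entries` maps a container id to a dict holding a hypothetical
    `reported_at` field (seconds since the epoch). `retention_seconds`
    is assumed to mirror the YARN log retention configured on the host.
    """
    if now is None:
        now = time.time()
    return {
        container_id: entry
        for container_id, entry in entries.items()
        if now - entry["reported_at"] < retention_seconds
    }
```

Whoever holds the published data could call this periodically, so the set of failed-container entries stays bounded by the same window as the logs themselves.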
Should we publish them as:
{container_id: {
    "hostname": "name of the host",
    "agent_log": "folder path to agent log",
    "app_log": "folder path to app log" }}
We can add more diagnostic information as needed.
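As a rough illustration of the proposed shape, the sketch below builds one such record and serializes it to JSON. The function name `build_log_record` and the `**extra` mechanism for additional diagnostics are hypothetical, chosen only to show how the structure could grow without changing the three core fields.

```python
import json


def build_log_record(container_id, hostname, agent_log, app_log, **extra):
    """Build a single published entry keyed by container id.

    The three core fields come from the proposal above; anything passed
    via `extra` is a hypothetical additional diagnostic field.
    """
    record = {
        "hostname": hostname,
        "agent_log": agent_log,
        "app_log": app_log,
    }
    record.update(extra)
    return {container_id: record}


def to_json(record):
    """Serialize a record for publishing; sorted keys keep output stable."""
    return json.dumps(record, indent=2, sort_keys=True)
```

Keeping the container id as the top-level key means entries from many containers can be merged into one map without collisions.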
Any suggestions on a good location to publish this?
-Sumit