Szilard Nemeth created YARN-10323:
-------------------------------------

             Summary: [Umbrella] YARN Debuggability and Supportability 
Improvements
                 Key: YARN-10323
                 URL: https://issues.apache.org/jira/browse/YARN-10323
             Project: Hadoop YARN
          Issue Type: Improvement
            Reporter: Szilard Nemeth


Troubleshooting YARN problems can be difficult on a production environment.
Collecting data before problems occur or actively collecting data in an 
on-demand basis could truly help tracking down issues.

Some examples: 
1. If application is hanging, application logs along with RM / NM logs could be 
collected, plus jstack of either the YARN daemons or the application container.
2. Similarly, when an application fails we may collect data.
3. Scheduler issues are quite common so good tooling that helps to spot issues 
would be crucial.

Design document will be added later.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org

Reply via email to