[ https://issues.apache.org/jira/browse/YARN-10323?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Szilard Nemeth updated YARN-10323: ---------------------------------- Summary: [Umbrella] YARN Diagnostic collector (was: [Umbrella] YARN Debuggability and Supportability Improvements) > [Umbrella] YARN Diagnostic collector > ------------------------------------ > > Key: YARN-10323 > URL: https://issues.apache.org/jira/browse/YARN-10323 > Project: Hadoop YARN > Issue Type: Improvement > Reporter: Szilard Nemeth > Priority: Major > > Troubleshooting YARN problems can be difficult on a production environment. > Collecting data before problems occur or actively collecting data in an > on-demand basis could truly help tracking down issues. > Some examples: > 1. If application is hanging, application logs along with RM / NM logs could > be collected, plus jstack of either the YARN daemons or the application > container. > 2. Similarly, when an application fails we may collect data. > 3. Scheduler issues are quite common so good tooling that helps to spot > issues would be crucial. > Design document will be added later. -- This message was sent by Atlassian Jira (v8.3.4#803005) --------------------------------------------------------------------- To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org