Sajid,

Check the logs for your container at $HADOOP_INSTALL_DIR/logs/userlogs/<application_id>/<container_id>. Note that these logs are written on the node where the container itself ran, which is not necessarily the node where your Application Master is running.

If the container could not start, printing the stack trace in the onStartContainerError(ContainerId containerId, Throwable t) callback method of the NMClientAsync.CallbackHandler interface may tell you why. You should also always capture the exit status of the container in the onContainersCompleted(List<ContainerStatus> completedContainers) callback method of the AMRMClientAsync.CallbackHandler interface.

Please see the source of the Distributed Shell application on GitHub for an example of how this is done:
https://github.com/apache/hadoop-common/tree/trunk/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-applications-distributedshell/src/main/java/org/apache/hadoop/yarn/applications/distributedshell

Regards,
Rahul Chhiber

From: Sajid Syed [mailto:[email protected]]
Sent: Thursday, December 18, 2014 9:44 AM
To: [email protected]
Subject: How to handle Container crash in YARN

Hello,

Can anyone please explain how to handle/resolve a container crash in YARN Hadoop?

Thanks,
Sajid Syed
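
The exit-status handling described in the reply above can be sketched roughly as follows. This is a minimal, stand-alone illustration and not the Distributed Shell code itself. CompletedContainer is a hypothetical stand-in for YARN's ContainerStatus (whose getContainerId(), getExitStatus() and getDiagnostics() getters would be used in a real Application Master), so the example compiles without Hadoop on the classpath. The container IDs and diagnostics strings are made up. The two constants are assumed to mirror ContainerExitStatus.SUCCESS and ContainerExitStatus.ABORTED.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

/**
 * Stand-alone sketch of what an onContainersCompleted callback might do
 * with each container's exit status. Not tied to the Hadoop libraries.
 */
final class ContainerCompletionSketch {

    // Assumed to mirror ContainerExitStatus.SUCCESS and ContainerExitStatus.ABORTED.
    static final int SUCCESS = 0;
    static final int ABORTED = -100;

    /** Hypothetical stand-in for YARN's ContainerStatus. */
    static final class CompletedContainer {
        final String containerId;
        final int exitStatus;
        final String diagnostics;

        CompletedContainer(String containerId, int exitStatus, String diagnostics) {
            this.containerId = containerId;
            this.exitStatus = exitStatus;
            this.diagnostics = diagnostics;
        }
    }

    /** Tallies for one batch of completed containers. */
    static final class Summary {
        int succeeded;
        int failed;
        final List<String> toRetry = new ArrayList<>();
    }

    /** Classifies each completed container by its exit status. */
    static Summary summarize(List<CompletedContainer> completed) {
        Summary summary = new Summary();
        for (CompletedContainer c : completed) {
            if (c.exitStatus == SUCCESS) {
                summary.succeeded++;
            } else if (c.exitStatus == ABORTED) {
                // The framework killed or lost the container, so the work
                // itself did not fail: remember it so a replacement can be requested.
                summary.toRetry.add(c.containerId);
            } else {
                // The process inside the container exited with an error.
                summary.failed++;
                System.err.println("Container " + c.containerId
                        + " failed with exit status " + c.exitStatus
                        + ": " + c.diagnostics);
            }
        }
        return summary;
    }

    public static void main(String[] args) {
        // Made-up sample data for illustration only.
        Summary s = summarize(Arrays.asList(
                new CompletedContainer("container_01", 0, ""),
                new CompletedContainer("container_02", 1, "command exited with code 1"),
                new CompletedContainer("container_03", ABORTED, "released by framework")));
        System.out.println("succeeded=" + s.succeeded
                + " failed=" + s.failed + " retry=" + s.toRetry);
    }
}
```

In a real Application Master the same branching would sit inside onContainersCompleted, and the retry list would translate into new container requests to the ResourceManager. Whether an aborted container should be retried is a design choice for the application.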
