Clay McDonald created YARN-2095:
-----------------------------------
Summary: Large MapReduce Job stops responding
Key: YARN-2095
URL: https://issues.apache.org/jira/browse/YARN-2095
Project: Hadoop YARN
Issue Type: Bug
Affects Versions: 2.2.0
Environment: CentOS 6.3 (x86_64) on vmware 10 running HDP-2.0.6
Reporter: Clay McDonald
Priority: Blocker
Very large jobs (7,455 Mappers and 999 Reducers) hang. Jobs run well but
logging to container logs stop after running 33 hours. The job appears to be
hung. The status of the job is "RUNNING". No error messages found in logs.
--
This message was sent by Atlassian JIRA
(v6.2#6252)