[ https://issues.apache.org/jira/browse/YARN-2095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Vinod Kumar Vavilapalli resolved YARN-2095. ------------------------------------------- Resolution: Invalid [~sunliners81], we have run much bigger jobs (100K maps) and those that run for long time without any issues. There is only one limitation that I know of - in secure clusters tokens expire after 7 days. In any case, please pursue this on user mailing lists and create a bug when you are sure there is one. Closing this as invalid for now, please reopen if you disagree. > Large MapReduce Job stops responding > ------------------------------------ > > Key: YARN-2095 > URL: https://issues.apache.org/jira/browse/YARN-2095 > Project: Hadoop YARN > Issue Type: Bug > Affects Versions: 2.2.0 > Environment: CentOS 6.3 (x86_64) on vmware 10 running HDP-2.0.6 > Reporter: Clay McDonald > Priority: Blocker > > Very large jobs (7,455 Mappers and 999 Reducers) hang. Jobs run well but > logging to container logs stop after running 33 hours. The job appears to be > hung. The status of the job is "RUNNING". No error messages found in logs. -- This message was sent by Atlassian JIRA (v6.2#6252)