[ https://issues.apache.org/jira/browse/HADOOP-5719?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sreekanth Ramakrishnan updated HADOOP-5719:
-------------------------------------------

    Attachment: HADOOP-5719-2.patch

Attaching patch incorporating most of Vinod's comments.

bq. I think a better place for job removal from the JobQueuesManager is the cleanUpInitializedJobsList() method of the JobInitializationPoller. We may want to rename this method and change its javadoc a bit.

This has not been incorporated because of the issue described in HADOOP-5020, which is hit when {{JobInProgress.initTasks()}} throws an exception and terminate job is called; in that case the Capacity Scheduler would never be able to remove the job from the waiting queue.

Also added a new test case, {{TestJobInitalizationPoller}}, which uses {{MiniMRCluster}} to verify that jobs failing initialization are actually removed from the waiting queue.

> Jobs failed during job initalization are never removed from Capacity Schedulers waiting list
> --------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-5719
>                 URL: https://issues.apache.org/jira/browse/HADOOP-5719
>             Project: Hadoop Core
>          Issue Type: Bug
>          Components: contrib/capacity-sched
>            Reporter: Sreekanth Ramakrishnan
>            Assignee: Sreekanth Ramakrishnan
>         Attachments: HADOOP-5719-1.patch, HADOOP-5719-2.patch
>
>
> Jobs which fail during initialization are never removed from the Capacity Scheduler's waiting job list.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.