At http://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-common/ClusterSetup.html#Monitoring_Health_of_NodeManagers is a description of how you can have a script check the health of a node and indicate to the ResourceManager that it is unhealthy. This seems to be at the cluster level. Is there still job level blacklisting as there was in earlier versions?
Chris Mawata
