firegun created MAPREDUCE-5606: ---------------------------------- Summary: JobTracker blocked for DFSClient: Failed recovery attempt Key: MAPREDUCE-5606 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5606 Project: Hadoop Map/Reduce Issue Type: Bug Components: jobtracker Affects Versions: 1.0.3 Environment: centos 5.8 jdk 1.7 Reporter: firegun Priority: Critical
when a datanode was crash,the server can ping ok,but can not call rpc ,and also can not ssh login. and then jobTracker may be request a block on this datanode. it will happened ,the JobTracker can not work,the webUI is also unwork,hadoop job -list also unwork,the jobTracker logs no other info . and then we need to restart the datanode. then jobTraker can work too,but the taskTracker num come to zero, we need run : hadoop mradmin -refreshNodes then the JobTracker begin to add taskTraker ,but is very slowly. this problem occur 5time in 2weeks. -- This message was sent by Atlassian JIRA (v6.1#6144)