[
https://issues.apache.org/jira/browse/MAPREDUCE-746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12729570#action_12729570
]
Iyappan Srinivasan commented on MAPREDUCE-746:
----------------------------------------------
The job tracker logs just before this null pointer exception. I have replaced
teh anme of killed task tracker server name with this string "<killed TT>"
"2009-07-10 07:51:43,626 INFO org.apache.hadoop.mapred.JobTracker: Adding
<killed TT> to the blacklist across all jobs
2009-07-10 07:51:43,628 INFO org.apache.hadoop.mapred.JobTracker: Blacklisting
tracker : <killed TT> Reason for blacklisting is : EXCEEDING_FAILURES
2009-07-10 07:51:43,631 INFO org.apache.hadoop.ipc.Server: IPC Server handler 0
on 9020, call heartbeat(org.apache.hadoop.mapred.tasktrackersta...@1fb050c,
false, false, true, 269) from 98.136.99.54:55815: error: java.io.IOException:
java.lang.NullPointerException
java.io.IOException: java.lang.NullPointerException"
> When a task tracker is killed, there is a Null Pointer exception thrown.
> -------------------------------------------------------------------------
>
> Key: MAPREDUCE-746
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-746
> Project: Hadoop Map/Reduce
> Issue Type: Bug
> Components: jobtracker
> Reporter: Iyappan Srinivasan
> Priority: Minor
>
> When a task tracker is killed, the job completes. But tehre is a null pointer
> exception thrown:
> java.io.IOException: java.lang.NullPointerException
> at
> org.apache.hadoop.mapred.JobTracker$FaultyTrackersInfo.removeHostCapacity(JobTracker.java:759)
> at
> org.apache.hadoop.mapred.JobTracker$FaultyTrackersInfo.blackListTracker(JobTracker.java:624)
> at
> org.apache.hadoop.mapred.JobTracker$FaultyTrackersInfo.incrementFaults(JobTracker.java:601)
> at org.apache.hadoop.mapred.JobTracker.finalizeJob(JobTracker.java:2337)
> at
> org.apache.hadoop.mapred.JobInProgress.garbageCollect(JobInProgress.java:2998)
> at
> org.apache.hadoop.mapred.JobInProgress.jobComplete(JobInProgress.java:2584)
> at
> org.apache.hadoop.mapred.JobInProgress.completedTask(JobInProgress.java:2473)
> at
> org.apache.hadoop.mapred.JobInProgress.updateTaskStatus(JobInProgress.java:1047)
> at
> org.apache.hadoop.mapred.JobTracker.updateTaskStatuses(JobTracker.java:3867)
> at
> org.apache.hadoop.mapred.JobTracker.processHeartbeat(JobTracker.java:3079)
> at org.apache.hadoop.mapred.JobTracker.heartbeat(JobTracker.java:2817)
> at sun.reflect.GeneratedMethodAccessor4.invoke(Unknown Source)
> at
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
> at java.lang.reflect.Method.invoke(Method.java:597)
> at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:508)
> at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:964)
> at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:960)
> at java.security.AccessController.doPrivileged(Native Method)
> at javax.security.auth.Subject.doAs(Subject.java:396)
> at org.apache.hadoop.ipc.Server$Handler.run(Server.java:958)
> Steps to reproduce the issue:
> 1) Bring up a 5 node cluster.
> 2) set mapred.max.tracker.failures to 1
> 3) Run a sleep command with 5 maps and 5 redcues.
> 4) Kill a task tracker, when map is 80% complete.
> 5) Kill the task tracker, by using "kill -9 process_id".
> 6) At the time of killing, it was running 2 maps and 2 reduces.
> 7) Allow 12 minutes to elapse for that task tracker to go out of job nodes
> list
> 8) Then job completes successfully by giving the task attempts of the lost
> trackers to other nodes.
> 9) The job tracker logs has this exception.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.