This may be a dumb question, but can you accomplish the same thing by placing the following code in mapred-site.xml. Or did I misunderstand the fix...
<property> <name>mapred.skip.attempts.to.start.skipping</name> <value>2</value> <!-- default: 2 --> <description> The number of Task attempts AFTER which skip mode will be kicked off. When skip mode is kicked off, the tasks reports the range of records which it will process next, to the TaskTracker. So that on failures, TT knows which ones are possibly the bad records. On further executions, those are skipped. </description> </property> <property> <name>mapred.skip.map.max.skip.records</name> <value>1</value> <!-- default: 0 --> <description> The number of acceptable skip records surrounding the bad record PER bad record in mapper. The number includes the bad record as well. To turn the feature of detection/skipping of bad records off, set the value to 0. The framework tries to narrow down the skipped range by retrying until this threshold is met OR all attempts get exhausted for this task. Set the value to Long.MAX_VALUE to indicate that framework need not try to narrow down. Whatever records(depends on application) get skipped are acceptable. </description> </property> Brad