lewismc commented on code in PR #906:
URL: https://github.com/apache/nutch/pull/906#discussion_r2920120575
##########
src/java/org/apache/nutch/indexer/IndexingJob.java:
##########
@@ -143,6 +145,8 @@ public void index(Path crawlDb, Path linkDb, List<Path> segments,
+ RANDOM.nextInt());
FileOutputFormat.setOutputPath(job, tmp);
+ // Driver-level error tracking: categorization + LOG.error only (no job counters; see ErrorTracker Javadoc).
Review Comment:
I've put this comment in here for the time being. Ultimately I think it is
fine to track driver-level errors in memory (via errorTracker) even though they
are not written to the MapReduce counter(s). I've documented this behavior in
ErrorTracker.java.
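
To make the distinction concrete, here is a minimal sketch of driver-side tracking that categorizes an exception, logs it, and keeps an in-memory tally without touching any MapReduce counter. The class name `DriverErrorTally`, its `Category` enum, and its methods are hypothetical and are not the actual ErrorTracker API; `System.err` stands in for `LOG.error` so the snippet has no dependencies.

```java
import java.io.IOException;
import java.util.EnumMap;
import java.util.Map;

// Hypothetical illustration only; this is NOT the actual ErrorTracker API.
class DriverErrorTally {

  // Assumed categories, chosen for the example.
  enum Category { IO, INTERRUPTED, OTHER }

  // In-memory tally kept in the driver JVM; nothing is written to job counters.
  private final Map<Category, Long> counts = new EnumMap<>(Category.class);

  // Map an exception to a coarse category.
  static Category categorize(Throwable t) {
    if (t instanceof IOException) {
      return Category.IO;
    }
    if (t instanceof InterruptedException) {
      return Category.INTERRUPTED;
    }
    return Category.OTHER;
  }

  // Categorize, log, and count the error in memory.
  Category record(Throwable t) {
    Category category = categorize(t);
    counts.merge(category, 1L, Long::sum);
    // Stand-in for LOG.error(...) in the real driver code.
    System.err.println("Driver-level error [" + category + "]: " + t);
    return category;
  }

  // Number of errors recorded so far for the given category.
  long count(Category category) {
    return counts.getOrDefault(category, 0L);
  }
}
```

A driver could call `record(e)` from its catch block around job submission. The tally lives only for the lifetime of the driver process, which is the trade-off described above.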