[
https://issues.apache.org/jira/browse/NUTCH-3155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Work on NUTCH-3155 stopped by Lewis John McGibbney.
---------------------------------------------------
> Add ErrorTracker to remaining MapReduce jobs missing error metrics
> ------------------------------------------------------------------
>
> Key: NUTCH-3155
> URL: https://issues.apache.org/jira/browse/NUTCH-3155
> Project: Nutch
> Issue Type: Sub-task
> Components: metrics
> Affects Versions: 1.22
> Reporter: Lewis John McGibbney
> Assignee: Lewis John McGibbney
> Priority: Major
> Fix For: 1.23
>
>
> A codebase-wide sweep of all MapReduce Mapper and Reducer classes identified
> six jobs that catch exceptions during processing but do not use ErrorTracker
> to emit categorized errors_* Hadoop counters. Errors in these jobs are logged
> but invisible in job metrics.
> Affected classes:
> * LinkDb.LinkDbMapper
> * LinkDbFilter
> * CrawlDbReader.CrawlDbStatMapper
> * SegmentMerger.SegmentMergerMapper
> * ReadHostDb.ReadHostDbMapper
> * UpdateHostDbReducer
> The ticket is scoped to adding an ErrorTracker (created with the cached
> constructor for consistent counter registration) to each class and calling
> it from the existing catch blocks.
> This will also require new counter group constants in NutchMetrics.java.
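For illustration only, the pattern described above (record each caught exception under a categorized errors_* counter instead of only logging it) could be sketched as follows. This is not the Nutch ErrorTracker API: SketchErrorTracker, SketchMapper, the group name "sketch_linkdb" and the category names are all hypothetical, and a plain in-memory map stands in for Hadoop counters. Pre-registering every category at zero in the constructor is one guess at what "consistent counter registration" might look like.

```java
import java.io.IOException;
import java.net.URI;
import java.net.URISyntaxException;
import java.util.Map;
import java.util.TreeMap;

/** Hypothetical stand-in for an error tracker; NOT the Nutch ErrorTracker API. */
class SketchErrorTracker {
  private final String group; // assumed counter group name
  private final Map<String, Long> counters = new TreeMap<>();

  SketchErrorTracker(String group) {
    this.group = group;
    // Register every category up front so each counter is reported even at zero.
    for (String name : new String[] {
        "errors_total", "errors_url", "errors_io", "errors_other"}) {
      counters.put(name, 0L);
    }
  }

  /** Maps an exception type to a hypothetical errors_* counter name. */
  static String categorize(Throwable t) {
    if (t instanceof URISyntaxException) return "errors_url";
    if (t instanceof IOException) return "errors_io";
    return "errors_other";
  }

  /** Increments the total and the category-specific counter. */
  void record(Throwable t) {
    counters.merge("errors_total", 1L, Long::sum);
    counters.merge(categorize(t), 1L, Long::sum);
  }

  long get(String name) {
    return counters.getOrDefault(name, 0L);
  }

  String group() {
    return group;
  }
}

/** Hypothetical mapper-like class showing the tracker used in a catch block. */
class SketchMapper {
  final SketchErrorTracker errors = new SketchErrorTracker("sketch_linkdb");

  /** Returns the host of the given URL, or null if it cannot be parsed. */
  String map(String url) {
    try {
      return new URI(url).getHost();
    } catch (URISyntaxException e) {
      // Previously this would only be logged; now it is also counted.
      errors.record(e);
      return null;
    }
  }
}
```

In an actual job the counters would presumably be Hadoop counters obtained from the task context rather than a local map, with the group name taken from a constant in NutchMetrics.java.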
--
This message was sent by Atlassian Jira
(v8.20.10#820010)