[ 
https://issues.apache.org/jira/browse/NUTCH-3155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on NUTCH-3155 stopped by Lewis John McGibbney.
---------------------------------------------------
> Add ErrorTracker to remaining MapReduce jobs missing error metrics
> ------------------------------------------------------------------
>
>                 Key: NUTCH-3155
>                 URL: https://issues.apache.org/jira/browse/NUTCH-3155
>             Project: Nutch
>          Issue Type: Sub-task
>          Components: metrics
>    Affects Versions: 1.22
>            Reporter: Lewis John McGibbney
>            Assignee: Lewis John McGibbney
>            Priority: Major
>             Fix For: 1.23
>
>
> A codebase-wide sweep of all MapReduce Mapper and Reducer classes identified 
> six jobs that catch exceptions during processing but do not use ErrorTracker 
> to emit categorized errors_* Hadoop counters. Errors in these jobs are logged 
> but invisible in job metrics.
> {*}Affected classes{*}:
>  * LinkDb.LinkDbMapper
>  * LinkDbFilter
>  * CrawlDbReader.CrawlDbStatMapper
>  * SegmentMerger.SegmentMergerMapper
>  * ReadHostDb.ReadHostDbMapper, and 
>  * UpdateHostDbReducer.
> The ticket is scoped to add ErrorTracker (with the cached constructor for 
> consistent counter registration) to each class and implement it within 
> existing catch blocks.
> This will also require new counter group constants in NutchMetrics.java.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to