Lewis John McGibbney created NUTCH-3142:
-------------------------------------------
Summary: Add Error Context to Metrics
Key: NUTCH-3142
URL: https://issues.apache.org/jira/browse/NUTCH-3142
Project: Nutch
Issue Type: Sub-task
Components: metrics
Reporter: Lewis John McGibbney
Assignee: Lewis John McGibbney
Fix For: 1.22
Current error metrics lack granularity. While protocol status codes are tracked
dynamically, there's no categorization of:
* HTTP error codes (4xx vs 5xx)
* Exception types (timeout, connection refused, DNS failure)
* Parse failure reasons
This makes it difficult to diagnose crawl issues from metrics alone e.g.
necessitating the interrogation of logs, adding complexity to troubleshooting.
This ticket will add new error context metrics for FetcherThread, ParseSegment
and IndexerMapReduce.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)