Lewis John McGibbney created NUTCH-3142:
-------------------------------------------

             Summary: Add Error Context to Metrics
                 Key: NUTCH-3142
                 URL: https://issues.apache.org/jira/browse/NUTCH-3142
             Project: Nutch
          Issue Type: Sub-task
          Components: metrics
            Reporter: Lewis John McGibbney
            Assignee: Lewis John McGibbney
             Fix For: 1.22


Current error metrics lack granularity. While protocol status codes are tracked 
dynamically, there's no categorization of:
 * HTTP error codes (4xx vs 5xx)
 * Exception types (timeout, connection refused, DNS failure)
 * Parse failure reasons

This makes it difficult to diagnose crawl issues from metrics alone e.g. 
necessitating the interrogation of logs, adding complexity to troubleshooting. 

This ticket will add new error context metrics for FetcherThread, ParseSegment 
and IndexerMapReduce. 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to