[
https://issues.apache.org/jira/browse/NUTCH-3142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18056785#comment-18056785
]
Hudson commented on NUTCH-3142:
-------------------------------
SUCCESS: Integrated in Jenkins build Nutch » Nutch-trunk #218 (See
[https://ci-builds.apache.org/job/Nutch/job/Nutch-trunk/218/])
NUTCH-3142 Add Error Context to Metrics (#882) (github:
[https://github.com/apache/nutch/commit/1242e22ba44ac743de76c39a35270427c2fee1e4])
* (edit) src/java/org/apache/nutch/crawl/Injector.java
* (edit) src/java/org/apache/nutch/crawl/Generator.java
* (edit) src/java/org/apache/nutch/fetcher/FetcherThread.java
* (edit) ivy/ivy.xml
* (edit) src/java/org/apache/nutch/hostdb/ResolverThread.java
* (edit) src/java/org/apache/nutch/hostdb/UpdateHostDbMapper.java
* (add) src/java/org/apache/nutch/metrics/ErrorTracker.java
* (edit) src/java/org/apache/nutch/crawl/CrawlDbReducer.java
* (edit) src/java/org/apache/nutch/indexer/IndexerMapReduce.java
* (edit) src/java/org/apache/nutch/util/SitemapProcessor.java
* (add) src/test/org/apache/nutch/metrics/TestErrorTracker.java
* (edit) src/java/org/apache/nutch/parse/ParseSegment.java
* (edit) src/java/org/apache/nutch/tools/warc/WARCExporter.java
* (edit) src/java/org/apache/nutch/metrics/NutchMetrics.java
> Add Error Context to Metrics
> ----------------------------
>
> Key: NUTCH-3142
> URL: https://issues.apache.org/jira/browse/NUTCH-3142
> Project: Nutch
> Issue Type: Sub-task
> Components: metrics
> Reporter: Lewis John McGibbney
> Assignee: Lewis John McGibbney
> Priority: Major
> Fix For: 1.22
>
>
> Current error metrics lack granularity. While protocol status codes are
> tracked dynamically, there's no categorization of:
> * HTTP error codes (4xx vs 5xx)
> * Exception types (timeout, connection refused, DNS failure)
> * Parse failure reasons
> This makes it difficult to diagnose crawl issues from metrics alone e.g.
> necessitating the interrogation of logs, adding complexity to
> troubleshooting.
> This ticket will add new error context metrics for FetcherThread,
> ParseSegment and IndexerMapReduce.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)