[
https://issues.apache.org/jira/browse/NUTCH-2867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17447485#comment-17447485
]
Hudson commented on NUTCH-2867:
-------------------------------
SUCCESS: Integrated in Jenkins build Nutch » Nutch-trunk #51 (See
[https://ci-builds.apache.org/job/Nutch/job/Nutch-trunk/51/])
NUTCH-2867 Support for custom HostDb aggregators (snagel:
[https://github.com/apache/nutch/commit/1e7eb52f1070a1a66bfc6a1bc84731d6653fba84])
* (edit) src/java/org/apache/nutch/hostdb/UpdateHostDb.java
* (add) src/java/org/apache/nutch/hostdb/FetchOverdueCrawlDatumProcessor.java
* (edit) src/java/org/apache/nutch/hostdb/UpdateHostDbReducer.java
* (add) src/java/org/apache/nutch/hostdb/AbstractCrawlDatumProcessor.java
NUTCH-2867 Support for custom HostDb aggregators (snagel:
[https://github.com/apache/nutch/commit/ad44f55c494467bda43e6babf48ced24ba977072])
* (edit) src/java/org/apache/nutch/hostdb/UpdateHostDbReducer.java
* (edit) conf/nutch-default.xml
* (edit) src/java/org/apache/nutch/hostdb/FetchOverdueCrawlDatumProcessor.java
* (edit) src/java/org/apache/nutch/hostdb/AbstractCrawlDatumProcessor.java
NUTCH-2867 Support for custom HostDb aggregators (snagel:
[https://github.com/apache/nutch/commit/9909a61e35a645eebee642eef463d27db568d96d])
* (add) src/java/org/apache/nutch/hostdb/CrawlDatumProcessor.java
* (delete) src/java/org/apache/nutch/hostdb/AbstractCrawlDatumProcessor.java
* (edit) src/java/org/apache/nutch/hostdb/UpdateHostDbReducer.java
* (edit) src/java/org/apache/nutch/hostdb/FetchOverdueCrawlDatumProcessor.java
NUTCH-2867 Support for custom HostDb aggregators (snagel:
[https://github.com/apache/nutch/commit/5f6f62754624526d63db886fd6503e4e6e4849d5])
* (edit) conf/nutch-default.xml
NUTCH-2867 Support for custom HostDb aggregators (snagel:
[https://github.com/apache/nutch/commit/ebf3036ab51bf97f50207691fd9df1d3a77563b3])
* (edit) src/java/org/apache/nutch/hostdb/CrawlDatumProcessor.java
> Support for custom HostDb aggregators
> -------------------------------------
>
> Key: NUTCH-2867
> URL: https://issues.apache.org/jira/browse/NUTCH-2867
> Project: Nutch
> Issue Type: Improvement
> Components: hostdb
> Reporter: Markus Jelsma
> Assignee: Markus Jelsma
> Priority: Major
> Fix For: 1.19
>
> Attachments: NUTCH-2867-1.patch, NUTCH-2867.patch
>
>
> HostDB needs support for custom per-host statistic aggregators. This gives
> users a simple tool to calculate their own statistics just by implementing a
> simple interface, and configurating that class as a
> hostdb.crawldatum.processor.
>
>
--
This message was sent by Atlassian Jira
(v8.20.1#820001)