[
https://issues.apache.org/jira/browse/NUTCH-2867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17446670#comment-17446670
]
Sebastian Nagel commented on NUTCH-2867:
----------------------------------------
Hi [~markus17], great! Successfully tested the new
FetchOverdueCrawlDatumProcessor! I've tried to clean up the code somewhat:
- are you ok with the description of the property and code formatting in
[ad44f55|https://github.com/sebastian-nagel/nutch/commit/ad44f55c494467bda43e6babf48ced24ba977072]?
- and what about renaming the new aggregator interface into
"CrawlDatumProcessor", stripping "Abstract" as it isn't an abstract class and
the shorter name sounds better
([9909a61|https://github.com/sebastian-nagel/nutch/commit/9909a61e35a645eebee642eef463d27db568d96d])?
> Support for custom HostDb aggregators
> -------------------------------------
>
> Key: NUTCH-2867
> URL: https://issues.apache.org/jira/browse/NUTCH-2867
> Project: Nutch
> Issue Type: Improvement
> Components: hostdb
> Reporter: Markus Jelsma
> Assignee: Markus Jelsma
> Priority: Major
> Fix For: 1.19
>
> Attachments: NUTCH-2867-1.patch, NUTCH-2867.patch
>
>
> HostDB needs support for custom per-host statistic aggregators. This gives
> users a simple tool to calculate their own statistics just by implementing a
> simple interface, and configurating that class as a
> hostdb.crawldatum.processor.
>
>
--
This message was sent by Atlassian Jira
(v8.20.1#820001)