[ 
https://issues.apache.org/jira/browse/NUTCH-2867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17446670#comment-17446670
 ] 

Sebastian Nagel commented on NUTCH-2867:
----------------------------------------

Hi [~markus17], great! I successfully tested the new 
FetchOverdueCrawlDatumProcessor. I've tried to clean up the code a bit:
- Are you OK with the property description and the code formatting in 
[ad44f55|https://github.com/sebastian-nagel/nutch/commit/ad44f55c494467bda43e6babf48ced24ba977072]?
- What about renaming the new aggregator interface to 
"CrawlDatumProcessor"? That strips "Abstract", since it isn't an abstract 
class, and the shorter name sounds better 
([9909a61|https://github.com/sebastian-nagel/nutch/commit/9909a61e35a645eebee642eef463d27db568d96d]).

> Support for custom HostDb aggregators
> -------------------------------------
>
>                 Key: NUTCH-2867
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2867
>             Project: Nutch
>          Issue Type: Improvement
>          Components: hostdb
>            Reporter: Markus Jelsma
>            Assignee: Markus Jelsma
>            Priority: Major
>             Fix For: 1.19
>
>         Attachments: NUTCH-2867-1.patch, NUTCH-2867.patch
>
>
> HostDB needs support for custom per-host statistic aggregators. This gives 
> users a simple tool to calculate their own statistics just by implementing a 
> simple interface and configuring that class as a 
> hostdb.crawldatum.processor.
>  
>  
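
To illustrate the idea in the description above, here is a minimal sketch of a per-host aggregator that counts URLs whose scheduled fetch is overdue. It is not the code from the patch: UrlRecord and HostRecord are hypothetical stand-ins for Nutch's CrawlDatum and HostDatum, the method names count/finish and the metadata key "num_overdue" are assumptions, and the interface here only borrows the proposed name "CrawlDatumProcessor".

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for one CrawlDb entry; NOT Nutch's CrawlDatum.
final class UrlRecord {
    final long fetchTime; // next scheduled fetch time, epoch millis

    UrlRecord(long fetchTime) {
        this.fetchTime = fetchTime;
    }
}

// Hypothetical stand-in for a per-host record; NOT Nutch's HostDatum.
final class HostRecord {
    final Map<String, String> metadata = new HashMap<>();
}

// Assumed shape of the aggregator contract: see every URL of a host,
// then write the aggregated statistic into the host record.
interface CrawlDatumProcessor {
    void count(UrlRecord record);

    void finish(HostRecord host);
}

// Counts URLs whose scheduled fetch lies more than overdueMillis in the past.
final class FetchOverdueProcessor implements CrawlDatumProcessor {
    private final long now;
    private final long overdueMillis;
    private long overdue = 0;

    FetchOverdueProcessor(long now, long overdueMillis) {
        this.now = now;
        this.overdueMillis = overdueMillis;
    }

    @Override
    public void count(UrlRecord record) {
        if (now - record.fetchTime > overdueMillis) {
            overdue++;
        }
    }

    @Override
    public void finish(HostRecord host) {
        // "num_overdue" is an illustrative key, not a Nutch constant.
        host.metadata.put("num_overdue", Long.toString(overdue));
    }
}

final class ProcessorSketch {
    public static void main(String[] args) {
        CrawlDatumProcessor processor = new FetchOverdueProcessor(10_000L, 2_000L);
        processor.count(new UrlRecord(1_000L)); // 9000 ms late: overdue
        processor.count(new UrlRecord(9_000L)); // 1000 ms late: within threshold
        processor.count(new UrlRecord(7_999L)); // 2001 ms late: overdue

        HostRecord host = new HostRecord();
        processor.finish(host);
        System.out.println("overdue=" + host.metadata.get("num_overdue")); // prints overdue=2
    }
}
```

In an actual setup the implementing class would presumably be registered through the hostdb.crawldatum.processor property named in the description; the exact property name and value format should be checked against the committed nutch-default.xml.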



--
This message was sent by Atlassian Jira
(v8.20.1#820001)
