[ 
https://issues.apache.org/jira/browse/NUTCH-770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783283#action_12783283
 ] 

Andrzej Bialecki  commented on NUTCH-770:
-----------------------------------------

I propose to change the name of this functionality - "timebomb" is not 
self-explanatory, and it suggests that if you misbehave then your cluster may 
explode ;) Instead I would use "time limit", rename all vars and methods to 
follow this naming, and document it properly in nutch-default.xml.

A few comments to the patch:

* it has some overlap with NUTCH-769 (the emptyQueue() method), but that's easy 
to resolve, see also the next point.

* why change the code in FetchQueues at all? Time limit is a global condition, 
we could just break the main loop in run() and ignore the QueueFeeder (or don't 
start it if the time limit already passed when starting run() ).

* the patch does not follow the code style (notably whitespace in for/while 
loops and assignments).

> Timebomb for Fetcher
> --------------------
>
>                 Key: NUTCH-770
>                 URL: https://issues.apache.org/jira/browse/NUTCH-770
>             Project: Nutch
>          Issue Type: Improvement
>            Reporter: Julien Nioche
>         Attachments: log-770, NUTCH-770.patch
>
>
> This patch provides the Fetcher with a timebomb mechanism. By default the 
> timebomb is not activated; it can be set using the parameter 
> fetcher.timebomb.mins. The number of minutes is relative to the start of the 
> Fetch job. When the number of minutes is reached, the QueueFeeder skips all 
> remaining entries then all active queues are purged. This allows to keep the 
> Fetch step under comtrol and works well in combination with NUTCH-769

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to