Hi Vangelis,

> Does it choose Urls with the highest score
Yes, it does. Have a look at generatorSortValue(...) in one the scoring filter 
plugins.
In case of scoring-opic (activated per default), URLs/docs are simply ranked by 
score
taken from CrawlDb. But other scoring filters may use different strategies to 
rank
and select URLs for fetching. And of course, you are able to adapt it to your 
own needs
by writing a new scoring filter. Finally, scoring filters can be combined by 
chaining:
the initSort parameter is the value returned by the preceding scoring filter.

Sebastian

On 05/22/2014 05:59 PM, Vangelis karv wrote:
> (Apache Nutch 2.2.1)
> 
> Hi again!
> GeneratorJob marks the best topN sites for fetching. Does it choose Urls with 
> the highest score or random Urls? If it chooses randomly, then whats the 
> point of the score field?? 
> Thank you!
> 
>                                         
> 

Reply via email to