Hi Vangelis,

In Nutch 2.x we use partitioner for distrubiting urls. in reduce of
generatorjob we take only topN/recude count urls. We don't choose random by
default but we don't take with highest score.

Am i wrong Sebastian ?
Talat
22 May 2014 18:59 tarihinde "Vangelis karv" <[email protected]> yazdı:

> (Apache Nutch 2.2.1)
>
> Hi again!
> GeneratorJob marks the best topN sites for fetching. Does it choose Urls
> with the highest score or random Urls? If it chooses randomly, then whats
> the point of the score field??
> Thank you!
>
>

Reply via email to