Hi Vangelis,

In Nutch 2.x we use a partitioner to distribute URLs across reducers. In the reduce phase of GeneratorJob, each reducer takes only topN / (reduce count) URLs. We don't choose randomly by default, but we don't take the URLs with the highest score either.
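
To make the topN / (reduce count) arithmetic concrete, here is a minimal sketch in plain Java. It is not the actual Nutch source: the class and method names are made up for illustration, and partitioning by a hash of the host is an assumption on my part.

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative sketch only; names are hypothetical and not taken from Nutch.
public class GeneratorLimitSketch {

    // Per-reducer cap: the global topN split evenly across reducers (integer division).
    static long perReducerLimit(long topN, int numReduceTasks) {
        return topN / numReduceTasks;
    }

    // Assumed partitioning scheme: route a URL to a reducer by hashing its host.
    static int partition(String host, int numReduceTasks) {
        return (host.hashCode() & Integer.MAX_VALUE) % numReduceTasks;
    }

    // Each reducer keeps the first `limit` URLs it receives and drops the rest.
    static List<String> reduce(List<String> incoming, long limit) {
        List<String> selected = new ArrayList<>();
        for (String url : incoming) {
            if (selected.size() >= limit) {
                break;
            }
            selected.add(url);
        }
        return selected;
    }

    public static void main(String[] args) {
        long limit = perReducerLimit(10, 4);
        System.out.println("per-reducer limit = " + limit);
        System.out.println("reducer for example.org = " + partition("example.org", 4));
    }
}
```

Because of the integer division, the reducers together can select fewer than topN URLs in this sketch, and which URLs a given reducer keeps depends on how the partitioner spread them.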
Am I wrong, Sebastian?

Talat

On 22 May 2014 at 18:59, "Vangelis karv" <[email protected]> wrote:

> (Apache Nutch 2.2.1)
>
> Hi again!
> GeneratorJob marks the best topN sites for fetching. Does it choose URLs
> with the highest score or random URLs? If it chooses randomly, then what's
> the point of the score field?
> Thank you!

