Hi,
I am using Nutch-1.0 manly for crawling. I want to generate Segments with a fixed size eg. 1000 urls. But the Segment should only contain uncrawled urls and urls which have been waiting longest for recrawling. Can anyone give me a hint where I should tackle the problem? Thanks a lot Tom