Hi, 

 

I am using Nutch-1.0 manly for crawling. 

 

I want to generate Segments with a fixed size eg. 1000 urls. But the
Segment should only contain uncrawled urls and urls which have been
waiting longest for recrawling.

 

Can anyone give me a hint where I should tackle the problem?

 

Thanks a lot

 

Tom

Reply via email to