Thanks all! It is running again and seems to be doing a lot more.
On 7/26/05, Howie Wang <[EMAIL PROTECTED]> wrote: > I think Praveen is right. Another thing that you might have to > look out for is that most of the links on theserverside seem to > have query strings in them with a '?'. So you should move this line: > > +^http://([a-z0-9]*\.)*theserverside.com/ > > Before this line: > > # skip URLs containing certain characters as probable queries, etc. > [EMAIL PROTECTED] > > The regex's are evaluated in order so you're currently going to filter > out most of the articles now. > > > ------------------------------------------------------- SF.Net email is sponsored by: Discover Easy Linux Migration Strategies from IBM. Find simple to follow Roadmaps, straightforward articles, informative Webcasts and more! Get everything you need to get up to speed, fast. http://ads.osdn.com/?ad_idt77&alloc_id492&op=click _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
