Thanks all!

It is running again and seems to be doing a lot more.  

On 7/26/05, Howie Wang <[EMAIL PROTECTED]> wrote:
> I think Praveen is right. Another thing that you might have to
> look out for is that most of the links on theserverside seem to
> have query strings in them with a '?'. So you should move this line:
> 
> +^http://([a-z0-9]*\.)*theserverside.com/
> 
> Before this line:
> 
> # skip URLs containing certain characters as probable queries, etc.
> [EMAIL PROTECTED]
> 
> The regex's are evaluated in order so you're currently going to filter
> out most of the articles now.
> 
> 
>


-------------------------------------------------------
SF.Net email is sponsored by: Discover Easy Linux Migration Strategies
from IBM. Find simple to follow Roadmaps, straightforward articles,
informative Webcasts and more! Get everything you need to get up to
speed, fast. http://ads.osdn.com/?ad_idt77&alloc_id492&op=click
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to