Can you paste some examples of the URL you think are being fetched.

It's hard to figure what we're looking for.
Also, what version of Nutch are you using. A possibility can be that it the
link(s) in question are gotten thru a redirect...which does not go thru the
URLfilter.
  

-----Original Message-----
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] 
Sent: Saturday, April 16, 2005 10:05 AM
To: [email protected]
Subject: UrlFilter Regex - Need Help?

Hi 

Can anyone assist me with why URL's are still being fetched which (i think)
match the following regex entries:?

-http:\/\/.*\/.*\/.*\/.*\/.*
[NEWLINE] (E-mail client may distort)
-.*\.\..*
[NEWLINE] (E-mail client may distort)
-http:\/\/.*\/.*(print|friend|email|emailto|register|signin|login|logon|sign
mein|menus|Print|Friend|Email|Emailto|Register|Signin|Login|Logon|Signmein|M
enus).*
[NEWLINE] (E-mail client may distort)

Can any1 please help me?

Thanks

_____________________________________________________________________
For super low premiums, click here http://www.dialdirect.co.za/quote


Reply via email to