Can you paste some examples of the URLs you think are being fetched? It's hard to figure out what we're looking for. Also, what version of Nutch are you using? One possibility is that the link(s) in question are reached through a redirect, which does not go through the URLFilter.
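In the meantime, one way to check whether a given URL really matches the entries quoted below is to run them outside the crawler. The following is only a rough sketch, not Nutch's own code: it uses Python's `re` module as a stand-in for the filter's regex engine, and it assumes the rules are tried in order, that the first matching rule decides, that a match anywhere in the URL counts, and that a URL matching no rule is rejected. The trailing `+.` catch-all rule, the helper names `parse_rules` and `accepts`, and the example.com URLs are all made up for illustration. The "sign mein" and "M enus" line-wrap damage is repaired in the third pattern.

```python
import re

# Rules as quoted in the original message ("-" = reject, "+" = accept).
# The final "+." catch-all is an assumption added for this sketch; it is
# not part of the quoted configuration.
RULES = r"""
-http:\/\/.*\/.*\/.*\/.*\/.*
-.*\.\..*
-http:\/\/.*\/.*(print|friend|email|emailto|register|signin|login|logon|signmein|menus|Print|Friend|Email|Emailto|Register|Signin|Login|Logon|Signmein|Menus).*
+.
"""


def parse_rules(text):
    """Turn '+regex' / '-regex' lines into (accept, compiled_pattern) pairs."""
    rules = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        sign, pattern = line[0], line[1:]
        if sign not in "+-":
            raise ValueError("rule must start with '+' or '-': " + line)
        rules.append((sign == "+", re.compile(pattern)))
    return rules


def accepts(rules, url):
    """Return the verdict of the first rule that matches anywhere in the URL.

    Assumption of this sketch: a URL that matches no rule is rejected.
    """
    for accept, pattern in rules:
        if pattern.search(url):
            return accept
    return False


if __name__ == "__main__":
    rules = parse_rules(RULES)
    # Hypothetical URLs, purely for illustration.
    for url in [
        "http://example.com/a/b/page.html",
        "http://example.com/a/b/c/d/page.html",
        "http://example.com/a/../b.html",
        "http://example.com/news/print.html",
        "https://example.com/a/b/c/d/e/login",
    ]:
        print("accept" if accepts(rules, url) else "reject", url)
```

Under these assumptions the last URL is accepted even though it is deep and contains "login", because the first and third patterns start with a literal `http://` and so never match an `https://` URL. If the URLs you are seeing still match when tested this way, that would point toward something like the redirect case rather than the patterns themselves.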
-----Original Message-----
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]]
Sent: Saturday, April 16, 2005 10:05 AM
To: [email protected]
Subject: UrlFilter Regex - Need Help?

Hi

Can anyone assist me with why URLs are still being fetched which (I think) match the following regex entries?

-http:\/\/.*\/.*\/.*\/.*\/.* [NEWLINE] (E-mail client may distort)
-.*\.\..* [NEWLINE] (E-mail client may distort)
-http:\/\/.*\/.*(print|friend|email|emailto|register|signin|login|logon|signmein|menus|Print|Friend|Email|Emailto|Register|Signin|Login|Logon|Signmein|Menus).* [NEWLINE] (E-mail client may distort)

Can anyone please help me?

Thanks
