Nutch uses regular-expression-based url filtering by default. These are specified in a config file.

Doug

GWW wrote:
Hello
Can I decide about what kind urls I want to fetch ?
e.g I don't want to fetch urls inluding some string.
How can I aim to it ?
regards Ryboslaw


-------------------------------------------------------
This SF.Net email is sponsored by OSTG. Have you noticed the changes on
Linux.com, ITManagersJournal and NewsForge in the past few weeks? Now,
one more big change to announce. We are now OSTG- Open Source Technology
Group. Come see the changes on the new OSTG site. www.ostg.com
_______________________________________________
Nutch-general mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to