Hello,

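In regex-urlfilter.txt (or crawl-urlfilter.txt if you use the crawl command), put the accept rule before the catch-all reject rule, because the first matching pattern decides:
+^http://intranet/development/pdffiles/
-.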

Make sure the urlfilter-regex plugin is included in the plugin.includes property in nutch-site.xml or in nutch-default.xml.
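
If you want to sanity-check a rule list before crawling, here is a rough Python sketch of how I understand the filter to evaluate it. It is not Nutch code. It assumes first-match-wins semantics (as described in the comments of the default regex-urlfilter.txt), unanchored matching, and rejection when nothing matches. The names load_rules and accepts, and the two test URLs, are made up for illustration.

```python
import re


def load_rules(text):
    """Parse '+regex' / '-regex' lines, skipping blank lines and '#' comments."""
    rules = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        sign, pattern = line[0], line[1:]
        if sign not in "+-":
            raise ValueError("rule must start with '+' or '-': " + line)
        rules.append((sign == "+", re.compile(pattern)))
    return rules


def accepts(rules, url):
    """Return the verdict of the first rule whose pattern is found in the URL."""
    for allow, regex in rules:
        if regex.search(url):  # assumption: unanchored search, like Java's find()
            return allow
    return False  # assumption: a URL that matches no rule is ignored


# Accept rule first, catch-all reject rule last.
RULES = load_rules("""
+^http://intranet/development/pdffiles/
-.
""")

print(accepts(RULES, "http://intranet/development/pdffiles/report.pdf"))
print(accepts(RULES, "http://intranet/hr/index.html"))
```

With the two rules in the opposite order, the catch-all "-." matches every URL first, so nothing would be accepted.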

Regards,
Ferenc

Clint Cagle wrote:

How do I configure Nutch to search only one directory on an intranet?

For example,
http://intranet/development/pdffiles/


