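Hello,

In regex-urlfilter.txt (or crawl-urlfilter.txt if you use the crawl command), accept only that directory and reject everything else. The first matching rule decides, so the accept line has to come before the catch-all reject:

+^http:/intranet/development/pdffiles/
-.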
Make sure the urlfilter-regex plugin is included in the plugin.includes property in nutch-site.xml or nutch-default.xml.
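
If it helps to see the filter logic, here is a rough Python sketch of how I understand the rule file to be evaluated: rules are tried from top to bottom, the first pattern found in the URL decides, and a URL that matches nothing is dropped. This is not Nutch code; the function names parse_rules and accepts are made up for the illustration, and the URL is copied as written in the question.

```python
import re

# Rule file contents, in file order. "+" accepts, "-" rejects.
# Assumption: the accept rule is listed before the catch-all reject.
RULES_TEXT = """\
# accept only the PDF directory
+^http:/intranet/development/pdffiles/
# reject everything else
-.
"""


def parse_rules(text):
    """Turn rule lines into (accept, compiled_pattern) pairs (hypothetical helper)."""
    rules = []
    for line in text.splitlines():
        line = line.strip()
        # Skip blanks, comments, and anything without a +/- sign.
        if not line or line[0] not in "+-":
            continue
        rules.append((line[0] == "+", re.compile(line[1:])))
    return rules


def accepts(url, rules):
    """Return True if the first rule whose pattern is found in url is an accept rule."""
    for accept, pattern in rules:
        if pattern.search(url):
            return accept
    return False  # no rule matched: treat the URL as rejected


if __name__ == "__main__":
    rules = parse_rules(RULES_TEXT)
    for url in ("http:/intranet/development/pdffiles/manual.pdf",
                "http:/intranet/hr/index.html"):
        print(url, "->", "fetch" if accepts(url, rules) else "skip")
```

With the two lines swapped, the catch-all "-." would match first and every URL would be skipped, which is why the order matters in this sketch.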
Regards,
Ferenc

Clint Cagle wrote:
How do I enable nutch only to search one directory on an intranet? For example, http:/intranet/development/pdffiles/
