Sorry, I think the line '-.' isn't gooo suggestion. Please remove it...
[EMAIL PROTECTED] wrotte:
Hello,
In regex-urlfilter.txt (or crawl-urlfiltter.txt if you crawl).
-.
+^http:/intranet/development/pdffiles/
Make sure, the urlfilter-regex plugin is incuded in nutch-site.xml or
in nutch-default.xml.
Regards,
Ferenc
Clint Cagle wrotte:
How do I enable nutch only to search one directory on an intranet?
For example,
http:/intranet/development/pdffiles/
-------------------------------------------------------
SF.Net email is sponsored by: Discover Easy Linux Migration Strategies
from IBM. Find simple to follow Roadmaps, straightforward articles,
informative Webcasts and more! Get everything you need to get up to
speed, fast. http://ads.osdn.com/?ad_id=7477&alloc_id=16492&op=click
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general