RE: Prevent crawl of parent URL

stone2dbone Thu, 08 Aug 2013 06:10:30 -0700

Unfortunately,

-^http://my.domain.name/dir/$


didn't work for me. I need to skip just the documents in the directory, but
this skips all the subdirectories as well. Is there another solution, or
possibly some way to go back and remove all the parent directories after the
crawl?

Thanks for your help.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Prevent-crawl-of-parent-URL-tp4080032p4083287.html
Sent from the Nutch - User mailing list archive at Nabble.com.

RE: Prevent crawl of parent URL

Reply via email to