Unfortunately,

-^http://my.domain.name/dir/$

didn't work for me. I need to skip only the documents directly in the
directory, but this ends up skipping all the subdirectories as well. Is there
another solution, or possibly some way to go back and remove all the parent
directories after the crawl?
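
To make the requirement concrete, here is a minimal sketch of the matching
behaviour I am after. It uses Python's re module rather than Nutch itself, so
it only illustrates the regex and is not a tested regex-urlfilter.txt rule. The
pattern and the should_skip helper are hypothetical, and the example URLs are
made up around the placeholder domain above. It also says nothing about whether
the crawler can still discover the subdirectories once the pages linking to
them are filtered out.

```python
import re

# Hypothetical pattern (an assumption, not a verified Nutch rule):
# match the directory URL itself and anything directly inside it,
# i.e. no further "/" after "/dir/".
DIRECT_CHILD = re.compile(r"^http://my\.domain\.name/dir/[^/]*$")


def should_skip(url: str) -> bool:
    """Return True for URLs directly in /dir/, False for anything deeper."""
    return DIRECT_CHILD.match(url) is not None


if __name__ == "__main__":
    # Made-up example URLs for illustration only.
    for url in (
        "http://my.domain.name/dir/",
        "http://my.domain.name/dir/doc.html",
        "http://my.domain.name/dir/sub/",
        "http://my.domain.name/dir/sub/doc.html",
    ):
        print(url, "skip" if should_skip(url) else "keep")
```

If the same expression behaves this way under Nutch's regex filter, the
matching exclude line would presumably be the pattern prefixed with "-", as in
the rule above.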

Thanks for your help.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Prevent-crawl-of-parent-URL-tp4080032p4083287.html
Sent from the Nutch - User mailing list archive at Nabble.com.
