Unfortunately, the rule

-^http://my.domain.name/dir/$

didn't work for me. I need to skip just the documents in the directory, but this skips all of the subdirectories as well. Is there another solution, or possibly some way to go back and remove all the parent directories after the crawl?

Thanks for your help.

--
View this message in context: http://lucene.472066.n3.nabble.com/Prevent-crawl-of-parent-URL-tp4080032p4083287.html
Sent from the Nutch - User mailing list archive at Nabble.com.
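A quick sanity check of the pattern itself may help narrow this down. The sketch below uses Python's `re` module rather than Nutch, and assumes the filter applies the expression with "find" semantics; the helper `is_excluded` and the sample URLs are hypothetical, made up for illustration. It suggests the expression matches only the directory URL, so the subdirectories are probably being lost because the parent page is never fetched and its links are never discovered, not because the pattern matches them.

```python
import re

# The expression from the rule above, without the leading "-"
# (which only marks the rule as an exclusion).
PATTERN = re.compile(r"^http://my.domain.name/dir/$")


def is_excluded(url: str) -> bool:
    """Hypothetical helper: True if the exclude pattern matches the URL.

    Uses search(), assuming the filter looks for the pattern anywhere
    in the URL; the ^ and $ anchors pin it to the whole string here.
    """
    return PATTERN.search(url) is not None


# Sample URLs are assumptions, not taken from the actual crawl.
sample_urls = [
    "http://my.domain.name/dir/",
    "http://my.domain.name/dir/sub/",
    "http://my.domain.name/dir/sub/page.html",
]

for url in sample_urls:
    print(url, "excluded" if is_excluded(url) else "kept")
```

If that reading is right, the subdirectory URLs would pass the filter whenever they reach the crawl by some other route, for example as additional seed URLs.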

