ok so it won't crawl any links for the download.html location.
But How is this  +^http://([a-z0-9]*\.)*nutch.apache.org/ in regex-filter
crawl all the links on the homepage
whereas this +^http://([a-z0-9]*\.)*nutch.apache.org/downloads.html won't
crawl anything downloads.html page.I am sorry his could be a simple basic
question but I am really stuck at this.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Crawl-and-Index-specific-links-on-specific-page-tp4106524p4106586.html
Sent from the Nutch - User mailing list archive at Nabble.com.

Reply via email to