ok so it won't crawl any links for the download.html location. But How is this +^http://([a-z0-9]*\.)*nutch.apache.org/ in regex-filter crawl all the links on the homepage whereas this +^http://([a-z0-9]*\.)*nutch.apache.org/downloads.html won't crawl anything downloads.html page.I am sorry his could be a simple basic question but I am really stuck at this.
-- View this message in context: http://lucene.472066.n3.nabble.com/Crawl-and-Index-specific-links-on-specific-page-tp4106524p4106586.html Sent from the Nutch - User mailing list archive at Nabble.com.

