RE: Crawl and Index specific links on specific page

anish_88 Fri, 13 Dec 2013 04:35:09 -0800

ok so it won't crawl any links for the download.html location.
But How is this  +^http://([a-z0-9]*\.)*nutch.apache.org/ in regex-filter
crawl all the links on the homepage
whereas this +^http://([a-z0-9]*\.)*nutch.apache.org/downloads.html won't
crawl anything downloads.html page.I am sorry his could be a simple basic
question but I am really stuck at this.




--
View this message in context: 
http://lucene.472066.n3.nabble.com/Crawl-and-Index-specific-links-on-specific-page-tp4106524p4106586.html
Sent from the Nutch - User mailing list archive at Nabble.com.

RE: Crawl and Index specific links on specific page

Reply via email to