Thanks Markus for your reply.

Can you help me out with some of the regex-filter patterns.
What can be the pattern if we want to crawl say .txt or .avi file on page
say http://nutch.apache.org/downloads.html

Is this work  +^http://([a-z0-9]*\.)*nutch.apache.org/downloads.html  



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Crawl-and-Index-specific-links-on-specific-page-tp4106524p4106581.html
Sent from the Nutch - User mailing list archive at Nabble.com.

Reply via email to