Thanks, Markus, for your reply. Could you help me out with some of the regex-filter patterns? What should the pattern be if we want to crawl, say, only the .txt or .avi files linked from a page such as http://nutch.apache.org/downloads.html?
Would this work?

+^http://([a-z0-9]*\.)*nutch.apache.org/downloads.html
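One way to check candidate rules before putting them into the filter file is a small stand-alone test. The sketch below is only an approximation: it assumes the rules are tried top to bottom, that the first rule whose regex is found anywhere in the URL decides (`+` accepts, `-` rejects), and that Python's `re` behaves like Java's regex engine for these simple patterns. The rule list, the `accepts` helper, and the sample file URLs are hypothetical. The rules escape the dots and add a `$` anchor, since the unescaped `.` in the pattern above matches any character and the pattern would also accept longer URLs that merely start with the downloads page. The downloads page itself is kept as an accepted URL on the assumption that it must be fetched before its links can be followed.

```python
import re

# Hypothetical rule list in regex-urlfilter.txt style: (sign, regex).
# "+" accepts a URL, "-" rejects it; the first matching rule decides.
RULES = [
    # Accept the downloads page itself so its outlinks can be discovered.
    ("+", r"^http://([a-z0-9]*\.)*nutch\.apache\.org/downloads\.html$"),
    # Accept .txt and .avi files on the same host (assumed location of the files).
    ("+", r"^http://([a-z0-9]*\.)*nutch\.apache\.org/.*\.(txt|avi)$"),
    # Reject everything else.
    ("-", r"."),
]


def accepts(url, rules=RULES):
    """Return True if the first rule whose regex is found in url has a '+' sign."""
    for sign, pattern in rules:
        if re.search(pattern, url):
            return sign == "+"
    # No rule matched at all: treat the URL as rejected.
    return False


if __name__ == "__main__":
    # Sample URLs are made up for illustration only.
    for url in [
        "http://nutch.apache.org/downloads.html",
        "http://nutch.apache.org/files/README.txt",
        "http://nutch.apache.org/about.html",
    ]:
        print(url, accepts(url))
```

If the linked files are served from a different host (for example a download mirror), the second rule would need to be widened to cover that host as well.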

