How can I filter certain pages like Privacy Policies, Terms and conditions etc from crawling, because all these pages contains bogus information. I am new to nutch. Please let me know about this.
Thanks in Advance. -- View this message in context: http://old.nabble.com/Filtering-Pages-while-crawling-tp26395359p26395359.html Sent from the Nutch - Dev mailing list archive at Nabble.com.