Hi,

Is there a way to tell nutch to ignore the navigation or footer parts of an 
html page during the crawl process?  Specifically I do not want the information 
in the navigation or footer to be indexed.  My environment is Windows 7 with 
Cygwin, Java 1.7, nutch 1.9 (binary not source) and solr 4.7.

Any assistance will be greatly appreciated.

Thanks,
Jackie

Reply via email to