Hi,

I am using Nutch 1.10, and our requirement is to exclude the footer div from 
the indexed content. I applied the patch linked below; it works and removes the 
footer div from the content before parsing.
However, we also want to discover the links inside the footer div. In short, we 
do not want to index the footer text, but we do want to crawl the links in the 
footer section.
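
To make the requirement concrete, here is a rough sketch of the behaviour we are after. It uses only the JDK's DOM classes, not Nutch's parser API, and it assumes well-formed XHTML and a footer marked as `<div id="footer">`; the class and method names (`FooterSketch`, `collectLinks`, `textWithoutFooter`) are made up for illustration. The idea is simply to collect links from the full DOM first and strip the footer only afterwards, before extracting text.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

class FooterSketch {

    // Parse a well-formed XHTML string into a DOM document (assumption: real
    // pages would need a lenient HTML parser instead).
    public static Document parse(String xhtml) throws Exception {
        return DocumentBuilderFactory.newInstance()
                .newDocumentBuilder()
                .parse(new InputSource(new StringReader(xhtml)));
    }

    // Step 1: gather every href while the footer is still in the DOM.
    public static List<String> collectLinks(Document doc) {
        List<String> links = new ArrayList<>();
        NodeList anchors = doc.getElementsByTagName("a");
        for (int i = 0; i < anchors.getLength(); i++) {
            String href = ((Element) anchors.item(i)).getAttribute("href");
            if (!href.isEmpty()) {
                links.add(href);
            }
        }
        return links;
    }

    // Step 2: remove <div id="footer"> and return the remaining text.
    public static String textWithoutFooter(Document doc) {
        NodeList divs = doc.getElementsByTagName("div");
        // The NodeList is live, so walk it backwards while removing nodes.
        for (int i = divs.getLength() - 1; i >= 0; i--) {
            Element div = (Element) divs.item(i);
            if ("footer".equals(div.getAttribute("id"))) {
                div.getParentNode().removeChild(div);
            }
        }
        return doc.getDocumentElement().getTextContent().trim();
    }

    public static void main(String[] args) throws Exception {
        String page = "<html><body><p>Main text</p>"
                + "<div id=\"footer\"><a href=\"http://example.com/about\">About</a></div>"
                + "</body></html>";
        Document doc = parse(page);
        List<String> links = collectLinks(doc);   // links taken from the full page
        String text = textWithoutFooter(doc);     // text taken after stripping the footer
        System.out.println("links: " + links);
        System.out.println("text: " + text);
    }
}
```

With the patch as applied, the footer seems to be removed before both steps, so the footer links are never seen.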

https://issues.apache.org/jira/secure/attachment/12467198/nutch-585-jostens-excludeDIVs.patch
 

Please suggest how this can be achieved.

Thanks
