Hi nutch dev, After fetching about 100 mio of pages I see many search engine spammers that use an hidden div tag (negative position) to include many urls that user don't see whe acces the site page. This links alter the boost (by inlink count) so I want to skip this urls. How can I do that?
Thanks, Massimo