Yes I have a similar problem. I just want to extract the links, count them (internal,external) and ad them with the indexer to solr. I didn't found a solution yet. Is here any advance on your site for this problem?
-- View this message in context: http://lucene.472066.n3.nabble.com/Changing-html-indexing-content-tp1917424p2968459.html Sent from the Nutch - User mailing list archive at Nabble.com.

