As a start, I'm able to crawl websites and index the entire content to Solr.

But, I want to index only specific content between certain HTML tags instead
of the whole page.
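
To illustrate the kind of result I'm after (not how Nutch itself does it), here
is a rough standalone Python sketch. The class name, the `article` tag, and the
sample HTML are all made up for illustration; inside Nutch the equivalent logic
would presumably live in a plugin rather than a script.

```python
from html.parser import HTMLParser


class TagTextExtractor(HTMLParser):
    """Hypothetical helper: collect text found inside one chosen tag."""

    def __init__(self, target_tag):
        super().__init__()
        self.target_tag = target_tag
        self.depth = 0    # number of target tags currently open
        self.chunks = []  # text fragments seen while inside the target tag

    def handle_starttag(self, tag, attrs):
        if tag == self.target_tag:
            self.depth += 1

    def handle_endtag(self, tag):
        if tag == self.target_tag and self.depth > 0:
            self.depth -= 1

    def handle_data(self, data):
        # Keep non-empty text only while inside the target tag.
        if self.depth > 0 and data.strip():
            self.chunks.append(data.strip())


def extract_text(html, target_tag):
    """Return the text inside every <target_tag> element, joined by spaces."""
    parser = TagTextExtractor(target_tag)
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)


# Made-up page: only the text inside <article> should end up in the index.
sample = (
    "<html><body><div>menu</div>"
    "<article><p>Keep this.</p></article>"
    "</body></html>"
)
print(extract_text(sample, "article"))
```

In other words, the navigation text ("menu") should be dropped and only the
text inside the chosen tag should be sent to Solr.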

So, what should I use to achieve this, a parser or a filter, and how?

I searched the mailing list archives and a lot of blogs but couldn't find
a suitable way of doing so.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/HTML-tag-filtering-or-parsing-tp4156126.html
Sent from the Nutch - User mailing list archive at Nabble.com.
