I would like to be able to add an HTML class to a div, like
<div class="donotparse">, and have Tika ignore that element.

I know about the HtmlParser API:
http://tika.apache.org/1.2/api/index.html?org/apache/tika/parser/html/HtmlParser.html
But that only lets me exclude by element name. In my case I do not know
the element name, only a class applied to that element.
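
To make the desired behaviour concrete, here is a rough sketch of the kind of
filtering I have in mind, written against plain SAX rather than any Tika
class. The name ClassSkippingHandler and the idea of wrapping the downstream
ContentHandler are my own assumptions, not something taken from the Tika API.
I also assume the class attribute actually reaches the handler, which may
depend on how the HTML mapping (HtmlMapper) is configured; I have not
verified that.

```java
import org.xml.sax.Attributes;
import org.xml.sax.ContentHandler;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.DefaultHandler;

/**
 * Hypothetical SAX decorator: forwards events to a downstream handler,
 * but drops any element carrying the given class, plus everything inside it.
 */
public class ClassSkippingHandler extends DefaultHandler {
    private final ContentHandler downstream;
    private final String skipClass;
    // 0 means "not skipping"; otherwise the nesting depth inside the skipped element.
    private int skipDepth = 0;

    public ClassSkippingHandler(ContentHandler downstream, String skipClass) {
        this.downstream = downstream;
        this.skipClass = skipClass;
    }

    // True if the class attribute contains skipClass as a whitespace-separated token.
    private boolean hasSkipClass(Attributes atts) {
        String value = atts.getValue("class");
        if (value == null) {
            value = atts.getValue("", "class");
        }
        if (value == null) {
            return false;
        }
        for (String token : value.trim().split("\\s+")) {
            if (token.equals(skipClass)) {
                return true;
            }
        }
        return false;
    }

    @Override
    public void startElement(String uri, String localName, String qName, Attributes atts)
            throws SAXException {
        if (skipDepth > 0) {
            skipDepth++;          // nested element inside a skipped subtree
            return;
        }
        if (hasSkipClass(atts)) {
            skipDepth = 1;        // start skipping at this element
            return;
        }
        downstream.startElement(uri, localName, qName, atts);
    }

    @Override
    public void endElement(String uri, String localName, String qName) throws SAXException {
        if (skipDepth > 0) {
            skipDepth--;          // leaving a skipped element
            return;
        }
        downstream.endElement(uri, localName, qName);
    }

    @Override
    public void characters(char[] ch, int start, int length) throws SAXException {
        if (skipDepth == 0) {
            downstream.characters(ch, start, length);
        }
    }

    @Override
    public void ignorableWhitespace(char[] ch, int start, int length) throws SAXException {
        if (skipDepth == 0) {
            downstream.ignorableWhitespace(ch, start, length);
        }
    }
}
```

The idea would be to wrap whatever handler normally receives the parsed
output, for example new ClassSkippingHandler(myHandler, "donotparse"), and
pass the wrapper to the parser instead. Document-level events such as
startDocument are not forwarded in this sketch.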

What is the right way to do this?
