I want to have the ability to add an html class to a div like <div class="donotparse" and have tika ignore it.
I know about the http://tika.apache.org/1.2/api/index.html?org/apache/tika/parser/html/HtmlParser.html But that only lets me exclude for a given element name. In my case I do not know the element name but I know a class applied to that element. What is the right way to do this?
