On Tue, 17 Apr 2012, William Hays wrote:
I believe you answered a different question than what I asked. My observation was specifically about the AutoDetectParser listing its supported mediatypes, not about the HTMLParser.

The Tika App uses AutoDetectParser internally, so if it's finding the parser and the mimetypes then they should be correctly defined.

I wonder if perhaps in your application if you're missing some of the classes that the HtmlParser depends on? Parsers will only show up if they can be loaded correctly

As a general rule, it's often worth checking something with Tika App when your code misbehaves, as it can help diferentiate between Tika bugs and errors in the setup of Tika in your code

Nick

Reply via email to