On Thu, 28 Jul 2016, Vjeran Marcinko wrote:
Just as I resolved the rpoblem with MBOX parser, I noticed that it
doesn't correctly detect contained RFC822 messages as message/rfc822,
but usually text/html or some variation of it.
And question as before, is there some workaround for 1.13 to place in
custom-mimetypes.xml that would fix this?
Can you create a small junit testcase that shows the problem, using either
a small mbox file of your own, or one of the ones in the tika-parsers test
documents directory? Attach that to a new JIRA issue, and one of us can
use it to take a look at what's going wrong. Once we know the underlying
issue, we can hopefully fix it, and maybe let you know a workaround!
Nick