On Wed, 5 Oct 2016, Ingo Siebert wrote:
I just used Tika (org.apache.tika:tika-parsers:1.13) to parse an e-mail with
multipart/mixed content.
How do you want to get the various parts back? All text inlined, or a
special callback for each part? What about the metadata for the parts?
The parsing result of Tika is the file in plain text including all headers an
boundary elements.
The words in the attachment are also not parsed.
Is this the defined behaviour of Tika?
It is if you don't tell Tika to recurse into embedded resources
Nick