[
https://issues.apache.org/jira/browse/TIKA-3290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17285829#comment-17285829
]
Luís Filipe Nassif commented on TIKA-3290:
------------------------------------------
Currently I think it cannot be set in xml. If you are calling Tika from java,
the code would be something like:
{code:java}
ParseContext context = new ParseContext();
MimeConfig mimeConfig = new MimeConfig;
mimeConfig.setHeadlessParsing("text/plain");
context.set(MimeConfig.class, mimeConfig);
parser.parse(inputStream, contentHandler, metadata, context);{code}
This will apply to all messages.
Or you can override current message/rfc822 definition with the old definition
which is better for your dataset from your perspective. Take a look at
http://tika.apache.org/1.25/parser_guide.html#Add_your_MIME-Type
> Extension reading it as eml instead of txt
> ------------------------------------------
>
> Key: TIKA-3290
> URL: https://issues.apache.org/jira/browse/TIKA-3290
> Project: Tika
> Issue Type: Bug
> Components: core, mime
> Affects Versions: 1.25
> Reporter: Vamsi Molli
> Priority: Major
> Labels: tika-parsers
> Fix For: 1.24.1
>
> Attachments: test_sample_message.txt
>
>
> The attached file extension is reading it as eml instead of txt. With version
> 1.24.1 it is reading it as txt and now with the upgrade to 1.25, it is
> reading it as eml. So that while parsing we are getting mail corrupted error.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)