[
https://issues.apache.org/jira/browse/TIKA-3290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17280660#comment-17280660
]
Luís Filipe Nassif commented on TIKA-3290:
------------------------------------------
I agree with [~nick] this sample seems an email without its headers, so the
detection looks better now.
MimeConfig class has methods setHeadlessParsing(...) and
setMalformedHeaderStartsBody(...) but they probably will degrade parsing of
other emails if enabled by default.
If you are calling from java, you can send a custom MimeConfig object in
parseContext parse param, after setting
mimeConfig.setHeadlessParsing("text/plain"), so I think these headless messages
could be parsed without problems.
> Extension reading it as eml instead of txt
> ------------------------------------------
>
> Key: TIKA-3290
> URL: https://issues.apache.org/jira/browse/TIKA-3290
> Project: Tika
> Issue Type: Bug
> Components: core, mime
> Affects Versions: 1.25
> Reporter: Vamsi Molli
> Priority: Major
> Labels: tika-parsers
> Fix For: 1.24.1
>
> Attachments: test_sample_message.txt
>
>
> The attached file extension is reading it as eml instead of txt. With version
> 1.24.1 it is reading it as txt and now with the upgrade to 1.25, it is
> reading it as eml. So that while parsing we are getting mail corrupted error.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)