[ 
https://issues.apache.org/jira/browse/TIKA-3290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17280660#comment-17280660
 ] 

Luís Filipe Nassif commented on TIKA-3290:
------------------------------------------

I agree with [~nick] this sample seems an email without its headers, so the 
detection looks better now.

MimeConfig class has methods setHeadlessParsing(...) and 
setMalformedHeaderStartsBody(...) but they probably will degrade parsing of 
other emails if enabled by default.

If you are calling from java, you can send a custom MimeConfig object in 
parseContext parse param, after setting 
mimeConfig.setHeadlessParsing("text/plain"), so I think these headless messages 
could be parsed without problems.

> Extension reading it as eml instead of txt
> ------------------------------------------
>
>                 Key: TIKA-3290
>                 URL: https://issues.apache.org/jira/browse/TIKA-3290
>             Project: Tika
>          Issue Type: Bug
>          Components: core, mime
>    Affects Versions: 1.25
>            Reporter: Vamsi Molli
>            Priority: Major
>              Labels: tika-parsers
>             Fix For: 1.24.1
>
>         Attachments: test_sample_message.txt
>
>
> The attached file extension is reading it as eml instead of txt. With version 
> 1.24.1 it is reading it as txt and now with the upgrade to 1.25, it is 
> reading it as eml. So that while parsing we are getting mail corrupted error.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to