[
https://issues.apache.org/jira/browse/TIKA-1666?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14603961#comment-14603961
]
Tim Allison commented on TIKA-1666:
-----------------------------------
Y, X-Parsed-By shows TXTParser from tika-app. Would the right thing be to have
the RFC822Parser handle this file? Is the problem with the MSG parser not
detecting the file correctly/sending it to the right parser, or is the problem
that Tika isn't recognizing this as RFC822?
> No content extracted from eml files with media type message/x-emlx
> ------------------------------------------------------------------
>
> Key: TIKA-1666
> URL: https://issues.apache.org/jira/browse/TIKA-1666
> Project: Tika
> Issue Type: Bug
> Components: parser
> Affects Versions: 1.0, 1.1, 1.2, 1.3, 1.4, 1.5, 1.6, 1.7, 1.8, 1.9
> Environment: Linux, Os-x, Windows
> Reporter: Tim Barrett
> Attachments: small cust.eml
>
>
> Our software uses Tika to parse large and diverse sets of customer files.
> Amongst these files we have eml files which are embedded within msg files.
> These eml files have a media type of message/x-emlx as detected by Media
> Detector. Although these are valid eml files (they can be opened and read on
> os-x for example), when they are parsed no content is detected or passed to
> the content handler.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)