Tim Barrett created TIKA-1666:
---------------------------------
Summary: No content extracted from eml files with media type
message/x-emlx
Key: TIKA-1666
URL: https://issues.apache.org/jira/browse/TIKA-1666
Project: Tika
Issue Type: Bug
Components: parser
Affects Versions: 1.9, 1.8, 1.7, 1.6, 1.5, 1.4, 1.3, 1.2, 1.1, 1.0
Environment: Linux, Os-x, Windows
Reporter: Tim Barrett
Our software uses Tika to parse large and diverse sets of customer files.
Amongst these files we have eml files which are embedded within msg files.
These eml files have a media type of message/x-emlx as detected by Media
Detector. Although these are valid eml files (they can be opened and read on
os-x for example), when they are parsed no content is detected or passed to the
content handler.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)