Philipp Steinkrueger created TIKA-1971:
------------------------------------------
Summary: Email saved as .eml with no body not detected as rfc822,
while same email saved as plain txt is.
Key: TIKA-1971
URL: https://issues.apache.org/jira/browse/TIKA-1971
Project: Tika
Issue Type: Bug
Components: detector
Affects Versions: 1.14
Environment: Debian Jessie
Java(TM) SE Runtime Environment (build 1.8.0_91-b14)
Mac OSX Mail
Reporter: Philipp Steinkrueger
Priority: Minor
I save an email with no body text
(1) by dragging it from Mac Mail so that an .eml file is created
(2) by using 'Save As' in Mac Mail so that a .txt file is created
I then feed the files to Tika Server with the following command
curl -T filename http://localhost:9998/detect/stream
In case (1) the response is text/plain, while in case (2) the response is
message/rfc822. This is strange, since (1) includes the full raw header, while
(2) only includes a very abbreviated header.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)