Philipp Steinkrueger created TIKA-1971:
------------------------------------------

             Summary: Email saved as .eml with no body not detected as rfc822, 
while same email saved as plain txt is.
                 Key: TIKA-1971
                 URL: https://issues.apache.org/jira/browse/TIKA-1971
             Project: Tika
          Issue Type: Bug
          Components: detector
    Affects Versions: 1.14
         Environment: Debian Jessie
Java(TM) SE Runtime Environment (build 1.8.0_91-b14)
Mac OSX Mail
            Reporter: Philipp Steinkrueger
            Priority: Minor


I save an email with no body text

(1) by dragging it from Mac Mail so that an .eml file is created
(2) by using 'Save As' in Mac Mail so that a .txt file is created

I then feed the files to Tika Server with the following command

curl -T filename http://localhost:9998/detect/stream

In case (1) the response is text/plain, while in case (2) the response is 
message/rfc822. This is strange, since (1) includes the full raw header, while 
(2) only includes a very abbreviated header.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to