Matthew Caruana Galizia created TIKA-2280:
---------------------------------------------

             Summary: message_from not extracted from Outlook emails
                 Key: TIKA-2280
                 URL: https://issues.apache.org/jira/browse/TIKA-2280
             Project: Tika
          Issue Type: Bug
          Components: parser
    Affects Versions: 1.14
            Reporter: Matthew Caruana Galizia
            Priority: Minor


While the MESSAGE_FROM metadata field is extracted for RFC 822 emails, it isn't 
extracted for Outlook emails. The closest equivalent for Outlook emails is the 
creator field, which contains only the sender's name, not the email address.

Currently I'm getting the from address from the RAW_HEADER_FROM field, but it 
would be nice to be able to use one standard field across email formats.
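
To illustrate the workaround, here is a minimal sketch of the fallback, written 
against a plain Map rather than Tika's Metadata class so that it stands alone. 
The class name, the method name and both key strings are assumptions made for 
illustration; the actual keys should be taken from the Tika constants in the 
version being used.

```java
import java.util.Map;
import java.util.Optional;

class FromAddressFallback {
    // Assumed key names, for illustration only; check them against the
    // Tika metadata constants in your version.
    static final String MESSAGE_FROM = "Message-From";
    static final String RAW_HEADER_FROM = "Message:Raw-Header:From";

    // Prefer the standard field; fall back to the raw From header when the
    // standard field is missing or blank (as reported for Outlook emails).
    static Optional<String> resolveFrom(Map<String, String> metadata) {
        String from = metadata.get(MESSAGE_FROM);
        if (from == null || from.trim().isEmpty()) {
            from = metadata.get(RAW_HEADER_FROM);
        }
        if (from == null || from.trim().isEmpty()) {
            return Optional.empty();
        }
        return Optional.of(from.trim());
    }
}
```

If MESSAGE_FROM were populated for Outlook emails as well, callers would not 
need this kind of per-format fallback.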



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
