Rainer Schnitker created TIKA-4140:
--------------------------------------

             Summary: For Outlook emails with a signature, the attachments are 
not processed.
                 Key: TIKA-4140
                 URL: https://issues.apache.org/jira/browse/TIKA-4140
             Project: Tika
          Issue Type: Bug
          Components: handler
    Affects Versions: 2.9.0
         Environment: Java 17
            Reporter: Rainer Schnitker
         Attachments: Outlook-Mail-Signature.zip

For Outlook emails with a signature, the attachments are not processed. It is 
not entirely clear whether the class 
"org.apache.tika.parser.microsoft.OutlookExtractor" has a problem or the POI 
component used.


The issure attachement zip file has the same example with or without signature:
 * File "HTML-Mail without Signature.msg"
 * File "HTML-Mail Signature Elster.msg"

case a) the attachements (word and pdf) are processed

case b) the attachements are not processed  (only one blob, base64 encoded)



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to