Rainer Schnitker created TIKA-4140:
--------------------------------------
Summary: For Outlook emails with a signature, the attachments are
not processed.
Key: TIKA-4140
URL: https://issues.apache.org/jira/browse/TIKA-4140
Project: Tika
Issue Type: Bug
Components: handler
Affects Versions: 2.9.0
Environment: Java 17
Reporter: Rainer Schnitker
Attachments: Outlook-Mail-Signature.zip
For Outlook emails with a signature, the attachments are not processed. It is
not entirely clear whether the class
"org.apache.tika.parser.microsoft.OutlookExtractor" has a problem or the POI
component used.
The issure attachement zip file has the same example with or without signature:
* File "HTML-Mail without Signature.msg"
* File "HTML-Mail Signature Elster.msg"
case a) the attachements (word and pdf) are processed
case b) the attachements are not processed (only one blob, base64 encoded)
--
This message was sent by Atlassian Jira
(v8.20.10#820010)