[ https://issues.apache.org/jira/browse/TIKA-396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12856826#action_12856826 ]

Jukka Zitting commented on TIKA-396:
------------------------------------

In revision 933903 I modified the OutlookExtractor to use the parser instance in the ParseContext instead of a hardcoded AutoDetectParser when parsing the attachments. This is similar to what the PackageParser does, and allows better client-level control of the parsing process.

Note that there's now an extra "Invalid attachment id" line being printed to system out as a part of the tika-parsers test suite. I guess this comes from POI.

> Parser Attachements from Outlook Messages
> -----------------------------------------
>
>                 Key: TIKA-396
>                 URL: https://issues.apache.org/jira/browse/TIKA-396
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>   Affects Versions: 0.6
>         Environment: All environments.
>            Reporter: Dave Meikle
>            Assignee: Dave Meikle
>
> As raised by Albert Jensen on the tika-user mailing list[1], it would be good
> for the Outlook Parser to iterate through the mails attachments and then
> extract their content.
> [1]http://mail-archives.apache.org/mod_mbox/lucene-tika-user/201003.mbox/%3c002701cacccf$16108b40$4231a1...@mail.dk%3e
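
For readers unfamiliar with the pattern the comment describes (taking the attachment parser from the context instead of hardcoding one), here is a minimal, stdlib-only sketch. It is not Tika code: SketchParser, SketchContext, DEFAULT_PARSER and extractAttachments are hypothetical stand-ins invented for illustration, and the fallback to a default parser when the client registers nothing is an assumption, not something the comment states.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class ContextDelegationSketch {

    /** Hypothetical stand-in for a parser: turns an attachment into text. */
    interface SketchParser {
        String parse(String attachment, SketchContext context);
    }

    /** Hypothetical stand-in for a type-keyed context object. */
    static final class SketchContext {
        private final Map<Class<?>, Object> entries = new HashMap<>();

        <T> void set(Class<T> key, T value) {
            entries.put(key, value);
        }

        <T> T get(Class<T> key, T defaultValue) {
            Object value = entries.get(key);
            return value != null ? key.cast(value) : defaultValue;
        }
    }

    /** Assumed fallback, used only when the client registered no parser. */
    static final SketchParser DEFAULT_PARSER =
            (attachment, context) -> "default:" + attachment;

    /**
     * Parses each attachment with whatever parser the client placed in the
     * context, rather than constructing a fixed parser inside the extractor.
     */
    static List<String> extractAttachments(List<String> attachments,
                                           SketchContext context) {
        SketchParser delegate = context.get(SketchParser.class, DEFAULT_PARSER);
        List<String> results = new ArrayList<>();
        for (String attachment : attachments) {
            // The same context is passed down, so nested content would also
            // be handled by the client's choice of parser.
            results.add(delegate.parse(attachment, context));
        }
        return results;
    }

    public static void main(String[] args) {
        List<String> attachments = Arrays.asList("a.doc", "b.pdf");

        // No parser registered: the assumed default is used.
        SketchContext context = new SketchContext();
        System.out.println(extractAttachments(attachments, context));

        // Client registers its own parser and thereby controls the parsing.
        SketchParser custom = (attachment, ctx) -> "custom:" + attachment;
        context.set(SketchParser.class, custom);
        System.out.println(extractAttachments(attachments, context));
    }
}
```

The point of the design is that the extractor never names a concrete parser, so a client can limit, replace, or disable attachment parsing purely through what it puts in the context.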