[
https://issues.apache.org/jira/browse/TIKA-1415?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14134369#comment-14134369
]
Nick Burch commented on TIKA-1415:
----------------------------------
We have unit tests which show Tika (trunk) successfully detecting and
extracting embedded resources (including word documents) from within a
PowerPoint .ppt file
Any chance you could write a small junit unit test showing your problem? And
including a sample powerpoint file if you can't reproduce the issue on the Tika
test PPT files.
> PowerPoint2003 embedded with word. The embedded file can not be detected.
> -------------------------------------------------------------------------
>
> Key: TIKA-1415
> URL: https://issues.apache.org/jira/browse/TIKA-1415
> Project: Tika
> Issue Type: Bug
> Components: detector, parser
> Affects Versions: 1.5
> Environment: window7
> Reporter: sunxingzhe
> Labels: Tika, poi
>
> Word2003 or word2007 insert into Powerpoint2003 as embedded file。
> The embedded file‘s type can not be detected。
> The embedded file's content can not be parsed.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)