[
https://issues.apache.org/jira/browse/TIKA-1414?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14134700#comment-14134700
]
Tim Allison commented on TIKA-1414:
-----------------------------------
[~tpalsulich], great. Thank you! It might make sense to have two examples,
one with basic EmbeddedResourceHandler with
ParserContainerExtractor...something that writes attachments to output files;
and one specifically for PDFParser and inline images. Y, we do have a working
example in at least one unit test for PDFParser (testEmbeddedFilesInChildren).
> How to extract embedded images from PDFs?
> -----------------------------------------
>
> Key: TIKA-1414
> URL: https://issues.apache.org/jira/browse/TIKA-1414
> Project: Tika
> Issue Type: Bug
> Components: cli
> Affects Versions: 1.6
> Environment: *ubuntu 14.04*
> 3.13.0-35-generic
> 64 bit
> *java version "1.6.0_32"*
> OpenJDK Runtime Environment (IcedTea6 1.13.4) (6b32-1.13.4-4ubuntu0.12.04.2)
> OpenJDK Server VM (build 23.25-b01, mixed mode)`
> Reporter: Damiano
> Labels: features
>
> Hello,
> as i reported in TIKA-1396 I am tring to extract embedded images from PDF
> files. It has not been resolved in TIka 1.6.
> I am not able to extract images from *CLI* using *--extract* parameter.
> How can I extract those images?
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)