[jira] [Commented] (TIKA-1414) How to extract embedded images from PDFs?

Tim Allison (JIRA) Mon, 15 Sep 2014 16:44:33 -0700

    [ 
https://issues.apache.org/jira/browse/TIKA-1414?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14134700#comment-14134700
 ]


Tim Allison commented on TIKA-1414:
-----------------------------------

[~tpalsulich], great.  Thank you!  It might make sense to have two examples: 
one with a basic EmbeddedResourceHandler and ParserContainerExtractor, 
something that writes attachments to output files; and one specifically for 
PDFParser and inline images.  Yes, we do have a working example in at least 
one unit test for PDFParser (testEmbeddedFilesInChildren).
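
As a rough illustration of the "writes attachments to output files" half of 
the first example, here is a minimal sketch that uses only the JDK. It is not 
Tika code: AttachmentWriter and safeName are hypothetical names, and the sketch 
assumes the caller already has each embedded resource's reported name and an 
InputStream for its bytes (in Tika these would presumably come from an 
embedded-document callback). It also assumes Java 7 or later for 
java.nio.file, which is newer than the Java 1.6 in the reporter's environment.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;

// Hypothetical helper, not part of Tika: writes each embedded resource
// it is handed to its own file under a single output directory.
class AttachmentWriter {
    private final Path outputDir;
    private int count = 0;

    public AttachmentWriter(Path outputDir) {
        this.outputDir = outputDir;
    }

    // Build a file name that cannot escape the output directory.
    // Directory components are dropped, unusual characters become '_',
    // and a running index keeps names unique within one writer.
    public static String safeName(String reportedName, int index) {
        if (reportedName == null || reportedName.trim().isEmpty()) {
            return "embedded-" + index + ".bin";
        }
        String name = reportedName.replace('\\', '/');
        name = name.substring(name.lastIndexOf('/') + 1);
        name = name.replaceAll("[^A-Za-z0-9._-]", "_");
        if (name.isEmpty() || name.equals(".") || name.equals("..")) {
            return "embedded-" + index + ".bin";
        }
        return index + "-" + name;
    }

    // Copy the stream to a new file and return the path that was written.
    public Path write(String reportedName, InputStream stream) throws IOException {
        Files.createDirectories(outputDir);
        Path target = outputDir.resolve(safeName(reportedName, count++));
        Files.copy(stream, target, StandardCopyOption.REPLACE_EXISTING);
        return target;
    }

    public static void main(String[] args) throws IOException {
        Path dir = Files.createTempDirectory("attachments");
        AttachmentWriter writer = new AttachmentWriter(dir);
        // Stand-in bytes; a real caller would pass the embedded stream.
        InputStream fake = new ByteArrayInputStream(new byte[] {1, 2, 3});
        System.out.println(writer.write("figures/image 1.png", fake));
    }
}
```

The index prefix is a deliberate choice: inline images in a PDF often have no 
name or share one, so the reported name alone cannot be trusted to be unique 
or safe as a path.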

> How to extract embedded images from PDFs?
> -----------------------------------------
>
>                 Key: TIKA-1414
>                 URL: https://issues.apache.org/jira/browse/TIKA-1414
>             Project: Tika
>          Issue Type: Bug
>          Components: cli
>    Affects Versions: 1.6
>         Environment: *ubuntu 14.04*
> 3.13.0-35-generic
> 64 bit
> *java version "1.6.0_32"*
> OpenJDK Runtime Environment (IcedTea6 1.13.4) (6b32-1.13.4-4ubuntu0.12.04.2)
> OpenJDK Server VM (build 23.25-b01, mixed mode)
>            Reporter: Damiano
>              Labels: features
>
> Hello,
> as I reported in TIKA-1396, I am trying to extract embedded images from PDF 
> files. This has not been resolved in Tika 1.6.
> I am not able to extract images from the *CLI* using the *--extract* parameter.
> How can I extract those images?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
