[
https://issues.apache.org/jira/browse/TIKA-1268?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14128398#comment-14128398
] Tim Allison edited comment on TIKA-1268 at 9/10/14 12:13 PM: ------------------------------------------------------------- These should do it, no? Either with svn commandline: svn diff -c 1586159 Or: [viewvc PDF2XHTML|http://svn.apache.org/viewvc/tika/trunk/tika-parsers/src/main/java/org/apache/tika/parser/pdf/PDF2XHTML.java?annotate=1586159] [patch PDF2XHTML|http://svn.apache.org/viewvc/tika/trunk/tika-parsers/src/main/java/org/apache/tika/parser/pdf/PDF2XHTML.java?r1=1586158&r2=1586159&] [patch PDFParserTest|http://svn.apache.org/viewvc/tika/trunk/tika-parsers/src/test/java/org/apache/tika/parser/pdf/PDFParserTest.java?r1=1575116&r2=1586159&view=patch] Or do you mean the patch for PDFBox 2.0 (TIKA-1285??)? was (Author: [email protected]): These should do it, no? Either with svn commandline: svn diff -c 1586159 Or: [viewvc PDF2XHTML|http://svn.apache.org/viewvc/tika/trunk/tika-parsers/src/main/java/org/apache/tika/parser/pdf/PDF2XHTML.java?annotate=1586159] [patch PDF2XHTML|http://svn.apache.org/viewvc/tika/trunk/tika-parsers/src/main/java/org/apache/tika/parser/pdf/PDF2XHTML.java?r1=1586158&r2=1586159&] [patch PDFParserTest|http://svn.apache.org/viewvc/tika/trunk/tika-parsers/src/test/java/org/apache/tika/parser/pdf/PDFParserTest.java?r1=1575116&r2=1586159&view=patch] > Extract images from PDF documents > --------------------------------- > > Key: TIKA-1268 > URL: https://issues.apache.org/jira/browse/TIKA-1268 > Project: Tika > Issue Type: New Feature > Components: parser > Reporter: Jukka Zitting > Assignee: Jukka Zitting > Fix For: 1.6 > > > It would be nice if images within PDF documents could be extracted much like > embedded attachments are now being handled. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
