[ https://issues.apache.org/jira/browse/TIKA-2232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15822136#comment-15822136 ]
Nicholas DiPiazza commented on TIKA-2232:
-----------------------------------------
[~pascal.essiembre] Totally.
Obviously, because of its GPLv3 license, most people cannot use the
jbig2-imageio library. So can we please provide a way to turn off this exception?
{code}
org.apache.pdfbox.filter.MissingImageReaderException: Cannot read JBIG2 image: jbig2-imageio is not installed
    at org.apache.pdfbox.filter.Filter.findImageReader(Filter.java:128) ~[pdfbox-2.0.1.jar:2.0.1]
    at org.apache.pdfbox.filter.JBIG2Filter.decode(JBIG2Filter.java:55) ~[pdfbox-2.0.1.jar:2.0.1]
    at org.apache.pdfbox.cos.COSInputStream.create(COSInputStream.java:69) ~[pdfbox-2.0.1.jar:2.0.1]
    at org.apache.pdfbox.cos.COSStream.createInputStream(COSStream.java:163) ~[pdfbox-2.0.1.jar:2.0.1]
    at org.apache.pdfbox.pdmodel.common.PDStream.createInputStream(PDStream.java:235) ~[pdfbox-2.0.1.jar:2.0.1]
    at org.apache.pdfbox.pdmodel.graphics.image.PDImageXObject.<init>(PDImageXObject.java:147) ~[pdfbox-2.0.1.jar:2.0.1]
    at org.apache.pdfbox.pdmodel.graphics.PDXObject.createXObject(PDXObject.java:70) ~[pdfbox-2.0.1.jar:2.0.1]
    at org.apache.pdfbox.pdmodel.PDResources.getXObject(PDResources.java:385) ~[pdfbox-2.0.1.jar:2.0.1]
    at org.apache.tika.parser.pdf.PDF2XHTML.extractImages(PDF2XHTML.java:359) ~[tika-parsers-1.13.jar:1.13]
    at org.apache.tika.parser.pdf.PDF2XHTML.endPage(PDF2XHTML.java:271) ~[tika-parsers-1.13.jar:1.13]
    at org.apache.pdfbox.text.PDFTextStripper.processPage(PDFTextStripper.java:393) ~[pdfbox-2.0.1.jar:2.0.1]
    at org.apache.tika.parser.pdf.PDF2XHTML.processPage(PDF2XHTML.java:214) ~[tika-parsers-1.13.jar:1.13]
    at org.apache.pdfbox.text.PDFTextStripper.processPages(PDFTextStripper.java:319) ~[pdfbox-2.0.1.jar:2.0.1]
    at org.apache.pdfbox.text.PDFTextStripper.writeText(PDFTextStripper.java:266) ~[pdfbox-2.0.1.jar:2.0.1]
{code}
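To make the request concrete, here is a minimal sketch of the kind of switch I mean. It is not Tika's or PDFBox's actual API: the class `SkipUndecodableImages`, the exception `MissingReaderException`, the `decode` helper, and the `skipUndecodable` flag are all hypothetical stand-ins that only illustrate the behavior (record a warning for an image that cannot be decoded and keep going, instead of aborting).

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class SkipUndecodableImages {

    // Hypothetical stand-in for the "missing image reader" failure.
    // Unchecked here only to keep the sketch short.
    static class MissingReaderException extends RuntimeException {
        MissingReaderException(String message) {
            super(message);
        }
    }

    // Hypothetical decoder: fails for JBIG2, as a PDF library would
    // when no JBIG2 reader is on the classpath.
    static String decode(String filter) {
        if ("JBIG2Decode".equals(filter)) {
            throw new MissingReaderException(
                    "Cannot read JBIG2 image: jbig2-imageio is not installed");
        }
        return "decoded:" + filter;
    }

    // With skipUndecodable == true, an undecodable image becomes a warning
    // and the remaining images are still processed. With false, the
    // exception propagates to the caller as it does today.
    static List<String> extractImages(List<String> filters,
                                      boolean skipUndecodable,
                                      List<String> warnings) {
        List<String> images = new ArrayList<>();
        for (String filter : filters) {
            try {
                images.add(decode(filter));
            } catch (MissingReaderException e) {
                if (!skipUndecodable) {
                    throw e;
                }
                warnings.add(e.getMessage());
            }
        }
        return images;
    }

    public static void main(String[] args) {
        List<String> warnings = new ArrayList<>();
        List<String> images = extractImages(
                Arrays.asList("DCTDecode", "JBIG2Decode", "FlateDecode"),
                true, warnings);
        System.out.println("images:   " + images);
        System.out.println("warnings: " + warnings);
    }
}
```

With the flag on, the two images that can be decoded are still returned and the JBIG2 failure ends up in the warnings list. With the flag off, the caller sees the exception exactly as in the trace above.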
> Add JBIG2 image parsing support
> -------------------------------
>
> Key: TIKA-2232
> URL: https://issues.apache.org/jira/browse/TIKA-2232
> Project: Tika
> Issue Type: New Feature
> Components: parser
> Affects Versions: 1.14
> Environment: Any
> Reporter: Pascal Essiembre
> Assignee: Tim Allison
> Priority: Minor
> Fix For: 2.0, 1.15
>
>
> If you are interested, I would like to add support for JBIG2 image files
> (.jb2 or .jbig2). I have encountered them in PDFs.
> I will make a pull-request shortly.