[ 
https://issues.apache.org/jira/browse/TIKA-2232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15822136#comment-15822136
 ] 

Nicholas DiPiazza commented on TIKA-2232:
-----------------------------------------

[~pascal.essiembre] totally

obviously with the GPL3 license most people cannot use this jbig2-imageio 
Library. So can we please provide a way to turn off this exception?

{code}
org.apache.pdfbox.filter.MissingImageReaderException: Cannot read JBIG2 image: 
jbig2-imageio is not installed
        at org.apache.pdfbox.filter.Filter.findImageReader(Filter.java:128) 
~[pdfbox-2.0.1.jar:2.0.1]
        at org.apache.pdfbox.filter.JBIG2Filter.decode(JBIG2Filter.java:55) 
~[pdfbox-2.0.1.jar:2.0.1]
        at org.apache.pdfbox.cos.COSInputStream.create(COSInputStream.java:69) 
~[pdfbox-2.0.1.jar:2.0.1]
        at 
org.apache.pdfbox.cos.COSStream.createInputStream(COSStream.java:163) 
~[pdfbox-2.0.1.jar:2.0.1]
        at 
org.apache.pdfbox.pdmodel.common.PDStream.createInputStream(PDStream.java:235) 
~[pdfbox-2.0.1.jar:2.0.1]
        at 
org.apache.pdfbox.pdmodel.graphics.image.PDImageXObject.<init>(PDImageXObject.java:147)
 ~[pdfbox-2.0.1.jar:2.0.1]
        at 
org.apache.pdfbox.pdmodel.graphics.PDXObject.createXObject(PDXObject.java:70) 
~[pdfbox-2.0.1.jar:2.0.1]
        at 
org.apache.pdfbox.pdmodel.PDResources.getXObject(PDResources.java:385) 
~[pdfbox-2.0.1.jar:2.0.1]
        at 
org.apache.tika.parser.pdf.PDF2XHTML.extractImages(PDF2XHTML.java:359) 
~[tika-parsers-1.13.jar:1.13]
        at org.apache.tika.parser.pdf.PDF2XHTML.endPage(PDF2XHTML.java:271) 
~[tika-parsers-1.13.jar:1.13]
        at 
org.apache.pdfbox.text.PDFTextStripper.processPage(PDFTextStripper.java:393) 
~[pdfbox-2.0.1.jar:2.0.1]
        at org.apache.tika.parser.pdf.PDF2XHTML.processPage(PDF2XHTML.java:214) 
~[tika-parsers-1.13.jar:1.13]
        at 
org.apache.pdfbox.text.PDFTextStripper.processPages(PDFTextStripper.java:319) 
~[pdfbox-2.0.1.jar:2.0.1]
        at 
org.apache.pdfbox.text.PDFTextStripper.writeText(PDFTextStripper.java:266) 
~[pdfbox-2.0.1.jar:2.0.1]
        
{code}

> Add JBIG2 image parsing support
> -------------------------------
>
>                 Key: TIKA-2232
>                 URL: https://issues.apache.org/jira/browse/TIKA-2232
>             Project: Tika
>          Issue Type: New Feature
>          Components: parser
>    Affects Versions: 1.14
>         Environment: Any
>            Reporter: Pascal Essiembre
>            Assignee: Tim Allison
>            Priority: Minor
>             Fix For: 2.0, 1.15
>
>
> If you are interested, I would like to add support for JBIG2 image files 
> (.jb2, or .jbig2).  I have encountered them PDFs.
> I will make a pull-request shortly.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to