Jani Pehkonen created PDFBOX-4211:
-------------------------------------
Summary: Some text is missing in JBIG2 images
Key: PDFBOX-4211
URL: https://issues.apache.org/jira/browse/PDFBOX-4211
Project: PDFBox
Issue Type: Bug
Components: JBIG2
Affects Versions: 2.0.9
Reporter: Jani Pehkonen
Attachments: EncodedTable.patch
[This
PDF|http://library.sciencemadness.org/library/books/unitized_experiments_in_ochem.pdf]
is rendered incorrectly.
See pages "iii" (2), "ix" (6), "x" (7), etc. Most of the text on these pages is
missing or in the wrong position when rendered in PDFDebugger. Each page is a
JBIG2 image that uses Huffman coding. Because the JBIG2 images are very large,
you may need a larger heap, for example -Xmx1000M, to avoid an OutOfMemoryError.
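
As a rough way to judge whether the heap is large enough before opening such a
file, the sketch below compares the JVM's maximum heap with an estimate of the
memory one page needs. The page dimensions are hypothetical and are not taken
from the PDF above. The assumption that a page costs one 1-bit bitmap plus one
32-bit ARGB raster is a simplification, not a description of what PDFBox or the
JBIG2 decoder actually allocates.

```java
public class HeapCheck {

    // Bytes for a 1-bit-per-pixel bitmap, with each row padded to whole bytes.
    static long bilevelBytes(int width, int height) {
        long rowStride = (width + 7L) / 8L;
        return rowStride * height;
    }

    // Bytes for the same page as a 32-bit ARGB raster (4 bytes per pixel).
    static long argbBytes(int width, int height) {
        return 4L * width * height;
    }

    public static void main(String[] args) {
        // Hypothetical page size in pixels (assumption, not measured).
        int width = 5000;
        int height = 7000;

        // Upper bound on the heap, as set by -Xmx or the JVM default.
        long maxHeap = Runtime.getRuntime().maxMemory();
        long needed = bilevelBytes(width, height) + argbBytes(width, height);

        System.out.printf("max heap: %d MiB%n", maxHeap / (1024 * 1024));
        System.out.printf("estimated need for one page: %d MiB%n",
                needed / (1024 * 1024));
        if (needed > maxHeap) {
            System.out.println("Heap is probably too small; try a larger -Xmx value.");
        }
    }
}
```

Run it with the same -Xmx setting you intend to use for PDFDebugger to see the
limit the JVM actually applies.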
Apply the attached patch (EncodedTable.patch) to EncodedTable.java in package
org.apache.pdfbox.jbig2.decoder.huffman. It fixes one line of code that does
not follow the JBIG2 standard.
The JBIG2 standard (ITU-T T.88) is freely available
[here|https://www.itu.int/rec/T-REC-T.88-200002-I/en].