[
https://issues.apache.org/jira/browse/PDFBOX-1086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
John Hewson updated PDFBOX-1086:
--------------------------------
Component/s: (was: Parsing)
> Error when decoding CCITT compressed data that contains EOLs, fill bits etc.
> ----------------------------------------------------------------------------
>
> Key: PDFBOX-1086
> URL: https://issues.apache.org/jira/browse/PDFBOX-1086
> Project: PDFBox
> Issue Type: Bug
> Reporter: Jeremias Maerki
> Assignee: Jeremias Maerki
>
> The TIFFFaxDecoder class (originally from JAI via XML Graphics Commons)
> does not handle cases such as EOLs between lines or before the first
> line. But the PDF CCITTFaxDecode filter needs to allow many variants of the
> encoding. Apparently, TIFF has a relatively restricted way of encoding CCITT
> data, so TIFFFaxDecoder was not written to be as flexible as we need it.
> Ideally, PDFBox should handle anything that gets thrown at it.
> It appears that it would be rather difficult to retrofit TIFFFaxDecoder with
> the necessary flexibility. So, new decoders for T.4 and T.6 should probably
> be written.
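
To illustrate the kind of tolerance described above, here is a minimal sketch of an EOL check that accepts fill bits. In T.4 an EOL is eleven 0 bits followed by a 1, and fill consists of extra 0 bits placed in front of it, so any run of at least eleven zeros ending in a 1 can be treated as an EOL. The class and method names (EolScanner, skipEol, position) are hypothetical and are not part of PDFBox or TIFFFaxDecoder; the sketch also assumes bits are read most significant bit first.

```java
/** Hypothetical helper, not a PDFBox class: detects T.4 EOL codes with optional fill bits. */
public class EolScanner {
    private final byte[] data;
    private int bitPos; // index of the next bit to read, counted from the start of data

    public EolScanner(byte[] data) {
        this.data = data;
    }

    /** Reads one bit, MSB first (an assumption of this sketch); returns -1 at end of data. */
    private int readBit() {
        if (bitPos >= data.length * 8) {
            return -1;
        }
        int bit = (data[bitPos >> 3] >> (7 - (bitPos & 7))) & 1;
        bitPos++;
        return bit;
    }

    /**
     * If an EOL (at least eleven 0 bits followed by a 1) starts at the
     * current position, consumes it and returns true. Otherwise restores
     * the position and returns false, so the caller can decode a run code.
     */
    public boolean skipEol() {
        int start = bitPos;
        int zeros = 0;
        int bit;
        while ((bit = readBit()) == 0) {
            zeros++; // counts both fill bits and the zeros of the EOL itself
        }
        if (bit == 1 && zeros >= 11) {
            return true;
        }
        bitPos = start; // not an EOL: rewind
        return false;
    }

    /** Current bit offset into the data. */
    public int position() {
        return bitPos;
    }
}
```

A decoder built along these lines could call skipEol() before each line and simply continue when it returns false, which would cover data both with and without EOLs.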