[
https://issues.apache.org/jira/browse/PDFBOX-2653?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14301487#comment-14301487
]
Tilman Hausherr edited comment on PDFBOX-2653 at 2/2/15 5:48 PM:
-----------------------------------------------------------------
Also with ImageMagick:
{quote}convert.exe: Premature EOL at line 0 of strip 0 (got 0, expected 1728).
`Fax3Decode1D' @ warning/tiff.c/TIFFWarnings/857.{quote}
I've also e-mailed the author of IrfanView, but like ImageMagick, he uses
libtiff.
https://www.ietf.org/rfc/rfc2301.txt
{noformat}
In conventional fax, an MH-compressed fax data stream for a page
consists of the following sequence:
EOL, compressed data (first line), EOL, compressed data, ... ,
EOL, compressed data (last line), RTC (6 consecutive EOL codes)
Baseline TIFF does not use EOL codes or Return to Control (RTC)
sequences for MH-compressed data. However, the TIFF extension field
T4Options used in this specification for MH compression (Compression
= 3) requires EOLs.{noformat}
However, the above is the TIFF-FX specification. The TIFF 6.0 specification itself
https://partners.adobe.com/public/developer/en/tiff/TIFF6.pdf
says: "No EOL code words are used."
I've also opened a bug report at libtiff.
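To check which of the two layouts a given CCITT stream follows, one could scan it for EOL code words. The sketch below is only an illustration and is not taken from PDFBox or libtiff: the class and method names (EolScan, countEol, startsWithEol) are hypothetical, and it assumes bits are packed most significant bit first. It relies on the T.4 definition of EOL as eleven 0 bits followed by a single 1 bit, optionally preceded by extra 0 fill bits.

```java
public class EolScan {
    /** Reads one bit, assuming the most significant bit of each byte comes first. */
    static int bitAt(byte[] data, int bitIndex) {
        return (data[bitIndex >> 3] >> (7 - (bitIndex & 7))) & 1;
    }

    /** Counts EOL code words: a run of at least eleven 0 bits followed by a 1 bit. */
    static int countEol(byte[] data) {
        int zeros = 0;
        int count = 0;
        for (int i = 0; i < data.length * 8; i++) {
            if (bitAt(data, i) == 0) {
                zeros++;
            } else {
                if (zeros >= 11) {
                    count++;
                }
                zeros = 0;
            }
        }
        return count;
    }

    /** True if the first 12 bits are exactly 000000000001, as in a conventional fax stream. */
    static boolean startsWithEol(byte[] data) {
        if (data.length < 2) {
            return false;
        }
        int first12 = ((data[0] & 0xFF) << 4) | ((data[1] & 0xFF) >> 4);
        return first12 == 0x001;
    }

    public static void main(String[] args) {
        // Two back-to-back EOL code words: 000000000001 000000000001
        byte[] twoEols = {0x00, 0x10, 0x01};
        System.out.println("starts with EOL: " + startsWithEol(twoEols));
        System.out.println("EOL count: " + countEol(twoEols));
    }
}
```

A stream written the baseline TIFF way should report no leading EOL, while a conventional fax stream should start with one and end with six (the RTC). The check is a heuristic and ignores byte-aligned EOLs and reversed fill order.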
> Image extraction fails with attached PDF
> ----------------------------------------
>
> Key: PDFBOX-2653
> URL: https://issues.apache.org/jira/browse/PDFBOX-2653
> Project: PDFBox
> Issue Type: Bug
> Components: Utilities
> Affects Versions: 1.8.8, 2.0.0
> Environment: LinuxMint 1.7 64 bits
> Reporter: Paco Avila
> Assignee: Tilman Hausherr
> Fix For: 1.8.9, 2.0.0
>
> Attachments: PDFBOX-2653-1.tiff, image.pdf
>
>
> ExtractImages failed with the following exception:
> java.lang.NullPointerException
> at org.apache.pdfbox.pdmodel.graphics.xobject.PDCcitt$TiffWrapper.buildHeader(PDCcitt.java:651)
> at org.apache.pdfbox.pdmodel.graphics.xobject.PDCcitt$TiffWrapper.<init>(PDCcitt.java:497)
> at org.apache.pdfbox.pdmodel.graphics.xobject.PDCcitt$TiffWrapper.<init>(PDCcitt.java:488)
> at org.apache.pdfbox.pdmodel.graphics.xobject.PDCcitt.write2OutputStream(PDCcitt.java:237)
> at org.apache.pdfbox.pdmodel.graphics.xobject.PDXObjectImage.write2file(PDXObjectImage.java:265)
> at org.apache.pdfbox.ExtractImages.processResources(ExtractImages.java:255)
> at org.apache.pdfbox.ExtractImages.extractImages(ExtractImages.java:180)
> at org.apache.pdfbox.ExtractImages.main(ExtractImages.java:80)
> at org.apache.pdfbox.PDFBox.main(PDFBox.java:62)