[ 
https://issues.apache.org/jira/browse/PDFBOX-2653?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14301487#comment-14301487
 ] 

Tilman Hausherr edited comment on PDFBOX-2653 at 2/17/15 2:35 PM:
------------------------------------------------------------------

Also with ImageMagick: 
{quote}convert.exe: Premature EOL at line 0 of strip 0 (got 0, expected 1728). 
`Fax3Decode1D' @ warning/tiff.c/TIFFWarnings/857.{quote}
I've also e-mailed the author of IrfanView, but like ImageMagick, he uses 
libtiff.


https://www.ietf.org/rfc/rfc2301.txt
{noformat}
In conventional fax, an MH-compressed fax data stream for a page
consists of the following sequence:

   EOL, compressed data (first line), EOL, compressed data, ... ,
   EOL, compressed data (last line), RTC (6 consecutive EOL codes)

Baseline TIFF does not use EOL codes or Return to Control (RTC)
sequences for MH-compressed data. However, the TIFF extension field
T4Options used in this specification for MH compression (Compression
= 3) requires EOLs.{noformat}
However the above is the specification of TIFF-FX. In TIFF
https://partners.adobe.com/public/developer/en/tiff/TIFF6.pdf
"No EOL code words are used"

I've also opened a [bug report at 
libtiff|http://bugzilla.maptools.org/show_bug.cgi?id=2502].


was (Author: tilman):
Also with ImageMagick: 
{quote}convert.exe: Premature EOL at line 0 of strip 0 (got 0, expected 1728). 
`Fax3Decode1D' @ warning/tiff.c/TIFFWarnings/857.{quote}
I've also e-mailed the author of IrfanView, but like ImageMagick, he uses 
libtiff.


https://www.ietf.org/rfc/rfc2301.txt
{noformat}
In conventional fax, an MH-compressed fax data stream for a page
consists of the following sequence:

   EOL, compressed data (first line), EOL, compressed data, ... ,
   EOL, compressed data (last line), RTC (6 consecutive EOL codes)

Baseline TIFF does not use EOL codes or Return to Control (RTC)
sequences for MH-compressed data. However, the TIFF extension field
T4Options used in this specification for MH compression (Compression
= 3) requires EOLs.{noformat}
However the above is the specification of TIFF-FX. In TIFF
https://partners.adobe.com/public/developer/en/tiff/TIFF6.pdf
"No EOL code words are used"

I've also opened a bug report at libtiff.

> Image extraction fails with attached PDF
> ----------------------------------------
>
>                 Key: PDFBOX-2653
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-2653
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Utilities
>    Affects Versions: 1.8.8, 2.0.0
>         Environment: LinuxMint 1.7 64 bits
>            Reporter: Paco Avila
>            Assignee: Tilman Hausherr
>              Labels: CCITTFaxDecode
>             Fix For: 1.8.9, 2.0.0
>
>         Attachments: PDFBOX-2653-1.tiff, image.pdf
>
>
> ExtractImages failed with the following exception:
> java.lang.NullPointerException
>       at 
> org.apache.pdfbox.pdmodel.graphics.xobject.PDCcitt$TiffWrapper.buildHeader(PDCcitt.java:651)
>       at 
> org.apache.pdfbox.pdmodel.graphics.xobject.PDCcitt$TiffWrapper.<init>(PDCcitt.java:497)
>       at 
> org.apache.pdfbox.pdmodel.graphics.xobject.PDCcitt$TiffWrapper.<init>(PDCcitt.java:488)
>       at 
> org.apache.pdfbox.pdmodel.graphics.xobject.PDCcitt.write2OutputStream(PDCcitt.java:237)
>       at 
> org.apache.pdfbox.pdmodel.graphics.xobject.PDXObjectImage.write2file(PDXObjectImage.java:265)
>       at 
> org.apache.pdfbox.ExtractImages.processResources(ExtractImages.java:255)
>       at org.apache.pdfbox.ExtractImages.extractImages(ExtractImages.java:180)
>       at org.apache.pdfbox.ExtractImages.main(ExtractImages.java:80)
>       at org.apache.pdfbox.PDFBox.main(PDFBox.java:62)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to