[ 
https://issues.apache.org/jira/browse/PDFBOX-4227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16483552#comment-16483552
 ] 

Tilman Hausherr commented on PDFBOX-4227:
-----------------------------------------


{code:java}
3 0 obj
<<
/Length 2374
/Type /Metadata
/Subtype /XML
/Filter /FlateDecode
>>
stream
<x:xmpmeta xmlns:x="adobe:ns:meta/" x:xmptk="XMP Core 5.4.0">
   <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
      <rdf:Description rdf:about=""
            xmlns:xmp="http://ns.adobe.com/xap/1.0/"
            xmlns:dc="http://purl.org/dc/elements/1.1/"
            xmlns:xmpMM="http://ns.adobe.com/xap/1.0/mm/"
            xmlns:stEvt="http://ns.adobe.com/xap/1.0/sType/ResourceEvent#"
            xmlns:stRef="http://ns.adobe.com/xap/1.0/sType/ResourceRef#"
            xmlns:pdf="http://ns.adobe.com/pdf/1.3/">
         <xmp:CreatorTool>Adobe InDesign CC 2017 (Macintosh)</xmp:CreatorTool>
         <xmp:MetadataDate>2017-12-18T17:55:15-05:00</xmp:MetadataDate>
         <xmp:CreateDate>2017-05-10T21:51:02Z</xmp:CreateDate>
         <xmp:ModifyDate>2017-12-18T17:55:15-05:00</xmp:ModifyDate>
         <dc:format>application/pdf</dc:format>
         <xmpMM:OriginalDocumentID>xmp.did:8371e414-957c-4e45-abbf-58c95e32809c</xmpMM:OriginalDocumentID>
         <xmpMM:History>
            <rdf:Seq>
               <rdf:li rdf:parseType="Resource">
                  <stEvt:softwareAgent>Adobe InDesign CC 2017 (Macintosh)</stEvt:softwareAgent>
                  <stEvt:changed>/</stEvt:changed>
                  <stEvt:action>converted</stEvt:action>
                  <stEvt:when>2016-11-11T16:11:57-05:00</stEvt:when>
                  <stEvt:parameters>from application/x-indesign to application/pdf</stEvt:parameters>
               </rdf:li>
            </rdf:Seq>
         </xmpMM:History>
         <xmpMM:InstanceID>uuid:faabd5b1-924a-834b-b999-022fd0608e4f</xmpMM:InstanceID>
         <xmpMM:DocumentID>xmp.id:932ce8eb-b215-4b81-82e8-4e0b8c029f67</xmpMM:DocumentID>
         <xmpMM:DerivedFrom rdf:parseType="Resource">
            <stRef:renditionClass>default</stRef:renditionClass>
            <stRef:documentID>xmp.did:8371e414-957c-4e45-abbf-58c95e32809c</stRef:documentID>
            <stRef:instanceID>xmp.iid:12fcd2dd-3fc4-4ab0-a5ee-85a8b58aa933</stRef:instanceID>
            <stRef:originalDocumentID>xmp.did:8371e414-957c-4e45-abbf-58c95e32809c</stRef:originalDocumentID>
         </xmpMM:DerivedFrom>
         <xmpMM:RenditionClass>proof:pdf</xmpMM:RenditionClass>
         <pdf:Trapped>False</pdf:Trapped>
         <pdf:Producer>Mac OS X 10.12.4 Quartz PDFContext</pdf:Producer>
      </rdf:Description>
   </rdf:RDF>
</x:xmpmeta>

endstream
endobj
5 0 obj

{code}
The metadata stream is declared as Flate-compressed (/Filter /FlateDecode), but 
obviously it isn't: the content is plain XML. The file renders fine with PDFBox 
too; only merging fails. Is the problem only with this file? If so, open it 
with Notepad++ and overwrite "/Filter /FlateDecode" with the same number of 
blanks, so that the byte offsets in the file stay unchanged.
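
If more than one file is affected, the same manual fix could be scripted. The 
following is only a rough sketch, not part of PDFBox: the class 
MetadataFilterBlanker and its methods are hypothetical names. It assumes the 
entry is spelled exactly "/Filter /FlateDecode", that the dictionary contains 
"/Type /Metadata" with no nested dictionaries, and that the stream really is 
uncompressed. It overwrites the entry with spaces so the file length and all 
byte offsets stay the same.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

// Hypothetical helper, not a PDFBox class.
public class MetadataFilterBlanker {

    private static final byte[] METADATA_TYPE =
            "/Type /Metadata".getBytes(StandardCharsets.ISO_8859_1);
    private static final byte[] FILTER_ENTRY =
            "/Filter /FlateDecode".getBytes(StandardCharsets.ISO_8859_1);
    private static final byte[] DICT_END =
            ">>".getBytes(StandardCharsets.ISO_8859_1);

    // Naive search for needle within data[from, to); returns -1 if absent.
    static int indexOf(byte[] data, byte[] needle, int from, int to) {
        for (int i = Math.max(from, 0); i + needle.length <= to; i++) {
            int j = 0;
            while (j < needle.length && data[i + j] == needle[j]) {
                j++;
            }
            if (j == needle.length) {
                return i;
            }
        }
        return -1;
    }

    // Returns a copy of the PDF bytes in which the filter entry of the first
    // metadata dictionary is replaced by spaces. The input is not modified.
    static byte[] blankMetadataFilter(byte[] pdf) {
        byte[] out = pdf.clone();
        int type = indexOf(out, METADATA_TYPE, 0, out.length);
        if (type < 0) {
            return out; // no metadata dictionary found
        }
        // Walk back to the "<<" that opens the dictionary (assumes no nesting).
        int start = type;
        while (start > 0 && !(out[start - 1] == '<' && out[start] == '<')) {
            start--;
        }
        // The first ">>" after the type entry is taken as the dictionary end.
        int end = indexOf(out, DICT_END, type, out.length);
        if (start == 0 || end < 0) {
            return out; // dictionary delimiters not found, leave data as is
        }
        int hit = indexOf(out, FILTER_ENTRY, start, end);
        if (hit >= 0) {
            Arrays.fill(out, hit, hit + FILTER_ENTRY.length, (byte) ' ');
        }
        return out;
    }
}
```

The bytes could be read with Files.readAllBytes and the result written with 
Files.write before passing the file to PDFMergerUtility. Filter entries of 
other streams are left alone, because those streams may really be compressed.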

> java.util.zip.DataFormatException: invalid distance too far back
> ----------------------------------------------------------------
>
>                 Key: PDFBOX-4227
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-4227
>             Project: PDFBox
>          Issue Type: Bug
>    Affects Versions: 2.0.9
>            Reporter: Michael Miller
>            Priority: Major
>         Attachments: testCorrupt.pdf
>
>
> I get a zip exception with the attached PDF file, which renders successfully 
> in a couple of PDF viewers I tried.
> ByteArrayInputStream pdfInputStream = new ByteArrayInputStream(pdfByteArray);
> PDFMergerUtility merger = new PDFMergerUtility();
> merger.addSource(pdfInputStream);
> ByteArrayOutputStream outputStream = new ByteArrayOutputStream();
> merger.setDestinationStream(outputStream);
> merger.mergeDocuments(MemoryUsageSetting.setupMainMemoryOnly());
>  
> PDF file attached.
> Stack Trace:
> java.io.IOException: java.util.zip.DataFormatException: invalid distance too far back
>  at org.apache.pdfbox.filter.FlateFilter.decode(FlateFilter.java:83)
>  at org.apache.pdfbox.filter.Filter.decode(Filter.java:87)
>  at org.apache.pdfbox.cos.COSInputStream.create(COSInputStream.java:77)
>  at org.apache.pdfbox.cos.COSStream.createInputStream(COSStream.java:175)
>  at org.apache.pdfbox.cos.COSStream.createInputStream(COSStream.java:163)
>  at org.apache.pdfbox.multipdf.PDFMergerUtility.appendDocument(PDFMergerUtility.java:516)
>  at org.apache.pdfbox.multipdf.PDFMergerUtility.mergeDocuments(PDFMergerUtility.java:263)
>  ...
>  Caused by: java.util.zip.DataFormatException: invalid distance too far back
>  at java.util.zip.Inflater.inflateBytes(Native Method)
>  at java.util.zip.Inflater.inflate(Inflater.java:259)
>  at java.util.zip.Inflater.inflate(Inflater.java:280)
>  at org.apache.pdfbox.filter.FlateFilter.decompress(FlateFilter.java:108)
>  at org.apache.pdfbox.filter.FlateFilter.decode(FlateFilter.java:74)
>  ... 35 more



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

