[ https://issues.apache.org/jira/browse/PDFBOX-4227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16483552#comment-16483552 ]
Tilman Hausherr commented on PDFBOX-4227:
-----------------------------------------

{code:java}
3 0 obj
<<
/Length 2374
/Type /Metadata
/Subtype /XML
/Filter /FlateDecode
>>
stream
<x:xmpmeta xmlns:x="adobe:ns:meta/" x:xmptk="XMP Core 5.4.0">
  <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
    <rdf:Description rdf:about=""
        xmlns:xmp="http://ns.adobe.com/xap/1.0/"
        xmlns:dc="http://purl.org/dc/elements/1.1/"
        xmlns:xmpMM="http://ns.adobe.com/xap/1.0/mm/"
        xmlns:stEvt="http://ns.adobe.com/xap/1.0/sType/ResourceEvent#"
        xmlns:stRef="http://ns.adobe.com/xap/1.0/sType/ResourceRef#"
        xmlns:pdf="http://ns.adobe.com/pdf/1.3/">
      <xmp:CreatorTool>Adobe InDesign CC 2017 (Macintosh)</xmp:CreatorTool>
      <xmp:MetadataDate>2017-12-18T17:55:15-05:00</xmp:MetadataDate>
      <xmp:CreateDate>2017-05-10T21:51:02Z</xmp:CreateDate>
      <xmp:ModifyDate>2017-12-18T17:55:15-05:00</xmp:ModifyDate>
      <dc:format>application/pdf</dc:format>
      <xmpMM:OriginalDocumentID>xmp.did:8371e414-957c-4e45-abbf-58c95e32809c</xmpMM:OriginalDocumentID>
      <xmpMM:History>
        <rdf:Seq>
          <rdf:li rdf:parseType="Resource">
            <stEvt:softwareAgent>Adobe InDesign CC 2017 (Macintosh)</stEvt:softwareAgent>
            <stEvt:changed>/</stEvt:changed>
            <stEvt:action>converted</stEvt:action>
            <stEvt:when>2016-11-11T16:11:57-05:00</stEvt:when>
            <stEvt:parameters>from application/x-indesign to application/pdf</stEvt:parameters>
          </rdf:li>
        </rdf:Seq>
      </xmpMM:History>
      <xmpMM:InstanceID>uuid:faabd5b1-924a-834b-b999-022fd0608e4f</xmpMM:InstanceID>
      <xmpMM:DocumentID>xmp.id:932ce8eb-b215-4b81-82e8-4e0b8c029f67</xmpMM:DocumentID>
      <xmpMM:DerivedFrom rdf:parseType="Resource">
        <stRef:renditionClass>default</stRef:renditionClass>
        <stRef:documentID>xmp.did:8371e414-957c-4e45-abbf-58c95e32809c</stRef:documentID>
        <stRef:instanceID>xmp.iid:12fcd2dd-3fc4-4ab0-a5ee-85a8b58aa933</stRef:instanceID>
        <stRef:originalDocumentID>xmp.did:8371e414-957c-4e45-abbf-58c95e32809c</stRef:originalDocumentID>
      </xmpMM:DerivedFrom>
      <xmpMM:RenditionClass>proof:pdf</xmpMM:RenditionClass>
      <pdf:Trapped>False</pdf:Trapped>
      <pdf:Producer>Mac OS X 10.12.4 Quartz PDFContext</pdf:Producer>
    </rdf:Description>
  </rdf:RDF>
</x:xmpmeta>
endstream
endobj
5 0 obj
{code}

The metadata stream is declared as Flate-encoded (/Filter /FlateDecode), but it obviously isn't: the XMP is stored as plain text. The file renders fine with PDFBox too; only merging fails.

Is the problem only with this file? If so, open it with Notepad++ and overwrite "/Filter /FlateDecode" with the same number of blanks, so that the byte offsets in the file do not change.

> java.util.zip.DataFormatException: invalid distance too far back
> ----------------------------------------------------------------
>
>                 Key: PDFBOX-4227
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-4227
>             Project: PDFBox
>          Issue Type: Bug
>    Affects Versions: 2.0.9
>            Reporter: Michael Miller
>            Priority: Major
>         Attachments: testCorrupt.pdf
>
>
> I get a zip exception with the attached PDF file, which renders successfully
> in a couple of PDF viewers I tried.
>
> ByteArrayInputStream pdfInputStream = new ByteArrayInputStream(pdfByteArray);
> PDFMergerUtility merger = new PDFMergerUtility();
> merger.addSource(pdfInputStream);
> ByteArrayOutputStream outputStream = new ByteArrayOutputStream();
> merger.setDestinationStream(outputStream);
> merger.mergeDocuments(MemoryUsageSetting.setupMainMemoryOnly());
>
> PDF file attached.
>
> Stack Trace:
> java.io.IOException: java.util.zip.DataFormatException: invalid distance too far back
>     at org.apache.pdfbox.filter.FlateFilter.decode(FlateFilter.java:83)
>     at org.apache.pdfbox.filter.Filter.decode(Filter.java:87)
>     at org.apache.pdfbox.cos.COSInputStream.create(COSInputStream.java:77)
>     at org.apache.pdfbox.cos.COSStream.createInputStream(COSStream.java:175)
>     at org.apache.pdfbox.cos.COSStream.createInputStream(COSStream.java:163)
>     at org.apache.pdfbox.multipdf.PDFMergerUtility.appendDocument(PDFMergerUtility.java:516)
>     at org.apache.pdfbox.multipdf.PDFMergerUtility.mergeDocuments(PDFMergerUtility.java:263)
>     ...
> Caused by: java.util.zip.DataFormatException: invalid distance too far back
>     at java.util.zip.Inflater.inflateBytes(Native Method)
>     at java.util.zip.Inflater.inflate(Inflater.java:259)
>     at java.util.zip.Inflater.inflate(Inflater.java:280)
>     at org.apache.pdfbox.filter.FlateFilter.decompress(FlateFilter.java:108)
>     at org.apache.pdfbox.filter.FlateFilter.decode(FlateFilter.java:74)
>     ... 35 more
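
For anyone who would rather script the workaround from the comment than edit the file by hand, here is a minimal sketch of the same idea in plain Java (no PDFBox calls). The class name MetadataFilterBlanker and the method blankMetadataFilter are made up for illustration. It assumes the metadata object is stored uncompressed and unencrypted in the file, that its dictionary has no nested dictionaries, and that the entries are written exactly as "/Type /Metadata" and "/Filter /FlateDecode", as in the dump above. It only makes sense if the stream really is plain text; a correctly compressed metadata stream would be broken by this.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not part of PDFBox.
class MetadataFilterBlanker {

    private static final String FILTER = "/Filter /FlateDecode";

    /**
     * Returns a copy of the PDF bytes in which the "/Filter /FlateDecode" entry of the
     * first "/Type /Metadata" dictionary is overwritten with spaces. The length of the
     * file is unchanged, so existing byte offsets stay valid.
     * If no matching entry is found, an unmodified copy is returned.
     */
    static byte[] blankMetadataFilter(byte[] pdf) {
        // ISO-8859-1 maps each byte to exactly one char, so string indexes equal byte offsets.
        String text = new String(pdf, StandardCharsets.ISO_8859_1);

        int type = text.indexOf("/Type /Metadata");
        if (type < 0) {
            return pdf.clone();
        }

        // Assumption: the dictionary has no nested "<< >>", so the nearest
        // delimiters around "/Type /Metadata" are its boundaries.
        int dictStart = text.lastIndexOf("<<", type);
        int dictEnd = text.indexOf(">>", type);
        if (dictStart < 0 || dictEnd < 0) {
            return pdf.clone();
        }

        // Only touch a filter entry that lies inside the metadata dictionary;
        // other streams in the file are left alone.
        int filter = text.indexOf(FILTER, dictStart);
        if (filter < 0 || filter > dictEnd) {
            return pdf.clone();
        }

        byte[] patched = pdf.clone();
        for (int i = 0; i < FILTER.length(); i++) {
            patched[filter + i] = (byte) ' ';
        }
        return patched;
    }
}
```

The patched array could then be wrapped in the ByteArrayInputStream that the reporter's snippet passes to merger.addSource(...). Whether the merge then succeeds for testCorrupt.pdf has not been verified here.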