[
https://issues.apache.org/jira/browse/PDFBOX-5832?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17850713#comment-17850713
]
Tilman Hausherr edited comment on PDFBOX-5832 at 5/30/24 1:25 PM:
------------------------------------------------------------------
I was afraid of that, it means that it's in a compressed object stream. What
you could try is to open the file with PDFDebugger, switch view to "cross
reference table",
!screenshot-2.png!
then look at the elements starting from the bottom. Here's an example, note the
/ObjStm:
!screenshot-1.png!
"First:768" is the offset of the first object. The object stream in my file
starts with number pairs like this: "27 0 28 1176 29 3009 30 3090 31 3147 32
3241 33 3279 34 3433 35 3488 36 3561 37 3629". 27 means object 27 starts at
offset 768+0=768. Object 28 starts at offset 768+1176=1944. So you need to find
an ObjStm object with a 129, save the object stream, go to the offset, and look
what's there. It should be something like this << /A 141 0 R /Next 138 0 R
/Parent 129 0 R /SE ?????? >> I'm wondering what's after SE. Maybe null, maybe
a non existent reference, maybe a syntax error.
was (Author: tilman):
I was afraid of that, it means that it's in a compressed object stream. What
you could try is to open the file with PDFDebugger, switch view to "cross
reference table",
!screenshot-2.png!
then look at the elements starting from the bottom. Here's an example, note the
/ObjStm:
!screenshot-1.png!
"First:768" is the offset of the first object. The object stream in my file
starts with number pairs like this: "27 0 28 1176 29 3009 30 3090 31 3147 32
3241 33 3279 34 3433 35 3488 36 3561 37 3629". 27 means object 27 starts at
offset 768+0=768. Object 28 starts at offset 768+1176=1944. So you need to find
an ObjStm object with a 129, save the object stream, go to the offset, and look
what's there.
> Error when writing a document with OutlineItems containing null SE objects
> --------------------------------------------------------------------------
>
> Key: PDFBOX-5832
> URL: https://issues.apache.org/jira/browse/PDFBOX-5832
> Project: PDFBox
> Issue Type: Bug
> Affects Versions: 3.0.2 PDFBox, 3.0.3 PDFBox
> Reporter: Arthur Renard
> Priority: Major
> Attachments: image-2024-05-30-11-58-21-024.png,
> image-2024-05-30-12-00-33-290.png, image-2024-05-30-12-01-30-708.png,
> image-2024-05-30-14-14-49-237.png, screenshot-1.png, screenshot-2.png
>
>
> Hello,
> I'm reaching out to you because we encountered some errors when loading
> documents after updating PDFBox to v3.0.2.
> I cloned the project in local env and tried with v3.0.3-SNAPSHOT but the same
> error appeared.
> When trying to save my document using the PDDocument save() method, the
> following exception occurs:
>
> {code:java}
> java.io.IOException: Error: Unknown type in object stream:COSObject{2240, 0}
> at
> org.apache.pdfbox.pdfwriter.compress.COSWriterObjectStream.writeObject(COSWriterObjectStream.java:238)
> at
> org.apache.pdfbox.pdfwriter.compress.COSWriterObjectStream.writeCOSDictionary(COSWriterObjectStream.java:341)
> at
> org.apache.pdfbox.pdfwriter.compress.COSWriterObjectStream.writeObject(COSWriterObjectStream.java:230)
> at
> org.apache.pdfbox.pdfwriter.compress.COSWriterObjectStream.writeObjectsToStream(COSWriterObjectStream.java:119)
> at
> org.apache.pdfbox.pdfwriter.COSWriter.doWriteBodyCompressed(COSWriter.java:499)
> at
> org.apache.pdfbox.pdfwriter.COSWriter.visitFromDocument(COSWriter.java:1307)
> {code}
> I can't share the document used for testing because it contains sensitive
> information, but after debugging a bit I found that it contains OutlineItems
> with null SE objects and that is apparently what's causing the error:
> !image-2024-05-30-11-58-21-024.png!
> !image-2024-05-30-12-00-33-290.png!
>
> The document was produced using Adobe Acrobat Pro 2020 20.5 30636
> !image-2024-05-30-12-01-30-708.png!
> Unfortunately I don't have access to this software and I coulnd't recreate a
> similar document to reproduce the issue.
>
> I found a user with a similar issue in your mailing lists :
> [https://www.mail-archive.com/[email protected]/msg13258.html]
>
> Let me know if you need more details regarding this problem.
>
> Also, if you are able to create a test document that would reproduce the
> issue, would you please mind sharing it? It would be of great help.
> Or if you have way to anonymize a document without altering its structure.
>
> Many thanks in advance!
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]