Arthur Renard created PDFBOX-5832:
-------------------------------------

             Summary: Error when loading a document with OutlineItems 
containing null SE properties
                 Key: PDFBOX-5832
                 URL: https://issues.apache.org/jira/browse/PDFBOX-5832
             Project: PDFBox
          Issue Type: Bug
    Affects Versions: 3.0.2 PDFBox, 3.0.3 PDFBox
            Reporter: Arthur Renard
         Attachments: image-2024-05-30-11-58-21-024.png, 
image-2024-05-30-12-00-33-290.png, image-2024-05-30-12-01-30-708.png

Hello,

I'm reaching out to you because we encountered some errors when loading 
documents after updating PDFBox to v3.0.2.

I cloned the project in local env and tried with v3.0.3-SNAPSHOT but the same 
error appeared.

When trying to load my document using the 
Loader.loadPDF(file) method, the following exception occured: 
 
{code:java}
java.io.IOException: Error: Unknown type in object stream:COSObject{2240, 0}
        at 
org.apache.pdfbox.pdfwriter.compress.COSWriterObjectStream.writeObject(COSWriterObjectStream.java:238)
       at 
org.apache.pdfbox.pdfwriter.compress.COSWriterObjectStream.writeCOSDictionary(COSWriterObjectStream.java:341)
        at 
org.apache.pdfbox.pdfwriter.compress.COSWriterObjectStream.writeObject(COSWriterObjectStream.java:230)
       at 
org.apache.pdfbox.pdfwriter.compress.COSWriterObjectStream.writeObjectsToStream(COSWriterObjectStream.java:119)
      at 
org.apache.pdfbox.pdfwriter.COSWriter.doWriteBodyCompressed(COSWriter.java:499) 
     at 
org.apache.pdfbox.pdfwriter.COSWriter.visitFromDocument(COSWriter.java:1307) 
{code}
I can't share the document used for testing because it contains sensitive 
information, but after debugging a bit I found that it contains OutlineItems 
with null SE objects and that is apparently what's causing the error: 
!image-2024-05-30-11-58-21-024.png!
!image-2024-05-30-12-00-33-290.png!
 
The document was produced using Adobe Acrobat Pro 2020 20.5 30636
!image-2024-05-30-12-01-30-708.png!
Unfortunately I don't have access to this software and I coulnd't recreate a 
similar document to reproduce the issue.
 
I found a user with a similar issue in your mailing lists : 
[https://www.mail-archive.com/users@pdfbox.apache.org/msg13258.html]
 
Let me know if you need more details regarding this problem.
 
Also, if you are able to create a test document that would reproduce the issue, 
would you please mind sharing it? It would be of great help.
Or if you have way to anonymize a document without altering its structure.
 
Many thanks in advance!



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org

Reply via email to