[jira] [Created] (PDFBOX-5405) "Page tree root must be a dictionary" when attempting to parse pdf

Johannes Wirkkala Westlund (Jira) Wed, 30 Mar 2022 07:16:06 -0700

Johannes Wirkkala Westlund created PDFBOX-5405:
--------------------------------------------------


             Summary: "Page tree root must be a dictionary" when attempting to 
parse pdf 
                 Key: PDFBOX-5405
                 URL: https://issues.apache.org/jira/browse/PDFBOX-5405
             Project: PDFBox
          Issue Type: Bug
    Affects Versions: 2.0.25
            Reporter: Johannes Wirkkala Westlund
         Attachments: Grafiska riktlinjer, fordon LRV.pdf

Hi,

I have a PDF file that throws the following error when I try to parse it:
{{
}}
{code:java}
Caused by: java.io.IOException: Page tree root must be a dictionary
    at org.apache.pdfbox.pdfparser.PDFParser.initialParse(PDFParser.java:198)
    at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:226)
    at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1228)
    at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1202)
    at org.apache.tika.parser.pdf.PDFParser.getPDDocument(PDFParser.java:191)
    at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:149)
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:289)
    ... 5 more {code}
{{}}
I have attached the file in question with this issue.[^Grafiska riktlinjer, 
fordon LRV.pdf] 



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Created] (PDFBOX-5405) "Page tree root must be a dictionary" when attempting to parse pdf

Reply via email to