Johannes Wirkkala Westlund created PDFBOX-5405:
--------------------------------------------------

             Summary: "Page tree root must be a dictionary" when attempting to 
parse pdf 
                 Key: PDFBOX-5405
                 URL: https://issues.apache.org/jira/browse/PDFBOX-5405
             Project: PDFBox
          Issue Type: Bug
    Affects Versions: 2.0.25
            Reporter: Johannes Wirkkala Westlund
         Attachments: Grafiska riktlinjer, fordon LRV.pdf

Hi,

I have a PDF file that throws the following error when I try to parse it:
{{
}}
{code:java}
Caused by: java.io.IOException: Page tree root must be a dictionary
    at org.apache.pdfbox.pdfparser.PDFParser.initialParse(PDFParser.java:198)
    at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:226)
    at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1228)
    at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1202)
    at org.apache.tika.parser.pdf.PDFParser.getPDDocument(PDFParser.java:191)
    at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:149)
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:289)
    ... 5 more {code}
{{}}
I have attached the file in question with this issue.[^Grafiska riktlinjer, 
fordon LRV.pdf] 



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org

Reply via email to