Tim Allison created PDFBOX-5431:
-----------------------------------

             Summary: New NPE in xmpbox parser in trunk
                 Key: PDFBOX-5431
                 URL: https://issues.apache.org/jira/browse/PDFBOX-5431
             Project: PDFBox
          Issue Type: Task
            Reporter: Tim Allison
         Attachments: metadata.xml

I noticed a new NPE in one of our test files on Tika when I recently built 
PDFBox's trunk.  I've attached the file.

If I don't set strict parsing to false, the parse works.


{noformat}
        DomXmpParser xmpParser = new DomXmpParser();
        xmpParser.setStrictParsing(false);
        Path p = Paths.get("/home/tallison/Desktop/tmp/META-INF/metadata.xml");
        try (InputStream is = Files.newInputStream(p)) {
            XMPMetadata metadata = xmpParser.parse(is);
            for (XMPSchema schema : metadata.getAllSchemas()) {
                for (AbstractField f : schema.getAllProperties()) {
                    System.out.println(f);
                }
            }
        }
{noformat}

Stack
{noformat}
ava.lang.NullPointerException
        at 
org.apache.xmpbox.xml.DomXmpParser.parseLiDescription(DomXmpParser.java:608)
        at 
org.apache.xmpbox.xml.DomXmpParser.parseLiElement(DomXmpParser.java:529)
        at org.apache.xmpbox.xml.DomXmpParser.manageArray(DomXmpParser.java:487)
        at 
org.apache.xmpbox.xml.DomXmpParser.createProperty(DomXmpParser.java:352)
        at 
org.apache.xmpbox.xml.DomXmpParser.parseChildrenAsProperties(DomXmpParser.java:319)
        at 
org.apache.xmpbox.xml.DomXmpParser.parseDescriptionRoot(DomXmpParser.java:248)
        at org.apache.xmpbox.xml.DomXmpParser.parse(DomXmpParser.java:201)
        at 
org.apache.tika.parser.indesign.IDMLParserTest.testXMP(IDMLParserTest.java:81)
{noformat}



--
This message was sent by Atlassian Jira
(v8.20.7#820007)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org

Reply via email to