[
https://issues.apache.org/jira/browse/PDFBOX-3440?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15419833#comment-15419833
]
Svetlozar Argirov commented on PDFBOX-3440:
-------------------------------------------
Ok, so I don't know why is it important that xmpbox works only with namespaces
it knows.
To me it seems that it should just ignore and keep the unknown namespaces as
they are.
Here is one case https://github.com/zaro/pdf-metadata-editor/issues/4 , that
Calibre adds some metadata which cannot be parsed by xmpbox, and probably
there are plenty of others.
> DomXmpParser fails to parse xmp metadata with PRISM data in it
> --------------------------------------------------------------
>
> Key: PDFBOX-3440
> URL: https://issues.apache.org/jira/browse/PDFBOX-3440
> Project: PDFBox
> Issue Type: New Feature
> Components: XmpBox
> Affects Versions: 2.0.2
> Reporter: Svetlozar Argirov
> Attachments: TestPrismNamespace.java
>
>
> If the rdf stream in the document contains metadata from the PRISM namespace
> (http://www.prismstandard.org/specifications/2.0/PRISM_prism_namespace_2.0.pdf)
> , the parser fails with error:
> org.apache.xmpbox.xml.XmpParsingException: Cannot find a definition for the
> namespace http://prismstandard.org/namespaces/basic/2.0/
> at
> org.apache.xmpbox.xml.DomXmpParser.checkPropertyDefinition(DomXmpParser.java:859)
> at
> org.apache.xmpbox.xml.DomXmpParser.parseChildrenAsProperties(DomXmpParser.java:295)
> at
> org.apache.xmpbox.xml.DomXmpParser.parseDescriptionRoot(DomXmpParser.java:239)
> at org.apache.xmpbox.xml.DomXmpParser.parse(DomXmpParser.java:203)
> Jempbox used to parse this fine.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]