Tilman Hausherr created PDFBOX-2643: ---------------------------------------
Summary: "XMP type violation in stRef:instanceID" not reported by preflight Key: PDFBOX-2643 URL: https://issues.apache.org/jira/browse/PDFBOX-2643 Project: PDFBox Issue Type: Sub-task Components: Preflight Affects Versions: 2.0.0 Reporter: Tilman Hausherr In the Bavaria test suite, PDFLib claims that the attached file is not a valid PDF/A-1b file, because "Property stRef:instanceID in document XMP requires scheme identifier" or "XMP type violation in stRef:instanceID" (They make both claims in Bavaria.xml). {code} <rdf:Description rdf:about="" xmlns:xmpMM="http://ns.adobe.com/xap/1.0/mm/" xmlns:stRef="http://ns.adobe.com/xap/1.0/sType/ResourceRef#"> <xmpMM:InstanceID>uuid:b429d411-e628-45ca-b932-d2c77fbe6cd3</xmpMM:InstanceID> <xmpMM:DocumentID>adobe:docid:indd:db084a4d-dbb2-11dc-ac34-beb3cc4028ec</xmpMM:DocumentID> <xmpMM:RenditionClass>proof:pdf</xmpMM:RenditionClass> <xmpMM:DerivedFrom rdf:parseType="Resource"> <stRef:instanceID>6544a661-c065-11dc-854c-dd4f35453e8b</stRef:instanceID> <stRef:documentID>adobe:docid:indd:fa7c6589-9f4a-11dc-9641-af983df728d7</stRef:documentID> </xmpMM:DerivedFrom> </rdf:Description> {code} PDF-Tools considers the file to be correct. But according to http://www.pdflib.com/fileadmin/pdflib/pdf/pdfa/2009-05-04-Bavaria-report-on-PDFA-validation-accuracy.pdf they don't raise the correct alarm for XMP violations. The PDFLib xmp checker also considers the XMP to be correct. "6544a661-c065-11dc-854c-dd4f35453e8b" does not look like a valid URI to me although the regex mentioned at http://tools.ietf.org/html/rfc3986#appendix-B thinks it is. [~msahyoun]what do you get for that file? The Bavaria Testsuite is already 5 years old, so maybe Adobe/Callas have improved their product. (Another unreported error for that file is "xapGImg:height for xmp:Thumbnails in document XMP does not match the actual base64-encoded image data") -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org