[ https://issues.apache.org/jira/browse/PDFBOX-764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Neil McErlean updated PDFBOX-764:
---------------------------------
Attachment: getMetadataKeys.patch
This patch adds a PDDocumentInformation.getMetadataKeys() method as described
above.
I didn't see any test code for testing metadata extraction, so I've added a new
test class for that too. It tests some standard metadata entries and some
custom ones.
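
For anyone reading without the patch to hand, here is a rough, stand-alone
sketch of the idea. It is not the attached patch and does not use PDFBox:
NameSketch and InfoSketch are hypothetical stand-ins for COSName and
PDDocumentInformation, and the sorted Set<String> return type is an assumption
made for illustration. It only shows the difference between walking the
COS-level key set yourself and asking the PD-level object for plain string keys.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Stand-alone sketch; none of these types are the real PDFBox classes.
class MetadataKeysSketch {

    // Hypothetical stand-in for COSName: a typed wrapper around a key string.
    static final class NameSketch {
        private final String name;
        NameSketch(String name) { this.name = name; }
        String getName() { return name; }
        @Override public boolean equals(Object o) {
            return o instanceof NameSketch && ((NameSketch) o).name.equals(name);
        }
        @Override public int hashCode() { return name.hashCode(); }
    }

    // Hypothetical stand-in for PDDocumentInformation.
    static final class InfoSketch {
        // Stand-in for the lower-level dictionary backing the info object.
        private final Map<NameSketch, String> dictionary = new LinkedHashMap<>();

        void setValue(String key, String value) {
            dictionary.put(new NameSketch(key), value);
        }

        // Exposes the low-level model, as callers need today.
        Map<NameSketch, String> getDictionary() { return dictionary; }

        // The proposed convenience: keys as plain strings, so callers never
        // touch the name wrapper type. Sorted order is an assumption.
        Set<String> getMetadataKeys() {
            Set<String> keys = new TreeSet<>();
            for (NameSketch key : dictionary.keySet()) {
                keys.add(key.getName());
            }
            return keys;
        }
    }

    public static void main(String[] args) {
        InfoSketch info = new InfoSketch();
        info.setValue("Title", "Example");
        info.setValue("Author", "Someone");
        info.setValue("CustomKey", "custom value");

        // Workaround: the caller unwraps each low-level name itself.
        for (NameSketch key : info.getDictionary().keySet()) {
            System.out.println(key.getName());
        }

        // With the convenience method: one call, plain strings.
        System.out.println(info.getMetadataKeys()); // prints [Author, CustomKey, Title]
    }
}
```

The point of the second call is that client code only sees String keys; the
wrapper type stays an implementation detail of the lower-level model.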
> Access to metadata keys in the PD model
> ---------------------------------------
>
> Key: PDFBOX-764
> URL: https://issues.apache.org/jira/browse/PDFBOX-764
> Project: PDFBox
> Issue Type: Improvement
> Components: PDModel
> Affects Versions: 1.2.0
> Environment: All
> Reporter: Neil McErlean
> Priority: Minor
> Fix For: 1.3.0
>
> Attachments: getMetadataKeys.patch
>
> Original Estimate: 2h
> Remaining Estimate: 2h
>
> As of 1.2.0, I don't think it's possible to get a document's metadata keys
> directly from the PD model.
> The keys can be obtained by calling
> PDDocumentInformation.getDictionary().keySet() and then calling getName() on
> the COSName objects returned.
> But that means having to know about the COSDictionary and COSName objects,
> i.e. the lower-level COS model.
> It would be nice to be able to retrieve the metadata keys directly from the
> PD model.
> I'm about to add a patch that does this. The patch will also add a new test
> class.