[ 
https://issues.apache.org/jira/browse/PDFBOX-764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Neil McErlean updated PDFBOX-764:
---------------------------------

    Attachment: getMetadataKeys.patch

This patch adds a PDDocumentInformation.getMetadataKeys() method as described 
above.

I didn't see any test code for testing metadata extraction, so I've added a new 
test class for that too. It tests some standard metadata entries and some 
custom ones.

> Access to metadata keys in the PD model
> ---------------------------------------
>
>                 Key: PDFBOX-764
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-764
>             Project: PDFBox
>          Issue Type: Improvement
>          Components: PDModel
>    Affects Versions: 1.2.0
>         Environment: All
>            Reporter: Neil McErlean
>            Priority: Minor
>             Fix For: 1.3.0
>
>         Attachments: getMetadataKeys.patch
>
>   Original Estimate: 2h
>  Remaining Estimate: 2h
>
> Currently 1.2.0 I don't think it's possible to get a document's metadata keys 
> from the PD model.
> It is possible to get the metadata keys by calling 
> PDDocumentInformation.getDictionary().keySet() and calling getName() on the 
> COSName objects returned.
> But that means having to know about the COSDictionary and COSName objects 
> i.e. the lower-level COS model.
> It would be nice to be able to retrieve the metadata keys directly from the 
> PD model.
> I'm about to add a patch that does this. The patch will also add a new test 
> class.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to