[jira] [Updated] (PDFBOX-1792) Different metadata extracted with NonSequentialPDFParser vs classic parser on some documents

Tim Allison (JIRA) Mon, 09 Dec 2013 20:06:36 -0800

     [ 
https://issues.apache.org/jira/browse/PDFBOX-1792?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Tim Allison updated PDFBOX-1792:
--------------------------------

    Description: The traditional parser is able to extract metadata from a test 
document from TIKA-738.  The NonSequentialPDFParser is not able to extract 
metadata from that file.  Another file from the Tika test suite has metadata 
that can be extracted by the NonSequentialPDFParser but not by classic.   (was: 
The traditional parser is able to extract metadata from the Annotation test 
document from TIKA-738.  The NonSequentialPDFParser is not able to extract 
metadata.)
        Summary: Different metadata extracted with NonSequentialPDFParser vs 
classic parser on some documents  (was: Metadata not completely extracted with 
NonSequentialPDFParser on some documents)

> Different metadata extracted with NonSequentialPDFParser vs classic parser on 
> some documents
> --------------------------------------------------------------------------------------------
>
>                 Key: PDFBOX-1792
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1792
>             Project: PDFBox
>          Issue Type: Bug
>          Components: PDModel
>    Affects Versions: 1.8.3
>            Reporter: Tim Allison
>            Priority: Minor
>         Attachments: PDFBOX-1792.tar.gz
>
>
> The traditional parser is able to extract metadata from a test document from 
> TIKA-738.  The NonSequentialPDFParser is not able to extract metadata from 
> that file.  Another file from the Tika test suite has metadata that can be 
> extracted by the NonSequentialPDFParser but not by classic. 



--
This message was sent by Atlassian JIRA
(v6.1.4#6159)

[jira] [Updated] (PDFBOX-1792) Different metadata extracted with NonSequentialPDFParser vs classic parser on some documents

Reply via email to