[jira] [Commented] (PDFBOX-1502) Not Extracting Text from PDF Document

deepak (JIRA) Tue, 04 Jun 2013 03:08:11 -0700

    [ 
https://issues.apache.org/jira/browse/PDFBOX-1502?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13674212#comment-13674212
 ]


deepak commented on PDFBOX-1502:
--------------------------------

Hi Andreas,

Maruan : Thanks for your feedback . I concur with you , PDF if saved as text in 
Acrobat does not display fields . 

But Andreas can you please confirm whether this is the expected behaviour from 
PDF Box and no changes needs to be done. 

If there is a change required , is it a significant effort to have this ability 
. The reason for this question is we need this ability from PDF Box in some of 
our workflows.

Regards
Deepak
                
> Not Extracting Text from PDF Document
> -------------------------------------
>
>                 Key: PDFBOX-1502
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1502
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Text extraction
>    Affects Versions: 0.8.0-incubator, 1.7.1, 1.8.0
>         Environment: Mac OS , jdk 1.7
>            Reporter: deepak
>            Assignee: Andreas Lehmkühler
>         Attachments: PDFBOX1502-RenewalAdvice.txt, 
> Renewal_Advice_Edited_Extracted_Text.txt, Renewal_Advice_Edited.pdf, Renewal 
> Advice .pdf
>
>
> PDDocument  document = PDDocument.load(Inputstream);
> PDFTextStripper stripper = new PDFTextStripper();
> stripper.getText(document)   is not returning some text content in the 
> attached PDF Document . It is just returning the form fields but the values 
> are empty .  The bug is reproducible both in 1.8.0-Snapshot and 1.7.1 
> codebase.
> Please help in resolving the issue

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (PDFBOX-1502) Not Extracting Text from PDF Document

Reply via email to