[
https://issues.apache.org/jira/browse/PDFBOX-1502?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13674212#comment-13674212
]
deepak commented on PDFBOX-1502:
--------------------------------
Hi Andreas,
Maruan : Thanks for your feedback . I concur with you , PDF if saved as text in
Acrobat does not display fields .
But Andreas can you please confirm whether this is the expected behaviour from
PDF Box and no changes needs to be done.
If there is a change required , is it a significant effort to have this ability
. The reason for this question is we need this ability from PDF Box in some of
our workflows.
Regards
Deepak
> Not Extracting Text from PDF Document
> -------------------------------------
>
> Key: PDFBOX-1502
> URL: https://issues.apache.org/jira/browse/PDFBOX-1502
> Project: PDFBox
> Issue Type: Bug
> Components: Text extraction
> Affects Versions: 0.8.0-incubator, 1.7.1, 1.8.0
> Environment: Mac OS , jdk 1.7
> Reporter: deepak
> Assignee: Andreas Lehmkühler
> Attachments: PDFBOX1502-RenewalAdvice.txt,
> Renewal_Advice_Edited_Extracted_Text.txt, Renewal_Advice_Edited.pdf, Renewal
> Advice .pdf
>
>
> PDDocument document = PDDocument.load(Inputstream);
> PDFTextStripper stripper = new PDFTextStripper();
> stripper.getText(document) is not returning some text content in the
> attached PDF Document . It is just returning the form fields but the values
> are empty . The bug is reproducible both in 1.8.0-Snapshot and 1.7.1
> codebase.
> Please help in resolving the issue
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira