[
https://issues.apache.org/jira/browse/PDFBOX-1066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Andreas Lehmkühler closed PDFBOX-1066.
--------------------------------------
Resolution: Not a Problem
Assignee: Andreas Lehmkühler
PDFs aren't organized in lines. So, if you want to read a pdf line by line you
have to extract the whole text first. It should be easy to process that result
line by line without PDFBox.
> There is no functionlaity of reading the text line by line with its input
> field
> -------------------------------------------------------------------------------
>
> Key: PDFBOX-1066
> URL: https://issues.apache.org/jira/browse/PDFBOX-1066
> Project: PDFBox
> Issue Type: Bug
> Components: Text extraction
> Affects Versions: 0.7.3
> Environment: Windows
> Reporter: Nishant
> Assignee: Andreas Lehmkühler
> Labels: patch
> Original Estimate: 168h
> Remaining Estimate: 168h
>
> I am trying to read the PDF texts along with its input type like
> textfield/checkboxes. What i found is TextStripper is pasing the whole
> document and retuning the string in getText(). And using Acroform.getfields i
> am able ot get all fields.
> But I have perticuler requierment of reading the texts and its input type. Do
> we have any class/method which can resolve this issue.
> Its very urgent.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)