[ https://issues.apache.org/jira/browse/PDFBOX-533?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12796828#action_12796828 ]
Phil Varner commented on PDFBOX-533:
------------------------------------
Mel,
I think the loop condition should instead be
    while (pdfSource.available() > 0 && objIter.hasNext())
so that the later call to next() returns the correct Integer.
This worked for me on a doc that threw the same exception.
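
For anyone following along, here is a minimal, self-contained sketch of why
the extra hasNext() check matters. It is not the actual PDFBox parser code:
the class GuardedIteratorLoop, the method readObjects, and the one-byte-per-
object "parsing" are hypothetical stand-ins. Only the names pdfSource and
objIter and the loop condition come from the comment above.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;

public class GuardedIteratorLoop {

    // Hypothetical stand-in for the parser loop: pairs each object number
    // with one byte of input and records the numbers that were consumed.
    static List<Integer> readObjects(InputStream pdfSource, Iterator<Integer> objIter)
            throws IOException {
        List<Integer> seen = new ArrayList<>();
        // Both conditions must hold: there is data left in the stream AND
        // there is an object number left to pair it with.
        while (pdfSource.available() > 0 && objIter.hasNext()) {
            Integer objNumber = objIter.next(); // safe: guarded by hasNext()
            pdfSource.read();                   // stand-in for parsing one object
            seen.add(objNumber);
        }
        return seen;
    }

    public static void main(String[] args) throws IOException {
        // Five bytes of "stream" but only three object numbers.
        InputStream pdfSource = new ByteArrayInputStream(new byte[] {1, 2, 3, 4, 5});
        Iterator<Integer> objIter = Arrays.asList(10, 11, 12).iterator();
        System.out.println(readObjects(pdfSource, objIter)); // prints [10, 11, 12]
    }
}
```

In this sketch, dropping the hasNext() check would let the loop call next() a
fourth time while bytes remain, which throws NoSuchElementException. Whether
that is the same exception seen in the real parser is an assumption on my part.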
I didn't see a separate JIRA issue for this. I'll gladly file one and fix it
if someone can provide a document on which the error occurs (mine is
confidential, from a customer).
> PDFTextStripper.writeCharacters is called no where in the class
> ---------------------------------------------------------------
>
> Key: PDFBOX-533
> URL: https://issues.apache.org/jira/browse/PDFBOX-533
> Project: PDFBox
> Issue Type: Bug
> Components: Text extraction
> Affects Versions: 0.8.0-incubator
> Reporter: Navendu Garg
> Attachments: TestPDFTextStripperPerf.java
>
>
> It seems writeCharacters method is not called anywhere in the PDFTextStripper
> class. This makes it impossible for handling character TextPosition as well
> as Line Separator because processLineSeparator method is no longer there and
> writeLineSeparator is called when actual writing happens.