[ 
https://issues.apache.org/jira/browse/PDFBOX-533?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12796828#action_12796828
 ] 

Phil Varner commented on PDFBOX-533:
------------------------------------

Mel, 

I think it should be 

            while(pdfSource.available() > 0 && objIter.hasNext())

instead, so the call to next() returns the correct Integer when next() is 
called later on.

This worked for me on a doc that threw the same exception.

I didn't see a separate JIRA issue for this, I'll gladly file and fix if 
someone can provide a doc that the error occurs on (mine is confidential from a 
customer).

 

> PDFTextStripper.writeCharacters is called no where in the class
> ---------------------------------------------------------------
>
>                 Key: PDFBOX-533
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-533
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Text extraction
>    Affects Versions: 0.8.0-incubator
>            Reporter: Navendu Garg
>         Attachments: TestPDFTextStripperPerf.java
>
>
> It seems writeCharacters method is not called anywhere in the PDFTextStripper 
> class. This makes it impossible for handling character TextPosition as well 
> as Line Separator because processLineSeparator method is no longer there and 
> writeLineSeparator is called when actual writing happens.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to