PDFTextStripper: allow access to currentPageNo variable
-------------------------------------------------------

                 Key: PDFBOX-726
                 URL: https://issues.apache.org/jira/browse/PDFBOX-726
             Project: PDFBox
          Issue Type: Improvement
          Components: Text extraction
    Affects Versions: 1.1.0
            Reporter: Ryan Nideffer
             Fix For: 1.2.0


I've extended org.apache.pdfbox.util.PDFTextStripper and I'm using it to 
perform a 2-pass extraction over a document. However, the second pass doesn't 
happen because I am unable to alter the variable currentPageNo, which tracks 
the current page number in the PDF document. The variable is declared private, 
and only a get method is provided.

The only place currentPageNo is reset to 0 is in 'writePage(PDDocument, 
OutputStream)', which I am overriding and therefore not calling.

2 possible resolutions:
- make currentPageNo protected instead of private (preferred)
- add setCurrentPageNo method
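
To illustrate the second option, here is a minimal sketch. It does not use 
the real PDFTextStripper: PageStripper, TwoPassStripper, processPages, 
processPage and extractTwice are hypothetical stand-ins, and the page-range 
check is only my assumption about why the second pass is skipped. The one 
thing taken from this report is a private currentPageNo with a getter, plus 
the proposed setter.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for the stripper base class; NOT the PDFBox source.
class PageStripper {
    // Private counter with a read-only getter, as described in this issue.
    private int currentPageNo = 0;

    public int getCurrentPageNo() {
        return currentPageNo;
    }

    // Proposed addition (option 2): lets a subclass reset the counter.
    protected void setCurrentPageNo(int pageNo) {
        currentPageNo = pageNo;
    }

    // Assumed behaviour: the counter only ever counts up, and pages whose
    // number lies beyond the document's page count are skipped.
    protected void processPages(int pageCount) {
        for (int i = 0; i < pageCount; i++) {
            currentPageNo++;
            if (currentPageNo <= pageCount) {
                processPage(currentPageNo);
            }
        }
    }

    // Hook for subclasses; does nothing by default.
    protected void processPage(int pageNo) {
    }
}

// Hypothetical subclass that walks the same document twice.
class TwoPassStripper extends PageStripper {
    final List<String> visited = new ArrayList<>();
    private int pass = 1;

    @Override
    protected void processPage(int pageNo) {
        visited.add("pass " + pass + ", page " + pageNo);
    }

    void extractTwice(int pageCount) {
        processPages(pageCount);  // first pass
        pass = 2;
        setCurrentPageNo(0);      // without this reset, every page is skipped
        processPages(pageCount);  // second pass
    }
}
```

With the first option (a protected field), the subclass would instead 
assign currentPageNo = 0 directly and no setter would be needed.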

Thank you,
Ryan
