Hi,

On 30.01.25 at 01:02, NH Rao wrote:
Greetings,

PDFBox version: 3.0.3

I am trying to get the text of a certain page. I find the page of interest
by looking up an outline item and determining which page corresponds to my
keyword in the outline.
If you are starting with an outline item, you may want to have a look at

org.apache.pdfbox.text.PDFTextStripper.setStartBookmark(PDOutlineItem)
org.apache.pdfbox.text.PDFTextStripper.setEndBookmark(PDOutlineItem)

to extract the page content you are looking for.
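
A minimal sketch of how the two bookmark setters might be used with PDFBox 3.x. The class name BookmarkText, the helper findItem and the "title contains keyword" matching are assumptions made for illustration, not something from the original question. As far as I understand the stripper, setting the same item as start and end bookmark restricts extraction to the page that item points to.

```java
import java.io.File;
import java.io.IOException;

import org.apache.pdfbox.Loader;
import org.apache.pdfbox.pdmodel.PDDocument;
import org.apache.pdfbox.pdmodel.interactive.documentnavigation.outline.PDDocumentOutline;
import org.apache.pdfbox.pdmodel.interactive.documentnavigation.outline.PDOutlineItem;
import org.apache.pdfbox.pdmodel.interactive.documentnavigation.outline.PDOutlineNode;
import org.apache.pdfbox.text.PDFTextStripper;

public class BookmarkText {

    // Hypothetical helper: depth-first search for the first outline item
    // whose title contains the keyword. Returns null if nothing matches.
    static PDOutlineItem findItem(PDOutlineNode node, String keyword) {
        for (PDOutlineItem child : node.children()) {
            String title = child.getTitle();
            if (title != null && title.contains(keyword)) {
                return child;
            }
            PDOutlineItem nested = findItem(child, keyword);
            if (nested != null) {
                return nested;
            }
        }
        return null;
    }

    // Extracts the text between the start and end bookmark. Both are set to
    // the same item here, so only that item's destination page is expected.
    static String extract(PDDocument doc, String keyword) throws IOException {
        PDDocumentOutline outline = doc.getDocumentCatalog().getDocumentOutline();
        if (outline == null) {
            return null; // document has no outline
        }
        PDOutlineItem item = findItem(outline, keyword);
        if (item == null) {
            return null; // keyword not found in any outline title
        }
        PDFTextStripper stripper = new PDFTextStripper();
        stripper.setStartBookmark(item);
        stripper.setEndBookmark(item);
        return stripper.getText(doc);
    }

    public static void main(String[] args) throws IOException {
        // args[0]: path to the PDF, args[1]: keyword to look for in the outline
        try (PDDocument doc = Loader.loadPDF(new File(args[0]))) {
            System.out.println(extract(doc, args[1]));
        }
    }
}
```

To cover a whole section rather than a single page, the end bookmark could instead be set to the outline item that follows the matched one.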

Andreas


As far as I could see, the text stripper class allows limiting pages by page
number, but I don't know what my page number is. I noticed the processPage
method and tried calling it directly. It threw an NPE because the output
stream was not set. I then tried subclassing the class and setting the output
stream.

This seems to be working, but I wanted to confirm that this is the right way
to continue.
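
For the page-number route, a rough sketch of how the 1-based page number might be derived from an outline item, so that the regular setStartPage/setEndPage limits can be used without calling processPage directly. The class OutlinePageText and both method names are hypothetical. It assumes the item resolves to a page through PDOutlineItem.findDestinationPage.

```java
import java.io.IOException;

import org.apache.pdfbox.pdmodel.PDDocument;
import org.apache.pdfbox.pdmodel.PDPage;
import org.apache.pdfbox.pdmodel.interactive.documentnavigation.outline.PDOutlineItem;
import org.apache.pdfbox.text.PDFTextStripper;

public class OutlinePageText {

    // Returns the 1-based page number the outline item points to,
    // or -1 if the item has no resolvable destination page.
    static int pageNumberOf(PDDocument doc, PDOutlineItem item) throws IOException {
        PDPage page = item.findDestinationPage(doc);
        if (page == null) {
            return -1;
        }
        int index = doc.getPages().indexOf(page); // 0-based, -1 if not in the tree
        return index < 0 ? -1 : index + 1;
    }

    // Extracts the text of a single page; PDFTextStripper page limits are 1-based.
    static String textOfPage(PDDocument doc, int pageNumber) throws IOException {
        PDFTextStripper stripper = new PDFTextStripper();
        stripper.setStartPage(pageNumber);
        stripper.setEndPage(pageNumber);
        return stripper.getText(doc);
    }
}
```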

Regards,

Niranjan


