Hi,
Am 11.09.2011 18:18, schrieb Sent:
Hello,
I don't see a way to conclude from the extracted text at which page a certain
text/keyword is located. I have written a script that will use -startPage and
-endPage with the same number so that I get only one page. However, doing this
100 times in a row for a 100 page document is very slow.
Would it be possible to add an option that will add a page number indicator,
e.g.<PAGENUM=n> at the beginning of each page during text extraction?
Thank you for considering this.
Hmm, probably it'll help if you define your own page separator using
PDFTextStripper#getPageSeparator. Ok, you can't include the pagenumber but as a
starter ...
Cheers
Ralf
BR
Andreas Lehmkühler