Hello, many thanks for the follow-up! I see, however if the value reading step is out of control (no interceptor to register for handling) I am afraid the value will already be in a form (casted into something else) with no point saving to in preferred encoding, but I will of course check PDDebugger, I had not noticed!
with kind regards Athanasios On Wed, Feb 5, 2020 at 7:06 AM Tilman Hausherr <thaush...@t-online.de> wrote: > Hi, > > There isn't. PDFBox extracts in strings which you can then save in the > encoding you want. > > To see more about the PDF, use PDFDebugger. > > Tilman > > Am 02.02.2020 um 19:01 schrieb Athanasios Viennas: > > Hello! I am looking out for the functionality of encoding in a > > <code>org.apache.pdfbox.util.PDFTextStripperByArea</code> as the > > constructor specification has been removed after release 1.8.13, now > > looking out to use it in release 2.0.18. > > I want to have control over parsing a pdf document in research paper type > > of two-column layout written in Greek. I am unable to inspect the exact > > encoding but anyhow if there is a alternative definition in a different > > configuration class and see how it works but I can't spot the alternate > > class where that has moved to. > > > > with kind regards > > Athanasios Viennas > > > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: users-unsubscr...@pdfbox.apache.org > For additional commands, e-mail: users-h...@pdfbox.apache.org > >