Hi, On Thu, Oct 29, 2009 at 5:36 PM, Roman Klinger <roman.klin...@scai.fraunhofer.de> wrote: > I do not think you can really solve that problem in a nice way. I deal with > such problems with a post processing of the extracted text (e.g. replace ¨a > by ä.
It would be nice to have such heuristics included in PDFBox. BR, Jukka Zitting