thanks a lot, that looks like the clean solution! For type0 fonts, no textposition is created, but I can live with that.
Thanks, Wouter On Thu, Mar 30, 2017 at 6:51 PM, Tilman Hausherr <thaush...@t-online.de> wrote: > The problem is that some files do this as an obfuscation technique. > > What might be detected is fonts that don't have unicode extraction. See in > LegacyPDFStreamEngine "if (unicode == null)". Make your own or extend it > and check for TextPosition objects with unicode null. (See > PrintTextLocations example from the source code download on how to get > TextPosition objects). > > Tilman > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: users-unsubscr...@pdfbox.apache.org > For additional commands, e-mail: users-h...@pdfbox.apache.org > > -- Wouter De Borger, PhD Co-founder Inmanta www.inmanta.com Email: wouter.debor...@inmanta.com