thanks a lot, that looks like the clean solution!

For type0 fonts, no textposition is created, but I can live with that.

Thanks,
Wouter

On Thu, Mar 30, 2017 at 6:51 PM, Tilman Hausherr <thaush...@t-online.de>
wrote:

> The problem is that some files do this as an obfuscation technique.
>
> What might be detected is fonts that don't have unicode extraction. See in
> LegacyPDFStreamEngine "if (unicode == null)". Make your own or extend it
> and check for TextPosition objects with unicode null. (See
> PrintTextLocations example from the source code download on how to get
> TextPosition objects).
>
> Tilman
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: users-unsubscr...@pdfbox.apache.org
> For additional commands, e-mail: users-h...@pdfbox.apache.org
>
>


-- 
Wouter De Borger, PhD
Co-founder Inmanta
www.inmanta.com
Email: wouter.debor...@inmanta.com

Reply via email to