> If you know, where everything you want to highlight is, it is not too > much work. It is a lot work to extract this information.
What do you mean by "where everything is"? Is it enough to know the "word" or do I have to know the exact coordinates and height and with of every word I want to highlight? Did you use PDFBox to extract the text or another (open source) tool? > We are working on that with several people for months ;-). Not very encouraging... > Perhaps you get an idea of the problems from a poster we had on a > workshop: > > http://dx.doi.org/10.1038/npre.2009.3141.1 Thanks for the link. Seems that you really invested a lot of time in this topic. Best Regards, Widuk -- GRATIS für alle GMX-Mitglieder: Die maxdome Movie-FLAT! Jetzt freischalten unter http://portal.gmx.net/de/go/maxdome01