> > Did you use PDFBox to extract the
> > text or another (open source) tool?
>
> We are using several different tools -- perhaps I'll write a comparison
> in some years ;-).
"In some years" would be too late for me... *sigh* There is a command-line utility, ExtractText, coming with PDFBox (soon, the website says). If this tool were able to extract not just the text but also its position, I think it could maybe solve the problem.

> It's already more than I expected when I started ;-).

...like always. ;)

Best regards,
Widuk