Hello list, I want to search for words in a PDF document and get their positions. It seems that PDFBox ignores text which has been entered into a form field although it's rendered correctly. I can be reproduced easily with the standalone app:
java -jar pdfbox-app-3.0.2.jar export:text -i=Test.pdf java -jar pdfbox-app-3.0.2.jar render -i=Test.pdf The Acrobat both finds and extracts text which have been entered into a form field. In my code I use PDFTextStripper. I haven't found any way to configure the behaviour. Is it a bug or have I overlooked something? For clarification: I don't want to search for the value ('V') but its visual representation ('AP'). Kind regards, Dipl.-Ing. (FH) Paul Grütter Head of Development [Beschreibung: Beschreibung: Beschreibung: signotec_eSig_96dpi_192x44px_cmyk-] signotec GmbH Am Gierath 20b 40885 Ratingen (Germany) Tel.: +49 2102 53575-10 Fax: +49 2102 53575-39 E-Mail: paul.gruet...@signotec.de<mailto:paul.gruet...@signotec.de> URL: www.signotec.com<http://www.signotec.com/> Amtsgericht Düsseldorf: HRB 44307 Geschäftsführung/CEO: Arne Brandes [cid:image002.png@01DA7B7F.F9D1F300]<https://www.facebook.com/signotecgmbh/> [cid:image003.png@01DA7B7F.F9D1F300] <https://www.instagram.com/signotec_gmbh/> [cid:image004.png@01DA7B7F.F9D1F300] <https://www.linkedin.com/company/signotec-gmbh/> [cid:image005.png@01DA7B7F.F9D1F300] <https://www.xing.com/pages/signotecgmbh> [cid:image006.png@01DA7B7F.F9D1F300] <https://www.youtube.com/user/signotec1> [cid:image007.png@01DA7B7F.F9D1F300]<https://en.signotec.com/sustainability>
--------------------------------------------------------------------- To unsubscribe, e-mail: users-unsubscr...@pdfbox.apache.org For additional commands, e-mail: users-h...@pdfbox.apache.org