Am 10.05.2018 um 16:26 schrieb Ankit Inkollu:
Hi All,
*Scenario:*
I need to verify if the check-box for a certain field in a non-acro form
PDF is ticked or not.
*Options tried:*
1. I tried to search for any class in PDFBOX which points to the check-box
but could not find any.
There isn't if it isn't acroform nor xfa. A box is just a box, i.e. a
shape somewhere. (Unless the character for a checked box is used)
2. Tried using the co-ordinates of the check-box and create an image and
then compare it against an already stored image of a check-box but this is
quite cumbersome and fails for few PDFs.
Is there a way in PDFBox which can implement the above mentioned scenario.
If this does not work out, is there an OCR API in JAVA which will help.
Tesseract has a java interface. But not PDFBox. Tika has an OCR option
and it will use tesseract.
Tilman
Do let me know if any of you have faced such a situation.
Thanks
Ankit
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]