I created a script, which takes an image file (currently TIF), OCRs it using http://code.google.com/p/tesseract-ocr/ Tesseract and returns the recognized text.
The script takes the image as a parameter and passes it to Tesseract OCR on the filesystem, using the command line; Tesseract OCRs the file and then returns the resulting text. The script then creates a new text file in Plone, in the same folder as the image, with the recognized text. I call the external script from an internal script, which is called from an action. I assume in order to upload this to plone.org it needs to be a product but I don't know how to turn it into a product. Honestly, I just wanted to get it to work so I could OCR documents using Plone and it does now. I was wondering if anyone wants to see the script and help turn it into a product so it is available to the Plone community? Also, if I don't turn it into a product, is there another way I could post it so people could use it and perhaps contribute to it? Thanks, Nathan ---------------------- http://www.abillionbillion.com ABillionBillion.com Document Management for Everyone -- View this message in context: http://www.nabble.com/Tesseract-OCR-in-Plone%21-tp14521687s20094p14521687.html Sent from the Product Developers mailing list archive at Nabble.com. _______________________________________________ Product-Developers mailing list [email protected] http://lists.plone.org/mailman/listinfo/product-developers
