After some discussion at the Plone documentation list the document was
published as a How-To, here it is:

http://plone.org/documentation/how-to/ocr-in-plone-using-tesseract-ocr

If anyone can help make this into a product, or improve the scripts that
would be very helpful.

Thanks,
Nathan
_____________
ABillionBillion.com
Document Management for Everyone




duffyd wrote:
> 
> Hi Nathan,
> 
> On Thu, 2007-12-27 at 19:06 -0800, NA wrote:
> 
>> I created a script, which takes an image file (currently TIF), OCRs it
>> using 
>> http://code.google.com/p/tesseract-ocr/ Tesseract  and returns the
>> recognized text.
>> 
>> The script takes the image as a parameter and passes it to Tesseract OCR
>> on
>> the filesystem, using the command line; Tesseract OCRs the file and then
>> returns the resulting text. The script then creates a new text file in
>> Plone, in the same folder as the image, with the recognized text. I call
>> the
>> external script from an internal script, which is called from an action.
>> 
>> I assume in order to upload this to plone.org it needs to be a product
>> but I
>> don't know how to turn it into a product. Honestly, I just wanted to get
>> it
>> to work so I could OCR documents using Plone and it does now. 
>> 
>> I was wondering if anyone wants to see the script and help turn it into a
>> product so it is available to the Plone community?
>> 
>> Also, if I don't turn it into a product, is there another way I could
>> post
>> it so people could use it and perhaps contribute to it?
> 
> 
> You could create a tutorial[1] on plone.org and upload the scripts as
> attachments and/or post the code into the tutorial - that's a great way
> to contribute!
> 
> Thanks,
> Tim
> 
> [1] http://plone.org/documentation
> 
> 
>> 
>> Thanks,
>> Nathan
>> ----------------------
>> http://www.abillionbillion.com ABillionBillion.com 
>> Document Management for Everyone
> 
> _______________________________________________
> Product-Developers mailing list
> [email protected]
> http://lists.plone.org/mailman/listinfo/product-developers
> 
> 

-- 
View this message in context: 
http://www.nabble.com/Tesseract-OCR-in-Plone%21-tp14521687s20094p14770351.html
Sent from the Product Developers mailing list archive at Nabble.com.


_______________________________________________
Product-Developers mailing list
[email protected]
http://lists.plone.org/mailman/listinfo/product-developers

Reply via email to