Miguel,

Also, my text extraction was extremely poor until I set the TESSDATA_PREFIX environment variable to point to the tessdata folder! Until then, Tesseract couldn't even find the dictionary.
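
A quick way to check this setting is sketched below. It is only an illustration, not part of Matterhorn or Tesseract: the helper name find_traineddata is hypothetical, and the script assumes the English data file is called eng.traineddata. Because Tesseract releases differ on whether TESSDATA_PREFIX should name the tessdata folder itself or its parent directory, the sketch looks in both places.

```python
import os
from pathlib import Path
from typing import Optional


def find_traineddata(prefix: Optional[str], lang: str = "eng") -> Optional[Path]:
    """Return the path of <lang>.traineddata reachable from prefix, or None.

    Hypothetical helper for illustration; it only inspects the filesystem
    and never calls Tesseract.
    """
    if not prefix:
        return None
    base = Path(prefix)
    filename = f"{lang}.traineddata"
    # Check both layouts: the prefix is the tessdata folder itself,
    # or the prefix is the parent directory that contains tessdata.
    for candidate in (base / filename, base / "tessdata" / filename):
        if candidate.is_file():
            return candidate
    return None


if __name__ == "__main__":
    prefix = os.environ.get("TESSDATA_PREFIX")
    found = find_traineddata(prefix)
    if found is None:
        print(f"TESSDATA_PREFIX={prefix!r}: no eng.traineddata found")
    else:
        print(f"Language data found at {found}")
```

If the script reports that no file was found, the variable is either unset or points somewhere other than the folder holding the language data.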

http://www.mail-archive.com/[email protected]/msg01852.html

Karen

On 4/3/2012 1:51 PM, Karen Dolan wrote:
Miguel,

Matterhorn trunk (from a couple of weeks ago) was configured to pull down Leptonica 1.66 and Tesseract 3.00. I downloaded Leptonica 1.67 and Tesseract 3.01 directly instead, along with the latest Tesseract English dictionary (Reference: http://code.google.com/p/tesseract-ocr/wiki/ReadMe).

The text extraction is now much better than it was a few months ago.

Good luck!
Karen



On 4/3/2012 11:29 AM, Miguel Del Agua wrote:
Hi,

I just installed version 1.3 and it seems to work correctly, but the OCR
performance is quite poor. I've tried installing a new dictionary as
described in the wiki, but the performance is still bad. So I would like
to know whether it's possible to improve text recognition, either by
changing some OCRopus parameters or by improving the dictionary in
some way.

Thanks in advance.
_______________________________________________
Matterhorn mailing list
[email protected]
http://lists.opencastproject.org/mailman/listinfo/matterhorn


To unsubscribe please email
[email protected]
_______________________________________________


