Hello,

I would like to announce new version 1.01 of pyTesseractTrainer - successor
of 
tesseractTrainer.py<http://tesseract-ocr.googlecode.com/files/tesseractTrainer.py>
Version
1.00 is identical with tesseractTrainer.py.

Features:

   - visual editor of box file
   - layout of symbol from box file reflect symbols on image
   - possibility to define bold, italic, and underline font
   - deleting, joining, splitting of symbols/boxes
   - easy and exact way of adjusting boxes
   - support for opening different image formats (tiff, png, jpeg, bmp, gif)
   - multi-platform support (tested on Linux 64 bit and Windows XP)

Buxfixes (in 1.01):

   - unicode support
   - opening of tesseract v3.00 box file (but save support only v2.0x box
   file)
   - identify/imagick is not need anymore
   - correct error that block to open file on Windows
   - solved issues regarding training symbols @ and $ (used also to identify
   bold and italic font)
   - workaround for missing Numeric support in PyGTK


Because IFAIK nobody react on Catalin e-mail I offered him to create project
to collect patches and possibly to solve known issues. Because of my low
time resource project is looking still for owner/contributors. Warmly
welcomed are expect for python (multi-platform) GUI (GTK/QT/wx...)
 because performance issues - on Windows XP (2GB memory) script crash or
freezes during opening file with a lot of boxes/symbols (e.g.
eng.arial.g4.tif), on Mandrivalinux 2010.164 bit (6GB memory) it take to
open&display 15 minutes!

BR,

Zd.

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To post to this group, send email to [email protected].
To unsubscribe from this group, send email to 
[email protected].
For more options, visit this group at 
http://groups.google.com/group/tesseract-ocr?hl=en.

Reply via email to