Hello,

I would like to announce the new version 1.01 of pyTesseractTrainer, the successor of tesseractTrainer.py <http://tesseract-ocr.googlecode.com/files/tesseractTrainer.py>. Version 1.00 is identical to tesseractTrainer.py.

Features:
- visual editor for box files
- the layout of symbols from the box file reflects the symbols on the image
- possibility to mark a font as bold, italic, or underlined
- deleting, joining, and splitting of symbols/boxes
- easy and exact way of adjusting boxes
- support for opening different image formats (tiff, png, jpeg, bmp, gif)
- multi-platform support (tested on Linux 64 bit and Windows XP)

Bugfixes (in 1.01):
- unicode support
- opening of tesseract v3.00 box files (but saving supports only the v2.0x box file format)
- identify/imagick is no longer needed
- fixed an error that blocked opening files on Windows
- solved issues with training the symbols @ and $ (which are also used to identify bold and italic fonts)
- workaround for missing Numeric support in PyGTK

Because AFAIK nobody reacted to Catalin's e-mail, I offered to create a project to collect patches and possibly to solve known issues. Because I have little time, the project is still looking for an owner/contributors.

Experts in Python (multi-platform) GUI programming (GTK/QT/wx...) are warmly welcome because of performance issues: on Windows XP (2GB memory) the script crashes or freezes when opening a file with a lot of boxes/symbols (e.g. eng.arial.g4.tif), and on Mandriva Linux 2010.1 64 bit (6GB memory) it takes 15 minutes to open and display it!

BR,
Zd.

