Box/Tiff for Chinese

2011-02-06 Thread devTess
I would like to change the recognition for e.g 10 -20 characters that do not work with the current language data, questions a) Is there a way to un-concatenate the language data for re-use in training? b) When will there be a box/tiff file for chinese? c) For text that has a mixture of chinese

Re: Provide/visualize baseline info?

2011-02-06 Thread Dmitry Silaev
Here are the brief instructions on how to set up the Tesseract interactive debug environment (ScrollView) on Windows: 1. Make sure you have Java Runtime Environment installed 2. Download my home-brewed single archived installation suite from http://www.4shared.com/get/Z4gnbJdP/tess_debug.