So, can we set MAX_NUM_CONFIGS to any larger number "n" to get Tesseract trained on "n" fonts? Is "classifier gets slow" the only trade-off in that case?
On Wednesday, 8 July 2009 00:22:50 UTC+5:30, Ray Smith wrote:
> The 32 font limit (MAX_NUM_CONFIGS) was a hardware limit. (Long story) The
> code that reads the inttemp file in 2.04 and below is specific to the value
> of MAX_NUM_CONFIGS so you can increase it as long as you retrain yourself.
> With 3.00, the data file reader is able to read files with a different
> value of MAX_NUM_CONFIGS, and the default is increased to 64, BUT it slows
> down the classifier, so it is a trade-off.
>
> The g4 in the language sources indicates that the tif files are group 4
> compressed, and therefore not readable by tesseract without libtiff.
>
> Ray.
>
> On Mon, Jul 6, 2009 at 11:22 PM, Alcareru wrote:
>> One more thing. Why the eng language sources (boxtiff-2.01.eng.tar.gz)
>> have g4 in the image filenames? Like this: "eng.arial.g4.tif".
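
To picture why a data file would be tied to the value of MAX_NUM_CONFIGS, and why a larger value costs something, here is a minimal sketch. It assumes, purely for illustration and not as a description of Tesseract's actual source, that each prototype stores one bit per font config, packed into 32-bit words. The names kBitsPerWord, WordsPerConfigVec and ProtoRecord are hypothetical.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>

// Hypothetical sketch, NOT Tesseract's real code: one bit per font config,
// packed into 32-bit words.
constexpr std::size_t kBitsPerWord = 32;

// Number of 32-bit words needed to hold max_num_configs bits (rounded up).
constexpr std::size_t WordsPerConfigVec(std::size_t max_num_configs) {
  return (max_num_configs + kBitsPerWord - 1) / kBitsPerWord;
}

// A made-up per-prototype record whose size depends on the config limit.
template <std::size_t MaxNumConfigs>
struct ProtoRecord {
  std::array<std::uint32_t, WordsPerConfigVec(MaxNumConfigs)> configs{};

  // Mark this prototype as used by the given font config.
  void SetConfig(std::size_t font) {
    assert(font < MaxNumConfigs);
    configs[font / kBitsPerWord] |= std::uint32_t{1} << (font % kBitsPerWord);
  }

  // Report whether the given font config uses this prototype.
  bool HasConfig(std::size_t font) const {
    assert(font < MaxNumConfigs);
    return ((configs[font / kBitsPerWord] >> (font % kBitsPerWord)) & 1u) != 0;
  }
};

// A limit of 32 fits in one word; 64 needs two, so the record layout changes.
static_assert(WordsPerConfigVec(32) == 1, "32 configs fit in one word");
static_assert(WordsPerConfigVec(64) == 2, "64 configs need two words");
```

Under this assumption, a reader compiled for one limit would misread records written with another unless it learns the limit from the file, and every per-prototype mask operation touches more words as "n" grows.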

