On Wed, Feb 27, 2013 at 11:54:39AM +0000, Nick White wrote:
> On Sun, Feb 24, 2013 at 05:53:52PM +0100, zdenko podobny wrote:
> > • tool for measuring of training quality e.g. how many pages I need to
> > training to get reasonable result? If I add another similar font how it
> > effect OCR result (I have a bad experience on this)? Is there problem with
> > specific symbol (is there need to focus on some specific symbol)?
>
> I have written a little shell script that runs various tests given a
> .traineddata file, that may well come close to what you want. It
> needs some cleaning up, but I should be able to release it in the
> next few days.

Right, they're ready to share now. Get the testing scripts from here:
https://gitorious.org/ancient-greek-training-for-tesseract/trainingtestscripts

I don't have a lot of time to devote to them at the moment, but
hopefully they'll be useful. There's a README that should explain
things well enough. And of course comments and patches are most
welcome!

Nick

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to [email protected]
For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en

