On Sun, Feb 24, 2013 at 05:53:52PM +0100, zdenko podobny wrote: > thanks for caring about this... Maybe with would make a sense to make fork of > these tools ;-) Just in a case that there will be nobody who will react on > your > patches. And we case some time with applying several patches from issues ;-)
OK, I've set up a git repo with my patches now, you can get it from https://gitorious.org/ancient-greek-training-for-tesseract/ocr-evaluation-tools I didn't need to use the svn2git tool Tom mentioned, as the Google Code project only ever had one commit, so there was no history to lose. I also spent some time last night working through the code and managed to get wordacc to work properly with unicode as well (using the wrapper script), so that's very good news. The README in the repository has basic information on using it. > • tool for measuring of training quality e.g. how many pages I need to > training to get reasonable result? If I add another similar font how it > effect OCR result (I have a bad experience on this)? Is there problem with > specific symbol (is there need to focus on some specific symbol)? I have written a little shell script that runs various tests given a .traineddata file, that may well come close to what you want. It needs some cleaning up, but I should be able to release it in the next few days. I've been enjoying working with these tests rather more than I had expected to. The best moment being when I ran a series of tests on all of the .traineddata files I've created since I started my training project, and the improvement was marvellously clear :) Nick -- -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en --- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. For more options, visit https://groups.google.com/groups/opt_out.

