On Sun, Feb 24, 2013 at 05:53:52PM +0100, zdenko podobny wrote:
> thanks for caring about this...  Maybe with would make a sense to make fork of
> these tools ;-) Just in a case that there will be nobody who will react on 
> your
> patches. And we case some time with applying several patches from issues ;-)

OK, I've set up a git repo with my patches now, you can get it from
https://gitorious.org/ancient-greek-training-for-tesseract/ocr-evaluation-tools

I didn't need to use the svn2git tool Tom mentioned, as the Google
Code project only ever had one commit, so there was no history to
lose.

I also spent some time last night working through the code and
managed to get wordacc to work properly with unicode as well (using
the wrapper script), so that's very good news. The README in the
repository has basic information on using it.

> • tool for measuring of training quality e.g. how many pages I need to
>   training to get reasonable result? If I add another similar font how it
>   effect OCR result (I have a bad experience on this)? Is there problem with
>   specific symbol (is there need to focus on some specific symbol)?

I have written a little shell script that runs various tests given a
.traineddata file, that may well come close to what you want. It
needs some cleaning up, but I should be able to release it in the
next few days.

I've been enjoying working with these tests rather more than I had
expected to. The best moment being when I ran a series of tests on
all of the .traineddata files I've created since I started my
training project, and the improvement was marvellously clear :)

Nick

-- 
-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

--- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/groups/opt_out.


Reply via email to