Re: re-training quality

Ray Smith Wed, 17 Jun 2009 11:19:21 -0700

Running the same data through the training system multiple times does not
change accuracy in tesseract. It does not use a back-propagation training
process at this time.Ray.


On Fri, Jun 12, 2009 at 5:39 AM, Yury Tarasievich <
[email protected]> wrote:

>
> Is the quality of recognition expected to be
> improving significantly, if the new scans (same
> font and size, same book in fact) are processed
> into the .box files using the previous training
> results (-l <langcode>) and the resulting new
> training then merged with the bulk, then used
> for the next new scan? What is the expected
> quality curve (exponential, logarithmic etc.)?
> Is there any reliably known quantity of data
> that's expected to produce, say, 95% accuracy?
>
> I can't figure this out for myself. Don't see
> this happening with my data anyway yet.
>
> -Yury
>
> >
>

--~--~---------~--~----~------------~-------~--~----~
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to 
[email protected]
For more options, visit this group at 
http://groups.google.com/group/tesseract-ocr?hl=en
-~----------~----~----~----~------~----~------~--~---

Re: re-training quality

Reply via email to