Re: General information about training procedure

Ray Smith Tue, 24 Mar 2009 07:44:23 -0700

You might need more samples. The training process usually uses a minimum of
5-10 samples of each character in each font.Did you get any errors from
applybox? See Important under Run Tesseract for Training.
Ray.


On Tue, Mar 24, 2009 at 2:32 AM, bergheil <[email protected]>wrote:

>
> Hello, nobody can please explain me what is wrong in my training
> process?
> Please help me.
>
> On 20 Mar, 08:38, bergheil <[email protected]> wrote:
> > Ciao a tutti,
> > I'm building a training for recogninze the cmc7 fonts (only numbers
> > and chars /^>! ).
> > I have used for training an openoffice file with 3 lines of fonts
> > wrote by myself and 8 file with the real document to recognize (bank
> > check). After the training tesseract recognize the whole openoffice
> > file and 50% of the 8 bank check.
> > Yesterday I have added 21 new bank check, and there'isnt no
> > improvement!!!! That's is impossible, so I can image that there is
> > something wrong in my training works.
> > The training documentation says : "The first step is to determine the
> > full character set to be used, and prepare a text or word processor
> > file containing a set of examples." , I have used the real document
> > for the training not the word processor file (just 3 lines really), is
> > it wrong?
> > Please help me.
> > Regards
> >
> > O.S.: debian 4.0
> > Tesseract: 2.03
> >
>

--~--~---------~--~----~------------~-------~--~----~
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to 
[email protected]
For more options, visit this group at 
http://groups.google.com/group/tesseract-ocr?hl=en
-~----------~----~----~----~------~----~------~--~---

Re: General information about training procedure

Reply via email to