The English box and TIFF files are available on the downloads page:
here <http://tesseract-ocr.googlecode.com/files/boxtiff-2.01.eng.tar.gz>.
Ray.

On Fri, Mar 27, 2009 at 12:20 AM, bergheil <[email protected]> wrote:

>
> Hello, where can I find an example of a good training? I need to
> understand what is wrong in my training.
> Bye
>
> On 24 Mar, 18:39, bergheil <[email protected]> wrote:
> > Hi Ray,
> > I have more than 30 samples, there is only 1 font, and I have more
> > than 5-10 samples for each char. I got applybox errors for only 1-2
> > box files, and I have excluded those from training.
> > If it is possible to attach a TIFF sample in this forum, I will show
> > you the font that I use.
> > Bye
> >
> > On 24 Mar, 15:43, Ray Smith <[email protected]> wrote:
> >
> > > You might need more samples. The training process usually uses a
> > > minimum of 5-10 samples of each character in each font. Did you get
> > > any errors from applybox? See Important under Run Tesseract for
> > > Training.
> > > Ray.
> >
> > > On Tue, Mar 24, 2009 at 2:32 AM, bergheil <[email protected]> wrote:
> >
> > > > Hello, can nobody please explain to me what is wrong in my
> > > > training process?
> > > > Please help me.
> >
> > > > On 20 Mar, 08:38, bergheil <[email protected]> wrote:
> > > > > Hello everyone,
> > > > > I'm building a training set to recognize the CMC7 font (only
> > > > > numbers and the chars /^>! ).
> > > > > For training I used an OpenOffice file with 3 lines in the font,
> > > > > written by myself, and 8 files with the real documents to
> > > > > recognize (bank checks). After the training, Tesseract recognizes
> > > > > the whole OpenOffice file and 50% of the 8 bank checks.
> > > > > Yesterday I added 21 new bank checks, and there is no
> > > > > improvement!!!! That is impossible, so I imagine that there is
> > > > > something wrong in my training work.
> > > > > The training documentation says: "The first step is to determine
> > > > > the full character set to be used, and prepare a text or word
> > > > > processor file containing a set of examples." I used the real
> > > > > documents for the training, not the word processor file (just 3
> > > > > lines really). Is that wrong?
> > > > > Please help me.
> > > > > Regards
> >
> > > > > O.S.: debian 4.0
> > > > > Tesseract: 2.03
> >
>
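
The guideline quoted above (a minimum of 5-10 samples of each character in each font) can be checked mechanically by counting the entries in the box files. Below is a minimal sketch in Python, assuming the Tesseract 2.x box layout of one line per character, `<char> <left> <bottom> <right> <top>`. The function names and the character set string in the usage note are hypothetical, made up for this illustration, and are not part of Tesseract.

```python
from collections import Counter
from pathlib import Path


def count_box_samples(box_text):
    """Count how many boxes each character has in the text of one box file."""
    counts = Counter()
    for line in box_text.splitlines():
        fields = line.split()
        # Assumed layout: <char> <left> <bottom> <right> <top>.
        # Lines with fewer than 5 fields (blank or damaged) are skipped.
        if len(fields) < 5:
            continue
        counts[fields[0]] += 1
    return counts


def count_box_files(directory, pattern="*.box"):
    """Sum the per-character counts over every box file in a directory."""
    total = Counter()
    for path in sorted(Path(directory).glob(pattern)):
        total.update(count_box_samples(path.read_text(encoding="utf-8")))
    return total


def undersampled(counts, charset, minimum=5):
    """Return the characters of charset that have fewer than minimum boxes."""
    return {ch: counts.get(ch, 0) for ch in charset if counts.get(ch, 0) < minimum}
```

For the character set described in this thread, something like `undersampled(count_box_files("."), "0123456789/^>!")` would list the characters that fall short, assuming the box files sit in the current directory and were already corrected by hand. Box files that gave applybox errors should be left out of the count, since they are excluded from training anyway.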
