Dmitry,

unfortunately I have not enough of time for tests :-(. I still hope Ray will
release more info before final 3.01. At the moment I focus on box editor.

BR,

Zdenko

On Tue, Feb 22, 2011 at 9:27 AM, Dmitry Silaev <[email protected]>wrote:

> Interesting. I was wondering about Cube since its traces began to
> appear in the source code but had no enough time to investigate it
> thorougly
>
> Zdenko, would you please kindly share your other findings on Cube?
>
> Regards,
> Dmitry
>
> On Tue, Feb 22, 2011 at 11:13 AM, zdenko podobny <[email protected]> wrote:
> > I doubt that google will release their (full) training set :-(
> > Have a look at svn to file eng.cube.size [1]. You can see there name of
> > fonts that was training for English in 3.01. As far as I understood there
> is
> > (unpublished/not released) possibility to train language data directly on
> > font files. Unfortunately there are no detail for "cube" part of
> training.
> > Zd.
> > [1] 12,4Mb!
> http://code.google.com/p/tesseract-ocr/source/browse/trunk/tessdata/eng.cube.size
> > On Wed, Feb 9, 2011 at 5:48 PM, Sly_bzh <[email protected]> wrote:
> >>
> >> I would like to train tesseract for English with some special fonts.
> >> Tesseract training documentation says that a text should be prepared
> >> and it must follow some important points (see
> >>
> >>
> http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3#Generate_Training_Images
> )
> >>
> >> Could someone provide to the community the content of a good and
> >> efficient text for english training ?
> >>
> >> Note : I think it could be useful to provide the texts that have been
> >> used to build the training files that could be downloaded in the
> >> "Download" section (http://code.google.com/p/tesseract-ocr/downloads/
> >> list). What do you think about that ?
> >>
> >> Thanks !
> >>
> >> --
> >> You received this message because you are subscribed to the Google
> Groups
> >> "tesseract-ocr" group.
> >> To post to this group, send email to [email protected].
> >> To unsubscribe from this group, send email to
> >> [email protected].
> >> For more options, visit this group at
> >> http://groups.google.com/group/tesseract-ocr?hl=en.
> >>
> >
> > --
> > You received this message because you are subscribed to the Google Groups
> > "tesseract-ocr" group.
> > To post to this group, send email to [email protected].
> > To unsubscribe from this group, send email to
> > [email protected].
> > For more options, visit this group at
> > http://groups.google.com/group/tesseract-ocr?hl=en.
> >
>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To post to this group, send email to [email protected].
> To unsubscribe from this group, send email to
> [email protected].
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en.
>
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To post to this group, send email to [email protected].
To unsubscribe from this group, send email to 
[email protected].
For more options, visit this group at 
http://groups.google.com/group/tesseract-ocr?hl=en.

Reply via email to