Dmitry, unfortunately I have not enough of time for tests :-(. I still hope Ray will release more info before final 3.01. At the moment I focus on box editor.
BR, Zdenko On Tue, Feb 22, 2011 at 9:27 AM, Dmitry Silaev <[email protected]>wrote: > Interesting. I was wondering about Cube since its traces began to > appear in the source code but had no enough time to investigate it > thorougly > > Zdenko, would you please kindly share your other findings on Cube? > > Regards, > Dmitry > > On Tue, Feb 22, 2011 at 11:13 AM, zdenko podobny <[email protected]> wrote: > > I doubt that google will release their (full) training set :-( > > Have a look at svn to file eng.cube.size [1]. You can see there name of > > fonts that was training for English in 3.01. As far as I understood there > is > > (unpublished/not released) possibility to train language data directly on > > font files. Unfortunately there are no detail for "cube" part of > training. > > Zd. > > [1] 12,4Mb! > http://code.google.com/p/tesseract-ocr/source/browse/trunk/tessdata/eng.cube.size > > On Wed, Feb 9, 2011 at 5:48 PM, Sly_bzh <[email protected]> wrote: > >> > >> I would like to train tesseract for English with some special fonts. > >> Tesseract training documentation says that a text should be prepared > >> and it must follow some important points (see > >> > >> > http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3#Generate_Training_Images > ) > >> > >> Could someone provide to the community the content of a good and > >> efficient text for english training ? > >> > >> Note : I think it could be useful to provide the texts that have been > >> used to build the training files that could be downloaded in the > >> "Download" section (http://code.google.com/p/tesseract-ocr/downloads/ > >> list). What do you think about that ? > >> > >> Thanks ! > >> > >> -- > >> You received this message because you are subscribed to the Google > Groups > >> "tesseract-ocr" group. > >> To post to this group, send email to [email protected]. > >> To unsubscribe from this group, send email to > >> [email protected]. > >> For more options, visit this group at > >> http://groups.google.com/group/tesseract-ocr?hl=en. > >> > > > > -- > > You received this message because you are subscribed to the Google Groups > > "tesseract-ocr" group. > > To post to this group, send email to [email protected]. > > To unsubscribe from this group, send email to > > [email protected]. > > For more options, visit this group at > > http://groups.google.com/group/tesseract-ocr?hl=en. > > > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To post to this group, send email to [email protected]. > To unsubscribe from this group, send email to > [email protected]. > For more options, visit this group at > http://groups.google.com/group/tesseract-ocr?hl=en. > > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected]. To unsubscribe from this group, send email to [email protected]. For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en.

