On Tue, Dec 25, 2012 at 3:41 AM, Patrick Questembert < [email protected]> wrote:
> The major languages such as English, French and Spanish come with a "cube"
> version of the training data (e.g. eng.cube.*). So far we have used only
> the regular training data (e.g. eng.traineddata). Can someone comment on:
> - benefits of using the cube versions, i.e. is the accuracy gain
> significant?
> - speed tradeoff?
>

There is not much information available about cube, so you will need to run your own tests. Here is some information from the forum:
https://groups.google.com/forum/?fromgroups=#!searchin/tesseract-ocr/cube/tesseract-ocr/tyV5_z65XMk/lBpT6ptmEq0J

> - how would I go about enabling the cube methods? Just placing the files
> in the tessdata folder? Still passing "eng" as the language param to init?
>

Have a look at issue 661, comment 4 [1], for a simple example of how to enable cube data.

[1] https://code.google.com/p/tesseract-ocr/issues/detail?id=661#c4

Zdenko
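
In case the linked issue is not reachable, here is a minimal sketch of a config file that should switch on cube in Tesseract 3.x. It assumes the eng.cube.* files sit in the tessdata folder next to eng.traineddata. The file name cube_on is hypothetical. The variable name and its values are taken from the OcrEngineMode enum as it exists in the 3.x sources, not from the issue comment itself, so treat this as an assumption to verify against your build.

```
# cube_on: hypothetical config file name, pick any name you like.
# Assumed values for tessedit_ocr_engine_mode in Tesseract 3.x:
#   0 = Tesseract only (the default)
#   1 = cube only
#   2 = Tesseract and cube combined
tessedit_ocr_engine_mode 2
```

The file would be passed as the last argument on the command line, after "-l eng"; the language stays "eng" either way. From the C++ API, the equivalent should be the three-argument TessBaseAPI::Init() with tesseract::OEM_TESSERACT_CUBE_COMBINED as the engine mode. Running the same set of images once with mode 0 and once with mode 2 is a simple way to measure the accuracy and speed difference on your own data.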

