Hi,

I'm trying to train new models to recognize handwritten numbers.  I've
formatted my training image and generated the box file successfully.
When I run tesseract for training, it seems to hang.  I've let
it run for several hours, and it has been using ~99% of the CPU without
changing its memory footprint at all.  Is this normal, or can anybody
hazard a guess as to what the problem might be?  Below is all of the
output I've received from the process.

C:\tesseract-2.04.exe>tesseract "Z:\Users\andrewcuneo\Documents\temp\fontfile.tif" junk nobatch box.train.stderr
Tesseract Open Source OCR Engine
Image has 8 * 3 bits per pixel, and size (2000,2000)
Resolution=72
APPLY_BOXES:
   Boxes read from boxfile:      90
   Initially labelled blobs:     90 in 6 rows
   Box failures detected:                    0
   Duped blobs for rebalance:     0
   "5" has fewest samples:     8
                                Total unlabelled words:        0
                                Final labelled words:         90
Generating training data
TRAINING ... Font name = UnknownFont.



Thanks,

Andrew
