I appreciate the quick reply! I don't know why the files didn't attach, that's very odd - I will have to repost them when I am home and also investigate as to if/how I forgot the mftraining step, and if so if that solves the issue. Thanks!
On Tuesday, March 5, 2013 3:21:00 AM UTC-5, zdenop wrote: > > There are no atttached data. Maybe try to use some online storage system > (google disk, skydrive, dropbox...) and send a link here. > > You stated you are following wiki instruction[1], but you log shows it is > not true - you did not run mftraining. > > [1] http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3 > > Zdenko > > > On Tue, Mar 5, 2013 at 4:14 AM, A. Naut <[email protected] <javascript:>>wrote: > >> I'm trying to train the attached files (Tesseract 3.02, following the >> instructions at >> http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3 ) , and >> although I can compete the training process successfully I can't get >> tesseract to work with the produce trainneddata file - I always receive the >> error: >> >> tessdata_manager.SeekToStart(TESSDATA_INTTEMP):Error:Assert failed:in >> file adaptmatch.cpp, line 555 >> >> I have attached the .box, .tif, and font_properties file I used for >> training purposes. (Although the training instructions says to add .exp? >> after the font name in the font_properties file, when I use ocr.exp0 as the >> font name in that file the shape clustering than fails). >> >> >> The following is the process I use for producing the training file: >> >> ./tesseract eng.icr.exp0.tif eng.icr.exp0 nobatch box.train.stderr >> Tesseract Open Source OCR Engine v3.02.02 with Leptonica >> APPLY_BOXES: >> Boxes read from boxfile: 315 >> Found 315 good blobs. >> Leaving 26 unlabelled blobs in 0 words. >> TRAINING ... Font name = icr >> Generated training data for 18 words >> ./unicharset_extractor eng.icr.exp0.box >> ./shapeclustering -F font_properties -U unicharset eng.icr.exp0.tr >> Reading eng.icr.exp0.tr ... >> Building master shape table >> Computing shape distances... >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 >> 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 >> Distance = 0.007463: Stopped with 1 merged, min dist 0.101266 >> Master shape_table:Number of shapes = 36 max unichars = 2 number with >> multiple unichars = 1 >> ./cntraining eng.icr.exp0.tr >> Reading eng.icr.exp0.tr ... >> Clustering ... >> >> Writing normproto ... >> mv unichartset icr.unicharset >> mv shapetable icr.shapetable >> mv normproto icr.normproto >> mv pffmtable icr.pffmtable >> mv inttemp icr.inttemp >> >> ./combine_tessdata icr. >> TessdataManager combined tesseract data files. >> Offset for type 0 is -1 >> Offset for type 1 is 140 >> Offset for type 2 is -1 >> Offset for type 3 is -1 >> Offset for type 4 is -1 >> Offset for type 5 is 2528 >> Offset for type 6 is -1 >> Offset for type 7 is -1 >> Offset for type 8 is -1 >> Offset for type 9 is -1 >> Offset for type 10 is -1 >> Offset for type 11 is -1 >> Offset for type 12 is -1 >> Offset for type 13 is 7841 >> Offset for type 14 is -1 >> Offset for type 15 is -1 >> Offset for type 16 is -1 >> >> -- >> -- >> You received this message because you are subscribed to the Google >> Groups "tesseract-ocr" group. >> To post to this group, send email to [email protected]<javascript:> >> To unsubscribe from this group, send email to >> [email protected] <javascript:> >> For more options, visit this group at >> http://groups.google.com/group/tesseract-ocr?hl=en >> >> --- >> You received this message because you are subscribed to the Google Groups >> "tesseract-ocr" group. >> To unsubscribe from this group and stop receiving emails from it, send an >> email to [email protected] <javascript:>. >> For more options, visit https://groups.google.com/groups/opt_out. >> >> >> > > -- -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en --- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. For more options, visit https://groups.google.com/groups/opt_out.

