I appreciate the quick reply! I don't know why the files didn't attach, 
that's very odd - I will have to repost them when I am home and also 
investigate as to if/how I forgot the mftraining step, and if so if that 
solves the issue.
Thanks!

On Tuesday, March 5, 2013 3:21:00 AM UTC-5, zdenop wrote:
>
> There are no atttached data. Maybe try to use some online storage system 
> (google disk, skydrive, dropbox...) and send a link here.
>
> You stated you are following wiki instruction[1], but you log shows it is 
> not true - you did not run mftraining.
>
> [1] http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3 
>
> Zdenko
>
>
> On Tue, Mar 5, 2013 at 4:14 AM, A. Naut <[email protected] <javascript:>>wrote:
>
>> I'm trying to train the attached files (Tesseract 3.02, following the 
>> instructions at 
>> http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3 ) , and 
>> although I can compete the training process successfully I can't get 
>> tesseract to work with the produce trainneddata file - I always receive the 
>> error:
>>
>> tessdata_manager.SeekToStart(TESSDATA_INTTEMP):Error:Assert failed:in 
>> file adaptmatch.cpp, line 555
>>
>> I have attached the .box, .tif, and font_properties file I used for 
>> training purposes. (Although the training instructions says to add .exp? 
>> after the font name in the font_properties file, when I use ocr.exp0 as the 
>> font name in that file the shape clustering than fails).
>>
>>
>> The following is the process I use for producing the training file:
>>
>> ./tesseract eng.icr.exp0.tif eng.icr.exp0 nobatch box.train.stderr
>> Tesseract Open Source OCR Engine v3.02.02 with Leptonica
>> APPLY_BOXES:
>>    Boxes read from boxfile:     315
>>    Found 315 good blobs.
>>    Leaving 26 unlabelled blobs in 0 words.
>> TRAINING ... Font name = icr
>> Generated training data for 18 words
>> ./unicharset_extractor  eng.icr.exp0.box
>> ./shapeclustering -F font_properties -U unicharset eng.icr.exp0.tr
>> Reading eng.icr.exp0.tr ...
>> Building master shape table
>> Computing shape distances...
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances...
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances...
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 
>> 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36
>> Distance = 0.007463: Stopped with 1 merged, min dist 0.101266
>> Master shape_table:Number of shapes = 36 max unichars = 2 number with 
>> multiple unichars = 1
>> ./cntraining  eng.icr.exp0.tr
>> Reading eng.icr.exp0.tr ...
>> Clustering ...
>>
>> Writing normproto ...
>> mv unichartset icr.unicharset
>> mv shapetable icr.shapetable
>> mv normproto icr.normproto
>> mv pffmtable icr.pffmtable
>> mv inttemp icr.inttemp
>>
>> ./combine_tessdata icr.
>> TessdataManager combined tesseract data files.
>> Offset for type 0 is -1
>> Offset for type 1 is 140
>> Offset for type 2 is -1
>> Offset for type 3 is -1
>> Offset for type 4 is -1
>> Offset for type 5 is 2528
>> Offset for type 6 is -1
>> Offset for type 7 is -1
>> Offset for type 8 is -1
>> Offset for type 9 is -1
>> Offset for type 10 is -1
>> Offset for type 11 is -1
>> Offset for type 12 is -1
>> Offset for type 13 is 7841
>> Offset for type 14 is -1
>> Offset for type 15 is -1
>> Offset for type 16 is -1
>>
>> -- 
>> -- 
>> You received this message because you are subscribed to the Google
>> Groups "tesseract-ocr" group.
>> To post to this group, send email to [email protected]<javascript:>
>> To unsubscribe from this group, send email to
>> [email protected] <javascript:>
>> For more options, visit this group at
>> http://groups.google.com/group/tesseract-ocr?hl=en
>>  
>> --- 
>> You received this message because you are subscribed to the Google Groups 
>> "tesseract-ocr" group.
>> To unsubscribe from this group and stop receiving emails from it, send an 
>> email to [email protected] <javascript:>.
>> For more options, visit https://groups.google.com/groups/opt_out.
>>  
>>  
>>
>
>

-- 
-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

--- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/groups/opt_out.


Reply via email to