Sorry for the late response.

Yep. I renamed the files and everything went fine.

Thanks.

On Friday, May 8, 2015 at 8:32:07 PM UTC+1, Quan Nguyen wrote:
>
> It appears that you left out some steps, such as file rename.
>
> If por is the language code, then the combine command would be:
>
> combine_tessdata por.
>
> Use a training tool, such as jTessBoxEditor 
> <http://vietocr.sourceforge.net/training.html>, if possible.
>
> On Thursday, May 7, 2015 at 1:00:45 AM UTC-5, Miguel Goyanes wrote:
>>
>> Hello.
>>
>> I'm reading the tutorial on how to create traineddata and I came across 
>> the error in the title just at the last part.
>>
>> I'm running Ubuntu 14.04 LTS on a Virtual machine.
>> Tesseract version is 3.02.02 and Leptonica is 1.72
>>
>> I have a por.monospaced.exp0.tif file, and here are the steps I ran 
>> and the resulting output:
>>
>> miguel@miguel:~/Desktop/TessTests$ tesseract por.monospaced.exp0.tif 
>> por.monospaced.exp0 batch.nochop makebox
>> Tesseract Open Source OCR Engine v3.02.02 with Leptonica
>>
>>
>>
>> Then:
>>
>> miguel@miguel:~/Desktop/TessTests$ tesseract por.monospaced.exp0.tif 
>> por.monospaced.exp0 box.train
>> Tesseract Open Source OCR Engine v3.02.02 with Leptonica
>> APPLY_BOXES:
>>    Boxes read from boxfile:      59
>>    Found 59 good blobs.
>>    Leaving 2 unlabelled blobs in 0 words.
>> TRAINING ... Font name = monospaced
>> Generated training data for 4 words
>>
>>
>> And  
>>
>> miguel@miguel:~/Desktop/TessTests$ unicharset_extractor 
>> por.monospaced.exp0.box
>> Extracting unicharset from por.monospaced.exp0.box
>> Wrote unicharset file ./unicharset.
>>
>>
>> I've created the font_properties file
>>  
>>
>> miguel@miguel:~/Desktop/TessTests$ echo monospaced 0 0 0 0 0 > 
>> font_properties
>>
>>
>> And
>>
>> miguel@miguel:~/Desktop/TessTests$ shapeclustering -F font_properties -U 
>> unicharset por.monospaced.exp0.tr
>> Reading por.monospaced.exp0.tr ...
>> Building master shape table
>> Computing shape distances...
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances...
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances...
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances... 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 
>> 18 19 20
>> Distance = 0.000000: Stopped with 1 merged, min dist 0.165217
>> Master shape_table:Number of shapes = 20 max unichars = 2 number with 
>> multiple unichars = 1
>> miguel@miguel:~/Desktop/TessTests$ mftraining -F font_properties -U 
>> unicharset -O por.unicharset por.monospaced.exp0.tr
>> Read shape table shapetable of 20 shapes
>> Reading por.monospaced.exp0.tr ...
>> Done!
>>
>>
>> And then
>>
>> miguel@miguel:~/Desktop/TessTests$ cntraining por.monospaced.exp0.tr
>> Reading por.monospaced.exp0.tr ...
>> Clustering ...
>>
>> Writing normproto ...
>>
>>
>> Finally, in the last command I'm getting the error:
>>
>> miguel@miguel:~/Desktop/TessTests$ combine_tessdata pass.
>> Combining tessdata files
>> Error opening unicharset file
>> Error combining tessdata files into pass.traineddata
>>
>>
>>
>> What am I doing wrong?
>>
>> I've attached all the files.
>>
>> Thanks 
>>
>
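
For anyone who finds this thread later, the rename step mentioned above can be sketched roughly as follows. This is only a sketch, assuming the Tesseract 3.02 training layout: mftraining and cntraining leave shapetable, inttemp, pffmtable and normproto in the working directory, and combine_tessdata looks for the same files with a "<lang>." prefix. The function name prefix_training_files is made up for illustration and is not part of Tesseract.

```shell
#!/bin/sh
# Hypothetical helper (not a Tesseract tool): add the "<lang>." prefix
# that combine_tessdata expects to the files written by the training tools.
# Usage: prefix_training_files LANG [DIR]
prefix_training_files() {
    lang="$1"
    dir="${2:-.}"
    # mftraining writes shapetable, inttemp and pffmtable;
    # cntraining writes normproto.
    for f in shapetable normproto inttemp pffmtable; do
        if [ -f "$dir/$f" ]; then
            mv "$dir/$f" "$dir/$lang.$f"
        fi
    done
}
```

With the files from the original post, one would run "prefix_training_files por" in ~/Desktop/TessTests and then "combine_tessdata por." (with the trailing dot). The unicharset needs no rename here because mftraining was already called with -O por.unicharset. The prefix passed to combine_tessdata has to match the prefix on the files, which is probably why "combine_tessdata pass." could not open a unicharset file.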
