It appears that you left out some steps, such as file rename.

If por is the language code, then the combine command would be:

combine_tessdata por.

Use a training tool, such as jTessBoxEditor 
<http://vietocr.sourceforge.net/training.html>, if possible.

On Thursday, May 7, 2015 at 1:00:45 AM UTC-5, Miguel Goyanes wrote:
>
> Hello.
>
> I'm reading the tutorial on hoe to create trainedata and I came across the 
> error in the title jsut at the last part.
>
> I'm running Ubuntu 14.04 LTS on a Virtual machine.
> Tesseract version is 3.02.02 abd leptonica is 1.72
>
> Hi have a por.monospaced.exp0.tif file and here are the steps I've made 
> and resulting output
>
> miguel@miguel:~/Desktop/TessTests$ tesseract por.monospaced.exp0.tif 
>> por.monospaced.exp0 batch.nochop makebox
>> Tesseract Open Source OCR Engine v3.02.02 with Leptonica
>
>
>
> Then:
>
> miguel@miguel:~/Desktop/TessTests$ tesseract por.monospaced.exp0.tif 
> por.monospaced.exp0 box.train
> Tesseract Open Source OCR Engine v3.02.02 with Leptonica
> APPLY_BOXES:
>    Boxes read from boxfile:      59
>    Found 59 good blobs.
>    Leaving 2 unlabelled blobs in 0 words.
> TRAINING ... Font name = monospaced
> Generated training data for 4 words
>
>
> And  
>
> miguel@miguel:~/Desktop/TessTests$ unicharset_extractor 
> por.monospaced.exp0.box
> Extracting unicharset from por.monospaced.exp0.box
> Wrote unicharset file ./unicharset.
>
>
> I've created the font_properties file
>  
>
> miguel@miguel:~/Desktop/TessTests$ echo monospaced 0 0 0 0 0 > 
> font_properties
>
>
> And
>
> miguel@miguel:~/Desktop/TessTests$ shapeclustering -F font_properties -U 
> unicharset por.monospaced.exp0.tr
> Reading por.monospaced.exp0.tr ...
> Building master shape table
> Computing shape distances...
> Stopped with 0 merged, min dist 999.000000
> Computing shape distances... 0
> Stopped with 0 merged, min dist 999.000000
> Computing shape distances... 0
> Stopped with 0 merged, min dist 999.000000
> Computing shape distances... 0
> Stopped with 0 merged, min dist 999.000000
> Computing shape distances... 0
> Stopped with 0 merged, min dist 999.000000
> Computing shape distances... 0
> Stopped with 0 merged, min dist 999.000000
> Computing shape distances... 0
> Stopped with 0 merged, min dist 999.000000
> Computing shape distances... 0
> Stopped with 0 merged, min dist 999.000000
> Computing shape distances... 0
> Stopped with 0 merged, min dist 999.000000
> Computing shape distances... 0
> Stopped with 0 merged, min dist 999.000000
> Computing shape distances... 0
> Stopped with 0 merged, min dist 999.000000
> Computing shape distances... 0
> Stopped with 0 merged, min dist 999.000000
> Computing shape distances... 0
> Stopped with 0 merged, min dist 999.000000
> Computing shape distances... 0
> Stopped with 0 merged, min dist 999.000000
> Computing shape distances... 0
> Stopped with 0 merged, min dist 999.000000
> Computing shape distances... 0
> Stopped with 0 merged, min dist 999.000000
> Computing shape distances... 0
> Stopped with 0 merged, min dist 999.000000
> Computing shape distances... 0
> Stopped with 0 merged, min dist 999.000000
> Computing shape distances... 0
> Stopped with 0 merged, min dist 999.000000
> Computing shape distances... 0
> Stopped with 0 merged, min dist 999.000000
> Computing shape distances... 0
> Stopped with 0 merged, min dist 999.000000
> Computing shape distances... 0
> Stopped with 0 merged, min dist 999.000000
> Computing shape distances...
> Stopped with 0 merged, min dist 999.000000
> Computing shape distances...
> Stopped with 0 merged, min dist 999.000000
> Computing shape distances... 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 
> 18 19 20
> Distance = 0.000000: Stopped with 1 merged, min dist 0.165217
> Master shape_table:Number of shapes = 20 max unichars = 2 number with 
> multiple unichars = 1
> miguel@miguel:~/Desktop/TessTests$ mftraining -F font_properties -U 
> unicharset -O por.unicharset por.monospaced.exp0.tr
> Read shape table shapetable of 20 shapes
> Reading por.monospaced.exp0.tr ...
> Done!
>
>
> And then
>
> miguel@miguel:~/Desktop/TessTests$ cntraining por.monospaced.exp0.tr
> Reading por.monospaced.exp0.tr ...
> Clustering ...
>
> Writing normproto ...
>
>
> Finally,in the last command I'm getting the error:
>
> miguel@miguel:~/Desktop/TessTests$ combine_tessdata pass.
> Combining tessdata files
> Error opening unicharset file
> Error combining tessdata files into pass.traineddata
>
>
>
> What am I doing  wrong?
>
> I've attached all the files.
>
> Thanks 
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>  
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at http://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/c1190f7e-ba76-4473-a5ff-e9a1857f2016%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to