It appears that you left out some steps, such as file rename. If por is the language code, then the combine command would be:
combine_tessdata por. Use a training tool, such as jTessBoxEditor <http://vietocr.sourceforge.net/training.html>, if possible. On Thursday, May 7, 2015 at 1:00:45 AM UTC-5, Miguel Goyanes wrote: > > Hello. > > I'm reading the tutorial on hoe to create trainedata and I came across the > error in the title jsut at the last part. > > I'm running Ubuntu 14.04 LTS on a Virtual machine. > Tesseract version is 3.02.02 abd leptonica is 1.72 > > Hi have a por.monospaced.exp0.tif file and here are the steps I've made > and resulting output > > miguel@miguel:~/Desktop/TessTests$ tesseract por.monospaced.exp0.tif >> por.monospaced.exp0 batch.nochop makebox >> Tesseract Open Source OCR Engine v3.02.02 with Leptonica > > > > Then: > > miguel@miguel:~/Desktop/TessTests$ tesseract por.monospaced.exp0.tif > por.monospaced.exp0 box.train > Tesseract Open Source OCR Engine v3.02.02 with Leptonica > APPLY_BOXES: > Boxes read from boxfile: 59 > Found 59 good blobs. > Leaving 2 unlabelled blobs in 0 words. > TRAINING ... Font name = monospaced > Generated training data for 4 words > > > And > > miguel@miguel:~/Desktop/TessTests$ unicharset_extractor > por.monospaced.exp0.box > Extracting unicharset from por.monospaced.exp0.box > Wrote unicharset file ./unicharset. > > > I've created the font_properties file > > > miguel@miguel:~/Desktop/TessTests$ echo monospaced 0 0 0 0 0 > > font_properties > > > And > > miguel@miguel:~/Desktop/TessTests$ shapeclustering -F font_properties -U > unicharset por.monospaced.exp0.tr > Reading por.monospaced.exp0.tr ... > Building master shape table > Computing shape distances... > Stopped with 0 merged, min dist 999.000000 > Computing shape distances... 0 > Stopped with 0 merged, min dist 999.000000 > Computing shape distances... 0 > Stopped with 0 merged, min dist 999.000000 > Computing shape distances... 0 > Stopped with 0 merged, min dist 999.000000 > Computing shape distances... 0 > Stopped with 0 merged, min dist 999.000000 > Computing shape distances... 0 > Stopped with 0 merged, min dist 999.000000 > Computing shape distances... 0 > Stopped with 0 merged, min dist 999.000000 > Computing shape distances... 0 > Stopped with 0 merged, min dist 999.000000 > Computing shape distances... 0 > Stopped with 0 merged, min dist 999.000000 > Computing shape distances... 0 > Stopped with 0 merged, min dist 999.000000 > Computing shape distances... 0 > Stopped with 0 merged, min dist 999.000000 > Computing shape distances... 0 > Stopped with 0 merged, min dist 999.000000 > Computing shape distances... 0 > Stopped with 0 merged, min dist 999.000000 > Computing shape distances... 0 > Stopped with 0 merged, min dist 999.000000 > Computing shape distances... 0 > Stopped with 0 merged, min dist 999.000000 > Computing shape distances... 0 > Stopped with 0 merged, min dist 999.000000 > Computing shape distances... 0 > Stopped with 0 merged, min dist 999.000000 > Computing shape distances... 0 > Stopped with 0 merged, min dist 999.000000 > Computing shape distances... 0 > Stopped with 0 merged, min dist 999.000000 > Computing shape distances... 0 > Stopped with 0 merged, min dist 999.000000 > Computing shape distances... 0 > Stopped with 0 merged, min dist 999.000000 > Computing shape distances... 0 > Stopped with 0 merged, min dist 999.000000 > Computing shape distances... > Stopped with 0 merged, min dist 999.000000 > Computing shape distances... > Stopped with 0 merged, min dist 999.000000 > Computing shape distances... 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 > 18 19 20 > Distance = 0.000000: Stopped with 1 merged, min dist 0.165217 > Master shape_table:Number of shapes = 20 max unichars = 2 number with > multiple unichars = 1 > miguel@miguel:~/Desktop/TessTests$ mftraining -F font_properties -U > unicharset -O por.unicharset por.monospaced.exp0.tr > Read shape table shapetable of 20 shapes > Reading por.monospaced.exp0.tr ... > Done! > > > And then > > miguel@miguel:~/Desktop/TessTests$ cntraining por.monospaced.exp0.tr > Reading por.monospaced.exp0.tr ... > Clustering ... > > Writing normproto ... > > > Finally,in the last command I'm getting the error: > > miguel@miguel:~/Desktop/TessTests$ combine_tessdata pass. > Combining tessdata files > Error opening unicharset file > Error combining tessdata files into pass.traineddata > > > > What am I doing wrong? > > I've attached all the files. > > Thanks > > > > > > > > > > > > > > > > > > > > > > > > > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at http://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/c1190f7e-ba76-4473-a5ff-e9a1857f2016%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.

