Sorry for the late response. Yep. Renamed and all went fine.
Thanks.

On Friday, May 8, 2015 at 8:32:07 PM UTC+1, Quan Nguyen wrote:
>
> It appears that you left out some steps, such as the file rename.
>
> If por is the language code, then the combine command would be:
>
> combine_tessdata por.
>
> Use a training tool, such as jTessBoxEditor
> <http://vietocr.sourceforge.net/training.html>, if possible.
>
> On Thursday, May 7, 2015 at 1:00:45 AM UTC-5, Miguel Goyanes wrote:
>>
>> Hello.
>>
>> I'm reading the tutorial on how to create traineddata and I came across
>> the error in the title just at the last part.
>>
>> I'm running Ubuntu 14.04 LTS on a virtual machine.
>> Tesseract version is 3.02.02 and Leptonica is 1.72.
>>
>> I have a por.monospaced.exp0.tif file, and here are the steps I've taken
>> and the resulting output:
>>
>> miguel@miguel:~/Desktop/TessTests$ tesseract por.monospaced.exp0.tif por.monospaced.exp0 batch.nochop makebox
>> Tesseract Open Source OCR Engine v3.02.02 with Leptonica
>>
>> Then:
>>
>> miguel@miguel:~/Desktop/TessTests$ tesseract por.monospaced.exp0.tif por.monospaced.exp0 box.train
>> Tesseract Open Source OCR Engine v3.02.02 with Leptonica
>> APPLY_BOXES:
>> Boxes read from boxfile: 59
>> Found 59 good blobs.
>> Leaving 2 unlabelled blobs in 0 words.
>> TRAINING ... Font name = monospaced
>> Generated training data for 4 words
>>
>> And:
>>
>> miguel@miguel:~/Desktop/TessTests$ unicharset_extractor por.monospaced.exp0.box
>> Extracting unicharset from por.monospaced.exp0.box
>> Wrote unicharset file ./unicharset.
>>
>> I've created the font_properties file:
>>
>> miguel@miguel:~/Desktop/TessTests$ echo monospaced 0 0 0 0 0 > font_properties
>>
>> And:
>>
>> miguel@miguel:~/Desktop/TessTests$ shapeclustering -F font_properties -U unicharset por.monospaced.exp0.tr
>> Reading por.monospaced.exp0.tr ...
>> Building master shape table
>> Computing shape distances...
>> Stopped with 0 merged, min dist 999.000000
>> Computing shape distances...
0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 0 >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... >> Stopped with 0 merged, min dist 999.000000 >> Computing shape distances... 
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
>> Distance = 0.000000: Stopped with 1 merged, min dist 0.165217
>> Master shape_table:Number of shapes = 20 max unichars = 2 number with multiple unichars = 1
>> miguel@miguel:~/Desktop/TessTests$ mftraining -F font_properties -U unicharset -O por.unicharset por.monospaced.exp0.tr
>> Read shape table shapetable of 20 shapes
>> Reading por.monospaced.exp0.tr ...
>> Done!
>>
>> And then:
>>
>> miguel@miguel:~/Desktop/TessTests$ cntraining por.monospaced.exp0.tr
>> Reading por.monospaced.exp0.tr ...
>> Clustering ...
>>
>> Writing normproto ...
>>
>> Finally, in the last command I'm getting the error:
>>
>> miguel@miguel:~/Desktop/TessTests$ combine_tessdata pass.
>> Combining tessdata files
>> Error opening unicharset file
>> Error combining tessdata files into pass.traineddata
>>
>> What am I doing wrong?
>>
>> I've attached all the files.
>>
>> Thanks
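
For anyone who lands here with the same error, the rename step that fixed it can be sketched as a small shell helper. This is only a sketch, assuming the Tesseract 3.02 convention that combine_tessdata looks for component files carrying the language prefix (here "por."). The function name prefix_training_files is made up for illustration and is not part of Tesseract. It assumes mftraining was run with -O por.unicharset as in the session above, so the unicharset is left alone and only the four unprefixed outputs are renamed.

```shell
#!/bin/sh
# Hypothetical helper (not part of Tesseract): give the unprefixed training
# outputs the language prefix that combine_tessdata is assumed to expect.
# Usage: prefix_training_files LANG [DIR]
prefix_training_files() {
    lang="$1"
    dir="${2:-.}"
    # Files written by mftraining, cntraining and shapeclustering in 3.02.
    for f in inttemp normproto pffmtable shapetable; do
        if [ -f "$dir/$f" ]; then
            mv "$dir/$f" "$dir/$lang.$f"
        fi
    done
}
```

Running prefix_training_files por in the training directory and then combine_tessdata por. (note the trailing dot, and "por." rather than "pass.") should let the combine step find por.unicharset and the other components.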

