Great to hear that you successfully generated Kannada traineddata using trestrain.sh
Did you test to see whether there is difference/improvement in recognition compared to the kan traineddata provided by Google? The terminal extract also indicated a 'flat shape table' . - sent from my phone. excuse the brevity. On 27-Nov-2015 8:59 pm, "Sriranga(83yrsold)" < [email protected]> wrote: > In coninuation of my previous post - I like to inform that also succeeded > to generate the kan.traineddata file in tesseract-3.05.0Dev using > tesstrain.sh. > I am thankful to all concerned who helped me to solve the problem. > Good Luck. > > On Fri, Nov 27, 2015 at 6:45 PM, Sriranga(83yrsold) < > [email protected]> wrote: > >> HI >> After several attempts- for more than two days, now >> Successfully generated kan.traineddata file in ubuntu 15.10 using >> tesstrain.sh of tesseract-3.04. >> Attached terminal extract for benefit of users. since kan.traineddata >> exceeds 25mb - could not attached herewith. Please note all fonts listed in >> language-specific.sh did not work for kan - resulting failures. I don't >> know reason why it does not work? >> with best of luck, >> sriranga(83) >> >> On Tue, Nov 17, 2015 at 10:43 PM, Sriranga(83yrsold) < >> [email protected]> wrote: >> >>> Marco, >>> from where I can download the packaged the training utilities for >>> 3.04.00 - since it contains tesstrain.sh? I wanted to generate >>> kan.trainedata file using "tesstrain.sh" in cygwin and test >>> I may Kindly be intimated the step by step procedure to be followed. On >>> receipt I shall test for lang -Kan and feedback to you. >>> With warmest regards,sriranga(83) >>> >>> On Sun, Nov 15, 2015 at 11:30 PM, Marco Atzeri <[email protected]> >>> wrote: >>> >>>> On 15/11/2015 18:45, Nick White wrote: >>>> >>>>> On Sun, Nov 15, 2015 at 09:16:29PM +0530, Sriranga(83yrsold) wrote: >>>>> >>>>>> Dear nick, >>>>>> kindly clarify whether "make" file will work on windows "vista" since >>>>>> binaries >>>>>> for windows are not available for download at present? If so how to >>>>>> do? >>>>>> >>>>> >>>>> No, it won't work on Windows, and I have no plans to make it do so. >>>>> The Tesseract training tools it uses (tesstrain.sh etc.) don't work >>>>> on Windows either, so there's no point in me spending time getting >>>>> my tools to work on it. Besides, I am tired of wrestling with >>>>> getting things to work on Windows these days. >>>>> >>>>> You could probably get it to work with Cygwin, if you really needed >>>>> to, but I don't have the time, interest or knowledge to walk you >>>>> through the exact steps. >>>>> >>>>> Nick >>>>> >>>> >>>> On cygwin I already packaged the training utilities for 3.04.00. >>>> and some training data. >>>> >>>> If anything else is needed, or does not work properly, >>>> I will work on it. >>>> >>>> $ cygcheck -l tesseract-training-util >>>> /usr/bin/ambiguous_words.exe >>>> /usr/bin/classifier_tester.exe >>>> /usr/bin/cntraining.exe >>>> /usr/bin/combine_tessdata.exe >>>> /usr/bin/dawg2wordlist.exe >>>> /usr/bin/mftraining.exe >>>> /usr/bin/set_unicharset_properties.exe >>>> /usr/bin/shapeclustering.exe >>>> /usr/bin/text2image.exe >>>> /usr/bin/unicharset_extractor.exe >>>> /usr/bin/wordlist2dawg.exe >>>> /usr/bin/language-specific.sh >>>> /usr/bin/tesstrain.sh >>>> /usr/bin/tesstrain_utils.sh >>>> >>>> $ cygcheck -l tesseract-training-core >>>> /usr/share/tessdata/training/Arabic.unicharset >>>> /usr/share/tessdata/training/Arabic.xheights >>>> /usr/share/tessdata/training/Armenian.unicharset >>>> /usr/share/tessdata/training/Armenian.xheights >>>> /usr/share/tessdata/training/Bengali.unicharset >>>> /usr/share/tessdata/training/Bengali.xheights >>>> /usr/share/tessdata/training/Bopomofo.unicharset >>>> /usr/share/tessdata/training/Bopomofo.xheights >>>> /usr/share/tessdata/training/Canadian_Aboriginal.unicharset >>>> /usr/share/tessdata/training/Canadian_Aboriginal.xheights >>>> /usr/share/tessdata/training/Cherokee.unicharset >>>> /usr/share/tessdata/training/Cherokee.xheights >>>> /usr/share/tessdata/training/common.punc >>>> /usr/share/tessdata/training/common.unicharambigs >>>> /usr/share/tessdata/training/Common.unicharset >>>> /usr/share/tessdata/training/Cyrillic.unicharset >>>> /usr/share/tessdata/training/Cyrillic.xheights >>>> /usr/share/tessdata/training/Devanagari.unicharset >>>> /usr/share/tessdata/training/Devanagari.xheights >>>> /usr/share/tessdata/training/Ethiopic.unicharset >>>> /usr/share/tessdata/training/Ethiopic.xheights >>>> /usr/share/tessdata/training/font_properties >>>> /usr/share/tessdata/training/forbidden_characters_default >>>> /usr/share/tessdata/training/Georgian.unicharset >>>> /usr/share/tessdata/training/Georgian.xheights >>>> /usr/share/tessdata/training/Greek.unicharset >>>> /usr/share/tessdata/training/Greek.xheights >>>> /usr/share/tessdata/training/Gujarati.unicharset >>>> /usr/share/tessdata/training/Gujarati.xheights >>>> /usr/share/tessdata/training/Gurmukhi.unicharset >>>> /usr/share/tessdata/training/Gurmukhi.xheights >>>> /usr/share/tessdata/training/Han.unicharset >>>> /usr/share/tessdata/training/Han.xheights >>>> /usr/share/tessdata/training/Hangul.unicharset >>>> /usr/share/tessdata/training/Hangul.xheights >>>> /usr/share/tessdata/training/Hebrew.unicharset >>>> /usr/share/tessdata/training/Hebrew.xheights >>>> /usr/share/tessdata/training/Hiragana.unicharset >>>> /usr/share/tessdata/training/Hiragana.xheights >>>> /usr/share/tessdata/training/Kannada.unicharset >>>> /usr/share/tessdata/training/Kannada.xheights >>>> /usr/share/tessdata/training/Katakana.unicharset >>>> /usr/share/tessdata/training/Katakana.xheights >>>> /usr/share/tessdata/training/Khmer.unicharset >>>> /usr/share/tessdata/training/Khmer.xheights >>>> /usr/share/tessdata/training/Lao.unicharset >>>> /usr/share/tessdata/training/Lao.xheights >>>> /usr/share/tessdata/training/Latin.unicharset >>>> /usr/share/tessdata/training/Latin.xheights >>>> /usr/share/tessdata/training/Malayalam.unicharset >>>> /usr/share/tessdata/training/Malayalam.xheights >>>> /usr/share/tessdata/training/Myanmar.unicharset >>>> /usr/share/tessdata/training/Myanmar.xheights >>>> /usr/share/tessdata/training/Ogham.unicharset >>>> /usr/share/tessdata/training/Ogham.xheights >>>> /usr/share/tessdata/training/Oriya.unicharset >>>> /usr/share/tessdata/training/Oriya.xheights >>>> /usr/share/tessdata/training/Runic.unicharset >>>> /usr/share/tessdata/training/Runic.xheights >>>> /usr/share/tessdata/training/Sinhala.unicharset >>>> /usr/share/tessdata/training/Sinhala.xheights >>>> /usr/share/tessdata/training/Syriac.unicharset >>>> /usr/share/tessdata/training/Syriac.xheights >>>> /usr/share/tessdata/training/Tamil.unicharset >>>> /usr/share/tessdata/training/Tamil.xheights >>>> /usr/share/tessdata/training/Telugu.unicharset >>>> /usr/share/tessdata/training/Telugu.xheights >>>> /usr/share/tessdata/training/Thai.unicharset >>>> /usr/share/tessdata/training/Thai.xheights >>>> /usr/share/tessdata/training/Tibetan.unicharset >>>> >>>> >>>> Marco >>>> >>>> -- >>>> You received this message because you are subscribed to the Google >>>> Groups "tesseract-ocr" group. >>>> To unsubscribe from this group and stop receiving emails from it, send >>>> an email to [email protected]. >>>> To post to this group, send email to [email protected]. >>>> Visit this group at http://groups.google.com/group/tesseract-ocr. >>>> To view this discussion on the web visit >>>> https://groups.google.com/d/msgid/tesseract-ocr/5648C854.20700%40gmail.com >>>> . >>>> >>>> For more options, visit https://groups.google.com/d/optout. >>>> >>> >>> >> > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > To post to this group, send email to [email protected]. > Visit this group at http://groups.google.com/group/tesseract-ocr. > To view this discussion on the web visit > https://groups.google.com/d/msgid/tesseract-ocr/CANKD7YztLqEn9jGA1DogYC9wMZjZndHHMFsS%2BUpzoVQfQV%2BTvQ%40mail.gmail.com > <https://groups.google.com/d/msgid/tesseract-ocr/CANKD7YztLqEn9jGA1DogYC9wMZjZndHHMFsS%2BUpzoVQfQV%2BTvQ%40mail.gmail.com?utm_medium=email&utm_source=footer> > . > For more options, visit https://groups.google.com/d/optout. > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at http://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduUGkZhg5PH%2BSWVV7ATqhc2%3DA86Gr4-sNU_u0NjTzGUqzw%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.

