Great to hear that you successfully generated Kannada traineddata using
tesstrain.sh.

Did you test whether there is any difference or improvement in recognition
compared to the kan traineddata provided by Google?

The terminal extract also indicated a 'flat shape table'.

- sent from my phone. excuse the brevity.
On 27-Nov-2015 8:59 pm, "Sriranga(83yrsold)" <
[email protected]> wrote:

> In continuation of my previous post, I would like to report that I also
> succeeded in generating the kan.traineddata file in tesseract-3.05.0Dev
> using tesstrain.sh.
> I am thankful to all concerned who helped me to solve the problem.
> Good Luck.
>
> On Fri, Nov 27, 2015 at 6:45 PM, Sriranga(83yrsold) <
> [email protected]> wrote:
>
>> Hi,
>> After several attempts over more than two days, I have now
>> successfully generated the kan.traineddata file on Ubuntu 15.10 using
>> tesstrain.sh from tesseract-3.04.
>> I have attached the terminal extract for the benefit of other users. Since
>> kan.traineddata exceeds 25 MB, I could not attach it. Please note that all
>> the fonts listed in language-specific.sh did not work for kan, resulting in
>> failures. I don't know why they do not work.
>> With best of luck,
>> sriranga(83)
>>
>> On Tue, Nov 17, 2015 at 10:43 PM, Sriranga(83yrsold) <
>> [email protected]> wrote:
>>
>>> Marco,
>>> where can I download the packaged training utilities for 3.04.00,
>>> since they contain tesstrain.sh? I want to generate the
>>> kan.traineddata file using "tesstrain.sh" in Cygwin and test it.
>>> Kindly let me know the step-by-step procedure to follow. On
>>> receipt, I shall test for the language kan and send feedback to you.
>>> With warmest regards, sriranga(83)
>>>
>>> On Sun, Nov 15, 2015 at 11:30 PM, Marco Atzeri <[email protected]>
>>> wrote:
>>>
>>>> On 15/11/2015 18:45, Nick White wrote:
>>>>
>>>>> On Sun, Nov 15, 2015 at 09:16:29PM +0530, Sriranga(83yrsold) wrote:
>>>>>
>>>>>> Dear Nick,
>>>>>> kindly clarify whether the "make" file will work on Windows Vista,
>>>>>> since binaries for Windows are not available for download at
>>>>>> present. If so, how do I do it?
>>>>>>
>>>>>
>>>>> No, it won't work on Windows, and I have no plans to make it do so.
>>>>> The Tesseract training tools it uses (tesstrain.sh etc.) don't work
>>>>> on Windows either, so there's no point in me spending time getting
>>>>> my tools to work on it. Besides, I am tired of wrestling with
>>>>> getting things to work on Windows these days.
>>>>>
>>>>> You could probably get it to work with Cygwin, if you really needed
>>>>> to, but I don't have the time, interest or knowledge to walk you
>>>>> through the exact steps.
>>>>>
>>>>> Nick
>>>>>
>>>>
>>>> On Cygwin I have already packaged the training utilities for 3.04.00
>>>> and some training data.
>>>>
>>>> If anything else is needed, or does not work properly,
>>>> I will work on it.
>>>>
>>>> $ cygcheck -l tesseract-training-util
>>>> /usr/bin/ambiguous_words.exe
>>>> /usr/bin/classifier_tester.exe
>>>> /usr/bin/cntraining.exe
>>>> /usr/bin/combine_tessdata.exe
>>>> /usr/bin/dawg2wordlist.exe
>>>> /usr/bin/mftraining.exe
>>>> /usr/bin/set_unicharset_properties.exe
>>>> /usr/bin/shapeclustering.exe
>>>> /usr/bin/text2image.exe
>>>> /usr/bin/unicharset_extractor.exe
>>>> /usr/bin/wordlist2dawg.exe
>>>> /usr/bin/language-specific.sh
>>>> /usr/bin/tesstrain.sh
>>>> /usr/bin/tesstrain_utils.sh
>>>>
>>>> $ cygcheck -l tesseract-training-core
>>>> /usr/share/tessdata/training/Arabic.unicharset
>>>> /usr/share/tessdata/training/Arabic.xheights
>>>> /usr/share/tessdata/training/Armenian.unicharset
>>>> /usr/share/tessdata/training/Armenian.xheights
>>>> /usr/share/tessdata/training/Bengali.unicharset
>>>> /usr/share/tessdata/training/Bengali.xheights
>>>> /usr/share/tessdata/training/Bopomofo.unicharset
>>>> /usr/share/tessdata/training/Bopomofo.xheights
>>>> /usr/share/tessdata/training/Canadian_Aboriginal.unicharset
>>>> /usr/share/tessdata/training/Canadian_Aboriginal.xheights
>>>> /usr/share/tessdata/training/Cherokee.unicharset
>>>> /usr/share/tessdata/training/Cherokee.xheights
>>>> /usr/share/tessdata/training/common.punc
>>>> /usr/share/tessdata/training/common.unicharambigs
>>>> /usr/share/tessdata/training/Common.unicharset
>>>> /usr/share/tessdata/training/Cyrillic.unicharset
>>>> /usr/share/tessdata/training/Cyrillic.xheights
>>>> /usr/share/tessdata/training/Devanagari.unicharset
>>>> /usr/share/tessdata/training/Devanagari.xheights
>>>> /usr/share/tessdata/training/Ethiopic.unicharset
>>>> /usr/share/tessdata/training/Ethiopic.xheights
>>>> /usr/share/tessdata/training/font_properties
>>>> /usr/share/tessdata/training/forbidden_characters_default
>>>> /usr/share/tessdata/training/Georgian.unicharset
>>>> /usr/share/tessdata/training/Georgian.xheights
>>>> /usr/share/tessdata/training/Greek.unicharset
>>>> /usr/share/tessdata/training/Greek.xheights
>>>> /usr/share/tessdata/training/Gujarati.unicharset
>>>> /usr/share/tessdata/training/Gujarati.xheights
>>>> /usr/share/tessdata/training/Gurmukhi.unicharset
>>>> /usr/share/tessdata/training/Gurmukhi.xheights
>>>> /usr/share/tessdata/training/Han.unicharset
>>>> /usr/share/tessdata/training/Han.xheights
>>>> /usr/share/tessdata/training/Hangul.unicharset
>>>> /usr/share/tessdata/training/Hangul.xheights
>>>> /usr/share/tessdata/training/Hebrew.unicharset
>>>> /usr/share/tessdata/training/Hebrew.xheights
>>>> /usr/share/tessdata/training/Hiragana.unicharset
>>>> /usr/share/tessdata/training/Hiragana.xheights
>>>> /usr/share/tessdata/training/Kannada.unicharset
>>>> /usr/share/tessdata/training/Kannada.xheights
>>>> /usr/share/tessdata/training/Katakana.unicharset
>>>> /usr/share/tessdata/training/Katakana.xheights
>>>> /usr/share/tessdata/training/Khmer.unicharset
>>>> /usr/share/tessdata/training/Khmer.xheights
>>>> /usr/share/tessdata/training/Lao.unicharset
>>>> /usr/share/tessdata/training/Lao.xheights
>>>> /usr/share/tessdata/training/Latin.unicharset
>>>> /usr/share/tessdata/training/Latin.xheights
>>>> /usr/share/tessdata/training/Malayalam.unicharset
>>>> /usr/share/tessdata/training/Malayalam.xheights
>>>> /usr/share/tessdata/training/Myanmar.unicharset
>>>> /usr/share/tessdata/training/Myanmar.xheights
>>>> /usr/share/tessdata/training/Ogham.unicharset
>>>> /usr/share/tessdata/training/Ogham.xheights
>>>> /usr/share/tessdata/training/Oriya.unicharset
>>>> /usr/share/tessdata/training/Oriya.xheights
>>>> /usr/share/tessdata/training/Runic.unicharset
>>>> /usr/share/tessdata/training/Runic.xheights
>>>> /usr/share/tessdata/training/Sinhala.unicharset
>>>> /usr/share/tessdata/training/Sinhala.xheights
>>>> /usr/share/tessdata/training/Syriac.unicharset
>>>> /usr/share/tessdata/training/Syriac.xheights
>>>> /usr/share/tessdata/training/Tamil.unicharset
>>>> /usr/share/tessdata/training/Tamil.xheights
>>>> /usr/share/tessdata/training/Telugu.unicharset
>>>> /usr/share/tessdata/training/Telugu.xheights
>>>> /usr/share/tessdata/training/Thai.unicharset
>>>> /usr/share/tessdata/training/Thai.xheights
>>>> /usr/share/tessdata/training/Tibetan.unicharset
>>>>
>>>>
>>>> Marco
>>>>
>>>> --
>>>> You received this message because you are subscribed to the Google
>>>> Groups "tesseract-ocr" group.
>>>> To unsubscribe from this group and stop receiving emails from it, send
>>>> an email to [email protected].
>>>> To post to this group, send email to [email protected].
>>>> Visit this group at http://groups.google.com/group/tesseract-ocr.
>>>> To view this discussion on the web visit
>>>> https://groups.google.com/d/msgid/tesseract-ocr/5648C854.20700%40gmail.com
>>>> .
>>>>
>>>> For more options, visit https://groups.google.com/d/optout.
>>>>
>>>
>>>
>>

