It would be nice to indicate step by step procedure to be followed for the generating <lang.traineddata file using cygwin on MS windows platform - for bvenefit of newbie
On Sun, Nov 15, 2015 at 11:30 PM, Marco Atzeri <[email protected]> wrote: > On 15/11/2015 18:45, Nick White wrote: > >> On Sun, Nov 15, 2015 at 09:16:29PM +0530, Sriranga(83yrsold) wrote: >> >>> Dear nick, >>> kindly clarify whether "make" file will work on windows "vista" since >>> binaries >>> for windows are not available for download at present? If so how to do? >>> >> >> No, it won't work on Windows, and I have no plans to make it do so. >> The Tesseract training tools it uses (tesstrain.sh etc.) don't work >> on Windows either, so there's no point in me spending time getting >> my tools to work on it. Besides, I am tired of wrestling with >> getting things to work on Windows these days. >> >> You could probably get it to work with Cygwin, if you really needed >> to, but I don't have the time, interest or knowledge to walk you >> through the exact steps. >> >> Nick >> > > On cygwin I already packaged the training utilities for 3.04.00. > and some training data. > > If anything else is needed, or does not work properly, > I will work on it. > > $ cygcheck -l tesseract-training-util > /usr/bin/ambiguous_words.exe > /usr/bin/classifier_tester.exe > /usr/bin/cntraining.exe > /usr/bin/combine_tessdata.exe > /usr/bin/dawg2wordlist.exe > /usr/bin/mftraining.exe > /usr/bin/set_unicharset_properties.exe > /usr/bin/shapeclustering.exe > /usr/bin/text2image.exe > /usr/bin/unicharset_extractor.exe > /usr/bin/wordlist2dawg.exe > /usr/bin/language-specific.sh > /usr/bin/tesstrain.sh > /usr/bin/tesstrain_utils.sh > > $ cygcheck -l tesseract-training-core > /usr/share/tessdata/training/Arabic.unicharset > /usr/share/tessdata/training/Arabic.xheights > /usr/share/tessdata/training/Armenian.unicharset > /usr/share/tessdata/training/Armenian.xheights > /usr/share/tessdata/training/Bengali.unicharset > /usr/share/tessdata/training/Bengali.xheights > /usr/share/tessdata/training/Bopomofo.unicharset > /usr/share/tessdata/training/Bopomofo.xheights > /usr/share/tessdata/training/Canadian_Aboriginal.unicharset > /usr/share/tessdata/training/Canadian_Aboriginal.xheights > /usr/share/tessdata/training/Cherokee.unicharset > /usr/share/tessdata/training/Cherokee.xheights > /usr/share/tessdata/training/common.punc > /usr/share/tessdata/training/common.unicharambigs > /usr/share/tessdata/training/Common.unicharset > /usr/share/tessdata/training/Cyrillic.unicharset > /usr/share/tessdata/training/Cyrillic.xheights > /usr/share/tessdata/training/Devanagari.unicharset > /usr/share/tessdata/training/Devanagari.xheights > /usr/share/tessdata/training/Ethiopic.unicharset > /usr/share/tessdata/training/Ethiopic.xheights > /usr/share/tessdata/training/font_properties > /usr/share/tessdata/training/forbidden_characters_default > /usr/share/tessdata/training/Georgian.unicharset > /usr/share/tessdata/training/Georgian.xheights > /usr/share/tessdata/training/Greek.unicharset > /usr/share/tessdata/training/Greek.xheights > /usr/share/tessdata/training/Gujarati.unicharset > /usr/share/tessdata/training/Gujarati.xheights > /usr/share/tessdata/training/Gurmukhi.unicharset > /usr/share/tessdata/training/Gurmukhi.xheights > /usr/share/tessdata/training/Han.unicharset > /usr/share/tessdata/training/Han.xheights > /usr/share/tessdata/training/Hangul.unicharset > /usr/share/tessdata/training/Hangul.xheights > /usr/share/tessdata/training/Hebrew.unicharset > /usr/share/tessdata/training/Hebrew.xheights > /usr/share/tessdata/training/Hiragana.unicharset > /usr/share/tessdata/training/Hiragana.xheights > /usr/share/tessdata/training/Kannada.unicharset > /usr/share/tessdata/training/Kannada.xheights > /usr/share/tessdata/training/Katakana.unicharset > /usr/share/tessdata/training/Katakana.xheights > /usr/share/tessdata/training/Khmer.unicharset > /usr/share/tessdata/training/Khmer.xheights > /usr/share/tessdata/training/Lao.unicharset > /usr/share/tessdata/training/Lao.xheights > /usr/share/tessdata/training/Latin.unicharset > /usr/share/tessdata/training/Latin.xheights > /usr/share/tessdata/training/Malayalam.unicharset > /usr/share/tessdata/training/Malayalam.xheights > /usr/share/tessdata/training/Myanmar.unicharset > /usr/share/tessdata/training/Myanmar.xheights > /usr/share/tessdata/training/Ogham.unicharset > /usr/share/tessdata/training/Ogham.xheights > /usr/share/tessdata/training/Oriya.unicharset > /usr/share/tessdata/training/Oriya.xheights > /usr/share/tessdata/training/Runic.unicharset > /usr/share/tessdata/training/Runic.xheights > /usr/share/tessdata/training/Sinhala.unicharset > /usr/share/tessdata/training/Sinhala.xheights > /usr/share/tessdata/training/Syriac.unicharset > /usr/share/tessdata/training/Syriac.xheights > /usr/share/tessdata/training/Tamil.unicharset > /usr/share/tessdata/training/Tamil.xheights > /usr/share/tessdata/training/Telugu.unicharset > /usr/share/tessdata/training/Telugu.xheights > /usr/share/tessdata/training/Thai.unicharset > /usr/share/tessdata/training/Thai.xheights > /usr/share/tessdata/training/Tibetan.unicharset > > > Marco > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > To post to this group, send email to [email protected]. > Visit this group at http://groups.google.com/group/tesseract-ocr. > To view this discussion on the web visit > https://groups.google.com/d/msgid/tesseract-ocr/5648C854.20700%40gmail.com > . > > For more options, visit https://groups.google.com/d/optout. > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at http://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CANKD7YxJ2ZmT-17sgZ-Z7TCOOY4NZ6Hec4kD4gb%2B5hTb2HRZdw%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.

