Combine-lang-model only creates the starter traineddata. It is used as part of lstm training process. It cannot be used for recognition.
Training from scratch requires running the lstmtraing command. On Mon, Sep 9, 2019, 21:36 Nuno Feliciano <[email protected]> wrote: > > > > > Hi, > > I am trying to make a model from scratch. > I created a language using > combine_lang_model --input_unicharset > D:\software\Tesseract-OCR-4.0\tessdata\Latin.unicharset --script_dir > D:\software\Tesseract-OCR-4.0\tessdata --output_dir > D:\software\Tesseract-OCR-4.0\training\output *--lang ccy* > Than I put the generated ccy.traineddata file in tessdata folder and I > execute > tesseract --tessdata-dir D:\software\Tesseract-OCR-4.0\tessdata -l ccy > <file> stdout, which gives me > *Failed loading language 'ccy'* > Tesseract couldn't load any languages! > Could not initialize tesseract. > > tesseract --list-langs gives me > ccy > eng > osd > ... > > I got Latin.unicharset from > https://raw.githubusercontent.com/tesseract-ocr/langdata_lstm/master/Latin.unicharset > > Can anyone help? > > Thanks, > Nuno Feliciano > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > To view this discussion on the web visit > https://groups.google.com/d/msgid/tesseract-ocr/f0157ef9-7b83-4fa3-8cf5-3697514d6de0%40googlegroups.com > <https://groups.google.com/d/msgid/tesseract-ocr/f0157ef9-7b83-4fa3-8cf5-3697514d6de0%40googlegroups.com?utm_medium=email&utm_source=footer> > . > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduWbPxJJ%2BuXUMXuTg3mZsxWEu2Q63d-5pqTg1TqRrG_exA%40mail.gmail.com.

