see https://github.com/tesseract-ocr/tesseract/blob/master/training/tesstrain.sh
if ((LINEDATA)); then
  phase_E_extract_features "lstm.train" 8 "lstmf"
  make__lstmdata
else
  phase_E_extract_features "box.train" 8 "tr"
  phase_C_cluster_prototypes "${TRAINING_DIR}/${LANG_CODE}.normproto"
  if [[ "${ENABLE_SHAPE_CLUSTERING}" == "y" ]]; then
    phase_S_cluster_shapes
  fi
  phase_M_cluster_microfeatures
  phase_B_generate_ambiguities
  make__traineddata
fi

--------------------

lstm.train is for LSTM training.
box.train is for training the Tesseract 3.0 legacy engine.

Please note that the current master code is for alpha testing of the 4.0 LSTM engine and will most probably drop support for the legacy engine. If you want to use and train the legacy Tesseract engine, please use the 3.05 branch of the GitHub repo.
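
To make the branching concrete, here is a minimal bash sketch of how the ((LINEDATA)) test picks between the two paths. It is only an illustration: select_phases is a hypothetical helper that prints which config and output extension would be used, it does not call any of the real phase functions. As far as I recall, tesstrain.sh sets LINEDATA from its --linedata_only option, but please check that against the script itself.

```shell
#!/usr/bin/env bash
# Illustrative stub only: select_phases is a hypothetical helper,
# not part of tesstrain.sh. It mirrors the ((LINEDATA)) test above.
select_phases() {
  local LINEDATA="${1:-0}"   # default to 0 (legacy path) when unset
  if (( LINEDATA )); then
    # Non-zero value: arithmetic test succeeds, LSTM path is taken.
    echo "lstm.train -> .lstmf (LSTM line data)"
  else
    # Zero: arithmetic test fails, legacy engine path is taken.
    echo "box.train -> .tr (legacy engine features)"
  fi
}

select_phases 1
select_phases 0
```

The point is that (( ... )) is an arithmetic test, so any non-zero LINEDATA selects the lstm.train branch and 0 falls through to the box.train branch with the clustering phases.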