That command applies to an older version of the source code. lstmtraining now needs a starter traineddata in place of --script_dir.
Please see the wiki page at
https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00#training-just-a-few-layers

ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

On Tue, Mar 13, 2018 at 1:00 PM, 이경준 <[email protected]> wrote:

> https://github.com/tesseract-ocr/tesseract/issues/549
>
> @harinath141 <https://github.com/harinath141> If you are getting a lot of
> these errors during finetune, try replace top layer training. You can use
> the box/tiff pairs generated for finetune. Commands will be similar to the
> following:
>
> mkdir -p ~/tesstutorial/tellayer_from_tel
>
> combine_tessdata -e ../tessdata/tel.traineddata \
>   ~/tesstutorial/tellayer_from_tel/tel.lstm
>
> lstmtraining -U ~/tesstutorial/tel/tel.unicharset \
>   --script_dir ../langdata --debug_interval 0 \
>   --continue_from ~/tesstutorial/tellayer_from_tel/tel.lstm \
>   --append_index 5 --net_spec '[Lfx256 O1c105]' \
>   --model_output ~/tesstutorial/tellayer_from_tel/tellayer \
>   --train_listfile ~/tesstutorial/tel/tel.training_files.txt \
>   --target_error_rate 0.01
>
> I found the article you wrote, but --script_dir doesn't work in
> lstmtraining.
>
> How do I change this option (flag)? What replaces it?
>
> On Tuesday, March 13, 2018 at 4:24:52 PM UTC+9, shree wrote:
>>
>> That info is given in the training wiki page.
>>
>> On Tue 13 Mar, 2018, 12:53 PM 이경준, <[email protected]> wrote:
>>
>>> There is nothing there about replacing the top layer ... ㅜㅜ
>>>
>>> On Tuesday, March 13, 2018 at 4:22:08 PM UTC+9, shree wrote:
>>>>
>>>> https://github.com/tesseract-ocr/tesseract/issues/1009
>>>>
>>>> Link works ok
>>>>
>>>> On Tue 13 Mar, 2018, 12:37 PM 이경준, <[email protected]> wrote:
>>>>
>>>>> Shreeshrii <https://github.com/Shreeshrii> commented on 29 Jun 2017
>>>>> <https://github.com/tesseract-ocr/tesseract/issues/1012#issuecomment-311892286>
>>>>>
>>>>> I think this happens when the complex characters in your training text
>>>>> are not part of the original Korean Unicharset that the 4.00.00alpha
>>>>> kor.traineddata was trained with.
>>>>>
>>>>> Do 'replace top layer' training instead of finetune. @abhishekchopde
>>>>> <https://github.com/abhishekchopde> has had good results with it -
>>>>> see #1009 <https://github.com/tesseract-ocr/tesseract/issues/1009>
>>>>>
>>>>> It will take longer than finetuning.
>>>>>
>>>>> Hi shree, I have a question ... you uploaded this passage, but this
>>>>> link is not right. Please check again.
>>>>>
>>>>> --
>>>>> You received this message because you are subscribed to the Google
>>>>> Groups "tesseract-ocr" group.
>>>>> To unsubscribe from this group and stop receiving emails from it, send
>>>>> an email to [email protected].
>>>>> To post to this group, send email to [email protected].
>>>>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>>>>> To view this discussion on the web visit
>>>>> https://groups.google.com/d/msgid/tesseract-ocr/2878cbf6-a064-4fe5-ab5c-cfcd54248e9e%40googlegroups.com.
>>>>> For more options, visit https://groups.google.com/d/optout.
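For anyone landing on this thread later, here is a rough sketch of how the quoted replace-top-layer command might look once -U and --script_dir are swapped for a starter traineddata passed via --traineddata, which is my reading of the wiki page linked above. It is not a tested recipe. The paths are copied from the quoted command, except STARTER_TRAINEDDATA, whose location is purely an assumption (the thread never says where that file lives), and the run helper and DRY_RUN switch, which are illustrative names of my own and not part of Tesseract. By default the script only prints the commands so they can be checked first.

```shell
# Sketch only: adapts the quoted replace-top-layer command to a starter traineddata.
# Paths are carried over from the quoted command unless marked as assumptions.
TUTORIAL_DIR="${TUTORIAL_DIR:-$HOME/tesstutorial}"
OUT_DIR="$TUTORIAL_DIR/tellayer_from_tel"

# ASSUMPTION: hypothetical location of the starter traineddata; the thread
# does not give one, so point this at your own file.
STARTER_TRAINEDDATA="${STARTER_TRAINEDDATA:-$TUTORIAL_DIR/tel/tel/tel.traineddata}"

# DRY_RUN=1 (the default) prints each command instead of running it.
DRY_RUN="${DRY_RUN:-1}"

# Illustrative helper (not a Tesseract tool): print or execute a command.
run() {
  if [ "$DRY_RUN" = "1" ]; then
    printf '%s\n' "$*"
  else
    "$@"
  fi
}

replace_top_layer() {
  run mkdir -p "$OUT_DIR" || return 1

  # Extract the LSTM model from the existing traineddata, as in the quoted command.
  run combine_tessdata -e ../tessdata/tel.traineddata "$OUT_DIR/tel.lstm" || return 1

  # --traineddata stands in for the old -U <unicharset> and --script_dir pair.
  run lstmtraining \
    --traineddata "$STARTER_TRAINEDDATA" \
    --debug_interval 0 \
    --continue_from "$OUT_DIR/tel.lstm" \
    --append_index 5 --net_spec '[Lfx256 O1c105]' \
    --model_output "$OUT_DIR/tellayer" \
    --train_listfile "$TUTORIAL_DIR/tel/tel.training_files.txt" \
    --target_error_rate 0.01
}

replace_top_layer
```

Run it as-is to inspect the three commands, then set DRY_RUN=0 to execute them. Note that the dry-run output drops the quotes around the --net_spec value, so it is meant for reading, not for pasting back into a shell.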

