You do not have the LSTM.train config file. - excuse the brevity, sent from mobile
On 05-Apr-2017 1:55 PM, <srns...@gmail.com> wrote: > After u have said, > > I tried in two ways and i am stuck at lstm step: > > Training > > command used: > > /home/p/Documents/T/tesseract-master/training/lstmtraining -U > /home/p/Documents/T/img_frm_3/eng.unicharset \ > > --script_dir /home/p/Documents/T/TESS_4_ALPHA/langdata-master > --debug_interval 100 \ > > --net_spec '[1,36,0,1 Ct5,5,16 Mp3,3 Lfys64 Lfx128 Lrx128 Lfx256 > O1c105]' \ > > --model_output /home/p/Documents/T/ \ > > --train_listfile /home/p/Documents/T/img_frm_3/eng.ArialBold.exp0.txt > \ > > --eval_listfile /home/p/Documents/T/img_frm_3/eng.ArialBold.exp0.txt \ > > --max_iterations 5000 &>/home/p/Documents/T/basetrain.log > > tail -f basetrain.log > Error getting is : > > > Deserialize header failed: BnO. 005 SUBHISHIs TOWN CENTRE > Deserialize header failed: MOKILA SHAKARPALLY > Deserialize header failed: PHONE: 040-8989898989 > Load of page 0 failed! > Load of images failed!! > Deserialize header failed: TIN: 8989898989 > Deserialize header failed: Station 1D: 01 Time: 03:26:46 PM > Deserialize header failed: CASHIER ID:; 3001 Date: 21-02-2017 > Deserialize header failed: (null) > Deserialize header failed: (null) > > > > > > > > > Fine tuning: > > command used:- > > /home/plianto/Documents/Tvat/tesseract-master/training/tesstrain.sh > --fonts_dir /usr/share/fonts --lang eng --linedata_only \ > --training_text /home/plianto/Documents/Tvat/ > img_frm_3/eng.ArialBold.exp0.txt \ > --langdata_dir /home/plianto/Documents/Tvat/TESS_4_ALPHA/langdata-master > --tessdata_dir /usr/share/tesseract-ocr/tessdata \ > --fontlist "Arial Bold" \ > --output_dir /home/plianto/Documents/Tvat/engoutput/ > > error: > > === Phase E: Generating lstmf files === > Using TESSDATA_PREFIX=/usr/share/tesseract-ocr/tessdata > [Wed Apr 5 13:53:05 IST 2017] /usr/local/bin/tesseract > /tmp/tmp.KTk3WgBTWk/eng/eng.Arial_Bold.exp0.tif > /tmp/tmp.KTk3WgBTWk/eng/eng.Arial_Bold.exp0 lstm.train > read_params_file: Can't open lstm.train > Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica > Page 1 > ERROR: /tmp/tmp.KTk3WgBTWk/eng/eng.Arial_Bold.exp0.lstmf does not exist > or is not readable > > > > > > > > > > On Wednesday, April 5, 2017 at 9:07:40 AM UTC+5:30, shree wrote: >> >> Read >> >> https://github.com/tesseract-ocr/tesseract/wiki/4.0-with-LSTM >> >> https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00 >> >> https://github.com/tesseract-ocr/tesseract/wiki/TrainingTess >> eract-4.00---Finetune >> >> https://github.com/tesseract-ocr/tesseract/wiki/TrainingTess >> eract-4.00---Replacing-Top-Layer-Example >> >> https://github.com/tesseract-ocr/tesseract/wiki/TrainingTess >> eract-4.00---Replace-Top-Layer >> >> and >> >> https://github.com/tesseract-ocr/tesseract/wiki/Documentation >> >> https://github.com/tesseract-ocr/tesseract/wiki/Fonts >> >> https://github.com/tesseract-ocr/tesseract/wiki/Command-Line-Usage >> >> https://github.com/tesseract-ocr/tesseract/wiki/ImproveQuality >> >> https://github.com/tesseract-ocr/tesseract/wiki/FAQ >> >> >> >> >> ShreeDevi >> ____________________________________________________________ >> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com >> >> On Wed, Apr 5, 2017 at 12:54 AM, <srn...@gmail.com> wrote: >> >>> Can you please post some experiences in this post, as there are no posts >>> to train tesseract 4. >>> >>> 1)And also, is there any way to add the new trained data file to old >>> trained data file, without replacing the old file. >>> 2)If we dont know what font we may get in our images, then how should we >>> proceed in training the tessract >>> >>> On Tuesday, April 4, 2017 at 9:27:06 PM UTC+5:30, Saurabh Srivastav >>> wrote: >>>> >>>> Yes, i trained my tesseract for eng font and make them read the >>>> characters from image. >>>> >>>>> thanks, >>>>>> Saurabh Srivastav >>>>>> >>>>> -- >>> You received this message because you are subscribed to the Google >>> Groups "tesseract-ocr" group. >>> To unsubscribe from this group and stop receiving emails from it, send >>> an email to tesseract-oc...@googlegroups.com. >>> To post to this group, send email to tesser...@googlegroups.com. >>> Visit this group at https://groups.google.com/group/tesseract-ocr. >>> To view this discussion on the web visit https://groups.google.com/d/ms >>> gid/tesseract-ocr/9c88494c-6d80-4b31-b247-dbbacd48bc19%40goo >>> glegroups.com >>> <https://groups.google.com/d/msgid/tesseract-ocr/9c88494c-6d80-4b31-b247-dbbacd48bc19%40googlegroups.com?utm_medium=email&utm_source=footer> >>> . >>> >>> For more options, visit https://groups.google.com/d/optout. >>> >> >> -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to tesseract-ocr+unsubscr...@googlegroups.com. > To post to this group, send email to tesseract-ocr@googlegroups.com. > Visit this group at https://groups.google.com/group/tesseract-ocr. > To view this discussion on the web visit https://groups.google.com/d/ > msgid/tesseract-ocr/6e9e098f-da2f-4c4a-a866-24f9938bdb1b% > 40googlegroups.com > <https://groups.google.com/d/msgid/tesseract-ocr/6e9e098f-da2f-4c4a-a866-24f9938bdb1b%40googlegroups.com?utm_medium=email&utm_source=footer> > . > For more options, visit https://groups.google.com/d/optout. > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To post to this group, send email to tesseract-ocr@googlegroups.com. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduWguS5RyJnag5WzQ6-5qyAHoNnv841z8b8rvJV6umtP8Q%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.