After u have said, 

I tried in two ways and i am stuck at lstm step:

Training

command used: 

/home/p/Documents/T/tesseract-master/training/lstmtraining -U 
/home/p/Documents/T/img_frm_3/eng.unicharset \
>   --script_dir /home/p/Documents/T/TESS_4_ALPHA/langdata-master 
--debug_interval 100 \
>   --net_spec '[1,36,0,1 Ct5,5,16 Mp3,3 Lfys64 Lfx128 Lrx128 Lfx256 
O1c105]' \
>   --model_output /home/p/Documents/T/ \
>   --train_listfile /home/p/Documents/T/img_frm_3/eng.ArialBold.exp0.txt \
>   --eval_listfile /home/p/Documents/T/img_frm_3/eng.ArialBold.exp0.txt \
>   --max_iterations 5000 &>/home/p/Documents/T/basetrain.log

tail -f basetrain.log 
Error getting is :


Deserialize header failed: BnO. 005 SUBHISHIs TOWN CENTRE
Deserialize header failed: MOKILA SHAKARPALLY
Deserialize header failed: PHONE: 040-8989898989
Load of page 0 failed!
Load of images failed!!
Deserialize header failed: TIN: 8989898989
Deserialize header failed: Station 1D: 01 Time: 03:26:46 PM
Deserialize header failed: CASHIER ID:; 3001 Date: 21-02-2017
Deserialize header failed: (null)
Deserialize header failed: (null)








Fine tuning:

command used:-

/home/plianto/Documents/Tvat/tesseract-master/training/tesstrain.sh 
--fonts_dir /usr/share/fonts --lang eng --linedata_only \
  --training_text 
/home/plianto/Documents/Tvat/img_frm_3/eng.ArialBold.exp0.txt \
  --langdata_dir /home/plianto/Documents/Tvat/TESS_4_ALPHA/langdata-master  
--tessdata_dir /usr/share/tesseract-ocr/tessdata \
  --fontlist "Arial Bold" \
  --output_dir /home/plianto/Documents/Tvat/engoutput/

error:

=== Phase E: Generating lstmf files ===
Using TESSDATA_PREFIX=/usr/share/tesseract-ocr/tessdata
[Wed Apr 5 13:53:05 IST 2017] /usr/local/bin/tesseract 
/tmp/tmp.KTk3WgBTWk/eng/eng.Arial_Bold.exp0.tif 
/tmp/tmp.KTk3WgBTWk/eng/eng.Arial_Bold.exp0 lstm.train
read_params_file: Can't open lstm.train
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
ERROR: /tmp/tmp.KTk3WgBTWk/eng/eng.Arial_Bold.exp0.lstmf does not exist or 
is not readable









On Wednesday, April 5, 2017 at 9:07:40 AM UTC+5:30, shree wrote:
>
> Read
>
> https://github.com/tesseract-ocr/tesseract/wiki/4.0-with-LSTM
>
> https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00
>
>
> https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00---Finetune
>
>
> https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00---Replacing-Top-Layer-Example
>
>
> https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00---Replace-Top-Layer
>
> and
>
> https://github.com/tesseract-ocr/tesseract/wiki/Documentation
>
> https://github.com/tesseract-ocr/tesseract/wiki/Fonts
>
> https://github.com/tesseract-ocr/tesseract/wiki/Command-Line-Usage
>
> https://github.com/tesseract-ocr/tesseract/wiki/ImproveQuality
>
> https://github.com/tesseract-ocr/tesseract/wiki/FAQ
>
>
>
>
> ShreeDevi
> ____________________________________________________________
> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>
> On Wed, Apr 5, 2017 at 12:54 AM, <[email protected] <javascript:>> wrote:
>
>> Can you please post some experiences in this post, as there are no posts 
>> to train tesseract 4.
>>
>> 1)And also, is there any way to add the new trained data file to old 
>> trained data file, without replacing the old file.
>> 2)If we dont know what font we may get in our images, then how should we 
>> proceed in training the tessract 
>>
>> On Tuesday, April 4, 2017 at 9:27:06 PM UTC+5:30, Saurabh Srivastav wrote:
>>>
>>> Yes, i trained my tesseract for eng font and make them read the 
>>> characters from image.
>>>
>>>> thanks,
>>>>> Saurabh Srivastav
>>>>>
>>>> -- 
>> You received this message because you are subscribed to the Google Groups 
>> "tesseract-ocr" group.
>> To unsubscribe from this group and stop receiving emails from it, send an 
>> email to [email protected] <javascript:>.
>> To post to this group, send email to [email protected] 
>> <javascript:>.
>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>> To view this discussion on the web visit 
>> https://groups.google.com/d/msgid/tesseract-ocr/9c88494c-6d80-4b31-b247-dbbacd48bc19%40googlegroups.com
>>  
>> <https://groups.google.com/d/msgid/tesseract-ocr/9c88494c-6d80-4b31-b247-dbbacd48bc19%40googlegroups.com?utm_medium=email&utm_source=footer>
>> .
>>
>> For more options, visit https://groups.google.com/d/optout.
>>
>
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/6e9e098f-da2f-4c4a-a866-24f9938bdb1b%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to