Re: [tesseract-ocr] train tesseract OCR 4.0

2018-10-22 Thread Shree Devi Kumar
Please see https://github.com/tesseract-ocr/tesseract/wiki and
https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00#fine-tuning-for-impact

On Mon, 22 Oct 2018, 06:59 kislay bajpai,  wrote:

> Hello,
>
> Sorry to disturb you, actually i am very new with tesseract and getting no
> idea, how to train it.
> Please help me out. I am in big trouble.
>
> version - tesseract4.0 alpha
> OS - ubuntu16.04 and RHEL 7.3 (any one i can use)
>
> On Tue, Oct 16, 2018 at 7:10 PM Shree Devi Kumar 
> wrote:
>
>> Please do not use tesseract 4.0 alpha. There have been many changes since
>> then.
>>
>> Use the latest code from github, which is 4.0.0-rc3 or install from
>> Alex's PPA or from ub mannheim (for Windows).
>>
>> Please read the wiki pages about training for new font for tesseract 4 -
>> fine tuning for Impact.
>>
>> On Tue, 16 Oct 2018, 08:33 kislay bajpai, 
>> wrote:
>>
>>> Hello Shree,
>>>
>>> I am confused how to train tesseract 4.0 alpha for new font (E 13B).
>>> Please help me for it.
>>> .
>>>
>> --
>> You received this message because you are subscribed to the Google Groups
>> "tesseract-ocr" group.
>> To unsubscribe from this group and stop receiving emails from it, send an
>> email to tesseract-ocr+unsubscr...@googlegroups.com.
>> To post to this group, send email to tesseract-ocr@googlegroups.com.
>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>> To view this discussion on the web visit
>> https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduU1ZNHmbkPAraFAO2a7AzQTwDyGi9%3D9ZAs8ipBPU%2B1NMw%40mail.gmail.com
>> 
>> .
>> For more options, visit https://groups.google.com/d/optout.
>>
>
>
> --
> Thanks and regards
> Kislay Bajpai
>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to tesseract-ocr+unsubscr...@googlegroups.com.
> To post to this group, send email to tesseract-ocr@googlegroups.com.
> Visit this group at https://groups.google.com/group/tesseract-ocr.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/CAKPmCYj_E-TnZxuyzZstJSHDDZydistcaM1ik0S6%2B-ZS1kRX0w%40mail.gmail.com
> 
> .
> For more options, visit https://groups.google.com/d/optout.
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduW144PVh%3Dr7wjm4aEFdaXw9Q3Zg_UCF0Zfd5Buq06CO9Q%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.


Re: [tesseract-ocr] train tesseract OCR 4.0

2018-10-22 Thread kislay bajpai
Hello,

Sorry to disturb you, actually i am very new with tesseract and getting no
idea, how to train it.
Please help me out. I am in big trouble.

version - tesseract4.0 alpha
OS - ubuntu16.04 and RHEL 7.3 (any one i can use)

On Tue, Oct 16, 2018 at 7:10 PM Shree Devi Kumar 
wrote:

> Please do not use tesseract 4.0 alpha. There have been many changes since
> then.
>
> Use the latest code from github, which is 4.0.0-rc3 or install from Alex's
> PPA or from ub mannheim (for Windows).
>
> Please read the wiki pages about training for new font for tesseract 4 -
> fine tuning for Impact.
>
> On Tue, 16 Oct 2018, 08:33 kislay bajpai, 
> wrote:
>
>> Hello Shree,
>>
>> I am confused how to train tesseract 4.0 alpha for new font (E 13B).
>> Please help me for it.
>> .
>>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to tesseract-ocr+unsubscr...@googlegroups.com.
> To post to this group, send email to tesseract-ocr@googlegroups.com.
> Visit this group at https://groups.google.com/group/tesseract-ocr.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduU1ZNHmbkPAraFAO2a7AzQTwDyGi9%3D9ZAs8ipBPU%2B1NMw%40mail.gmail.com
> 
> .
> For more options, visit https://groups.google.com/d/optout.
>


-- 
Thanks and regards
Kislay Bajpai

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAKPmCYj_E-TnZxuyzZstJSHDDZydistcaM1ik0S6%2B-ZS1kRX0w%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.


Re: [tesseract-ocr] train tesseract OCR 4.0

2018-10-16 Thread kislay bajpai
Hello Shree, 

I am confused how to train tesseract 4.0 alpha for new font (E 13B). Please 
help me for it.

On Thursday, March 23, 2017 at 5:24:59 PM UTC+5:30, shree wrote:
>
> To read characters from an image, it is not necessary to train it. Just 
> use an appropriate traineddata.
>
> Training is required only if it is  a new language or font or some such 
> special circumstance.
>
> Read the wiki for documentation.
>
> https://github.com/tesseract-ocr/tesseract/wiki/Command-Line-Usage
>
> https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract
>
> ShreeDevi
> 
> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>
> On Wed, Mar 22, 2017 at 10:00 PM, Saurabh Srivastav <
> saurabhkum...@gmail.com > wrote:
>
>> Thank you shree for your valuable reply. But now i have created box files 
>> for a particuler image and trained it..but still i am missing something, 
>> may you please help me what i have to do after creating box file for that 
>> image and make tesseract to read the characters from that image.
>>
>> thanks and regards.
>>
>> On Friday, March 3, 2017 at 12:53:31 PM UTC+5:30, shree wrote:
>>>
>>> screenshot of warning  means that your image does not have resolution 
>>> info. Your OCR output file should have been created.
>>>
>>> Training 4.0 is not easy. Please see 
>>> https://github.com/tesseract-ocr/tesseract/wiki/4.0-with-LSTM
>>>
>>> ShreeDevi
>>> 
>>> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>>>
>>> On Fri, Mar 3, 2017 at 12:17 PM, Saurabh Srivastav  
>>> wrote:
>>>
 how to train tesseract 4.0. Please help me..

 thanks,
 Saurabh Srivastav

 -- 
 You received this message because you are subscribed to the Google 
 Groups "tesseract-ocr" group.
 To unsubscribe from this group and stop receiving emails from it, send 
 an email to tesseract-oc...@googlegroups.com.
 To post to this group, send email to tesser...@googlegroups.com.
 Visit this group at https://groups.google.com/group/tesseract-ocr.
 To view this discussion on the web visit 
 https://groups.google.com/d/msgid/tesseract-ocr/f1782fd1-97a1-40db-8ba0-f003052f39ae%40googlegroups.com
  
 
 .
 For more options, visit https://groups.google.com/d/optout.

>>>
>>> -- 
>> You received this message because you are subscribed to the Google Groups 
>> "tesseract-ocr" group.
>> To unsubscribe from this group and stop receiving emails from it, send an 
>> email to tesseract-oc...@googlegroups.com .
>> To post to this group, send email to tesser...@googlegroups.com 
>> .
>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>> To view this discussion on the web visit 
>> https://groups.google.com/d/msgid/tesseract-ocr/14d1eb0f-7881-4d71-82ba-25e85f8867fa%40googlegroups.com
>>  
>> 
>> .
>> For more options, visit https://groups.google.com/d/optout.
>>
>
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/e13b5c9d-d1a2-48e7-b55c-7f26e8ec110d%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


Re: [tesseract-ocr] train tesseract OCR 4.0

2017-04-05 Thread srnsp92
You can use *.* when identifying the files.. but you should be careful only 
image files are only supplied... as it can take all available files, 
because * means it takes input for all the files.

1)I request you can help me with posts i had posted today.. 
2) And please guide how can i generate lstm files for images which i have 
to use..
and pls explain how you have followed...


On Tuesday, April 4, 2017 at 9:38:24 PM UTC+5:30, Saurabh Srivastav wrote:
>
> thank you shree ,
> you always help me.
>
> but i still have one problem that i wrote a bash script which trace the 
> all images with .jpg extension and make their output files as the name of 
> image.
> but i want that when i run script it trace more images with some different 
> extensions like .jpg , .jpeg , .png .is it possible? if it is, then please 
> help me out.
>
>
> thank you shree,
>
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/714a18af-d711-4cee-8a3c-1d109e5bc0f5%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


Re: [tesseract-ocr] train tesseract OCR 4.0

2017-04-04 Thread Saurabh Srivastav
thank you shree ,
you always help me.

but i still have one problem that i wrote a bash script which trace the all 
images with .jpg extension and make their output files as the name of image.
but i want that when i run script it trace more images with some different 
extensions like .jpg , .jpeg , .png .is it possible? if it is, then please 
help me out.


thank you shree,

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/bd41e06d-9e2f-41c6-a237-4528dc7a8f13%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


Re: [tesseract-ocr] train tesseract OCR 4.0

2017-04-04 Thread ShreeDevi Kumar
Tesstrain.sh generates a file called eng.training_files.txt

You are using command without .text extension

Check the name of generated file and use that.

I have found that editing that file also gives errors.
- excuse the brevity, sent from mobile

On 04-Apr-2017 7:01 PM,  wrote:

> I am trying to tesseract 4,, and i am getting folowing error,,
>
> command used:
>
> mkdir -p /home/p/Documents/T/engoutput
> /home/p/Documents/T/tesseract-master/training/lstmtraining -U
> /home/p/Documents/T/img_frm_3/unicharset \
>   --script_dir /home/p/Documents/T/TESS_4_ALPHA/langdata-master
> --debug_interval 100 \
>   --train_listfile /home/p/Documents/T/TESS_4_
> ALPHA/langdata-master/eng/eng.training_files \
>   --eval_listfile /home/p/Documents/T/TESS_4_
> ALPHA/langdata-master/eng/eng.training_files \
>   --max_iterations 5000 &>/home/p/Documents/T/basetrain.log
>
> used for log:
> tail -f basetrain.log
> Failed to load list of training filenames from /home/p/Documents/T/TESS_4_
> ALPHA/langdata-master/eng/eng.training_files
> tail: basetrain.log: file truncated
>
>
>
> error getting:
> Failed to load list of training filenames from /home/p/Documents/T/TESS_4_
> ALPHA/langdata-master/eng/eng.training_files
>
>
>
>
> On Tuesday, April 4, 2017 at 6:23:33 PM UTC+5:30, shree wrote:
>>
>> See
>>
>> https://github.com/tesseract-ocr/tesseract/blob/master/train
>> ing/tesstrain.sh
>>
>> https://github.com/tesseract-ocr/tesseract/blob/master/train
>> ing/tesstrain_utils.sh
>>
>> https://github.com/tesseract-ocr/tesseract/blob/master/train
>> ing/language-specific.sh
>>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to tesseract-ocr+unsubscr...@googlegroups.com.
> To post to this group, send email to tesseract-ocr@googlegroups.com.
> Visit this group at https://groups.google.com/group/tesseract-ocr.
> To view this discussion on the web visit https://groups.google.com/d/
> msgid/tesseract-ocr/77c03857-e090-4a68-9cb9-505ff9ba52d4%
> 40googlegroups.com
> 
> .
> For more options, visit https://groups.google.com/d/optout.
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduVNi1K8LRrtHv0fGvWJysn--OSStW932s%2BiRYFPX8L3qw%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.


Re: [tesseract-ocr] train tesseract OCR 4.0

2017-04-04 Thread srnsp92
I am trying to tesseract 4,, and i am getting folowing error,, 

command used: 

mkdir -p /home/p/Documents/T/engoutput
/home/p/Documents/T/tesseract-master/training/lstmtraining -U 
/home/p/Documents/T/img_frm_3/unicharset \
  --script_dir /home/p/Documents/T/TESS_4_ALPHA/langdata-master 
--debug_interval 100 \
  --train_listfile 
/home/p/Documents/T/TESS_4_ALPHA/langdata-master/eng/eng.training_files \
  --eval_listfile 
/home/p/Documents/T/TESS_4_ALPHA/langdata-master/eng/eng.training_files \
  --max_iterations 5000 &>/home/p/Documents/T/basetrain.log

used for log:
tail -f basetrain.log 
Failed to load list of training filenames from 
/home/p/Documents/T/TESS_4_ALPHA/langdata-master/eng/eng.training_files
tail: basetrain.log: file truncated



error getting: 
Failed to load list of training filenames from 
/home/p/Documents/T/TESS_4_ALPHA/langdata-master/eng/eng.training_files




On Tuesday, April 4, 2017 at 6:23:33 PM UTC+5:30, shree wrote:
>
> See
>
>
> https://github.com/tesseract-ocr/tesseract/blob/master/training/tesstrain.sh
>
>
> https://github.com/tesseract-ocr/tesseract/blob/master/training/tesstrain_utils.sh
>
>
> https://github.com/tesseract-ocr/tesseract/blob/master/training/language-specific.sh
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/77c03857-e090-4a68-9cb9-505ff9ba52d4%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


Re: [tesseract-ocr] train tesseract OCR 4.0

2017-04-04 Thread ShreeDevi Kumar
See

https://github.com/tesseract-ocr/tesseract/blob/master/training/tesstrain.sh

https://github.com/tesseract-ocr/tesseract/blob/master/training/tesstrain_utils.sh

https://github.com/tesseract-ocr/tesseract/blob/master/training/language-specific.sh

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduV99at4Uzvyk4HxxMONL%3DB51V-MV7GS8HNk11ziqkD5xQ%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.


Re: [tesseract-ocr] train tesseract OCR 4.0

2017-04-04 Thread srnsp92
Hello ShreeDevi,

https://medium.com/apegroup-texts/training-tesseract-for-labels-receipts-and-such-690f452e8f79

In the link, we can see a full fledged tutorial of tesseract 3.0 version, 
of using it and training it. Can you please clarify the below points...?

https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00

But  in the github link, i feel its good if they elaborate more..

1) How should i train tesseract if i dont know or i may get random fonts in 
image files. ?

2) In github tutorial, its specified that we should skip clustering steps 
(mftraining, cntraining, shapeclustering)  ?

3) And I want to generate a trained data file, and want to merge with 
tessdata(already present ) and dont want to replace it?


Can you please specify how to achieve these steps..?


Thank You.






On Monday, April 3, 2017 at 8:11:33 PM UTC+5:30, shree wrote:
>
> Saurabh,
>
> It depends on what you want to do with the bash script.
>
> Here is a sample of a script I used to compare results using diff tessdata 
> files by looping thru a set of image files. Google the bash commands to 
> figure out what they do!
>
> #!/bin/bash
> set -vx
> export TESSDATA_PREFIX=/mnt/c/Users/User/shree/tesseract-ocr
>
> img_files=$(ls *.jpeg)
> for img_file in ${img_files}; do
> time tesseract ${img_file} ${img_file%.*}-ssd  -l ssd
> time tesseract ${img_file} ${img_file%.*}-ssdsmall  --psm 6 --oem 
> 1 -l ssdsmall 
> time tesseract ${img_file} ${img_file%.*}-eng  --psm 6 --oem 1 -l 
> eng 
> done
>
>
> ShreeDevi
> 
> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>
> On Mon, Apr 3, 2017 at 7:10 PM, Saurabh Srivastav  > wrote:
>
>> hello  shree ! thank you for your help.
>> may you please help me how can i write a bash  script for tesseract.
>>
>> -- 
>> You received this message because you are subscribed to the Google Groups 
>> "tesseract-ocr" group.
>> To unsubscribe from this group and stop receiving emails from it, send an 
>> email to tesseract-oc...@googlegroups.com .
>> To post to this group, send email to tesser...@googlegroups.com 
>> .
>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>> To view this discussion on the web visit 
>> https://groups.google.com/d/msgid/tesseract-ocr/ac53f578-d14c-401b-b65e-b222fe4cb067%40googlegroups.com
>>  
>> 
>> .
>> For more options, visit https://groups.google.com/d/optout.
>>
>
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/358f92a6-2dba-4ef2-b02a-925accfa94ff%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


Re: [tesseract-ocr] train tesseract OCR 4.0

2017-04-03 Thread Saurabh Srivastav
shree,
 actually i want a bash script which run tesseract  and store ouput 
file in a folder..

kindly help me to make this type of bash script.


thank you.

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/9d3c893b-010e-44e9-b3c1-1b83e66c4649%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


Re: [tesseract-ocr] train tesseract OCR 4.0

2017-04-03 Thread ShreeDevi Kumar
Saurabh,

It depends on what you want to do with the bash script.

Here is a sample of a script I used to compare results using diff tessdata
files by looping thru a set of image files. Google the bash commands to
figure out what they do!

#!/bin/bash
set -vx
export TESSDATA_PREFIX=/mnt/c/Users/User/shree/tesseract-ocr

img_files=$(ls *.jpeg)
for img_file in ${img_files}; do
time tesseract ${img_file} ${img_file%.*}-ssd  -l ssd
time tesseract ${img_file} ${img_file%.*}-ssdsmall  --psm 6 --oem 1
-l ssdsmall
time tesseract ${img_file} ${img_file%.*}-eng  --psm 6 --oem 1 -l
eng
done


ShreeDevi

भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

On Mon, Apr 3, 2017 at 7:10 PM, Saurabh Srivastav <
saurabhkumarsrivas...@gmail.com> wrote:

> hello  shree ! thank you for your help.
> may you please help me how can i write a bash  script for tesseract.
>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to tesseract-ocr+unsubscr...@googlegroups.com.
> To post to this group, send email to tesseract-ocr@googlegroups.com.
> Visit this group at https://groups.google.com/group/tesseract-ocr.
> To view this discussion on the web visit https://groups.google.com/d/
> msgid/tesseract-ocr/ac53f578-d14c-401b-b65e-b222fe4cb067%
> 40googlegroups.com
> 
> .
> For more options, visit https://groups.google.com/d/optout.
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduWM5M%2BnQ%3Dbg_3EV%2Bbj6ViXYVCMgNWprQA6uwWr3vzdGuw%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.


Re: [tesseract-ocr] train tesseract OCR 4.0

2017-04-03 Thread Saurabh Srivastav
hello  shree ! thank you for your help.
may you please help me how can i write a bash  script for tesseract.

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/ac53f578-d14c-401b-b65e-b222fe4cb067%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


Re: [tesseract-ocr] train tesseract OCR 4.0

2017-03-22 Thread Saurabh Srivastav
Thank you shree for your valuable reply. But now i have created box files 
for a particuler image and trained it..but still i am missing something, 
may you please help me what i have to do after creating box file for that 
image and make tesseract to read the characters from that image.

thanks and regards.

On Friday, March 3, 2017 at 12:53:31 PM UTC+5:30, shree wrote:
>
> screenshot of warning  means that your image does not have resolution 
> info. Your OCR output file should have been created.
>
> Training 4.0 is not easy. Please see 
> https://github.com/tesseract-ocr/tesseract/wiki/4.0-with-LSTM
>
> ShreeDevi
> 
> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>
> On Fri, Mar 3, 2017 at 12:17 PM, Saurabh Srivastav  > wrote:
>
>> how to train tesseract 4.0. Please help me..
>>
>> thanks,
>> Saurabh Srivastav
>>
>> -- 
>> You received this message because you are subscribed to the Google Groups 
>> "tesseract-ocr" group.
>> To unsubscribe from this group and stop receiving emails from it, send an 
>> email to tesseract-oc...@googlegroups.com .
>> To post to this group, send email to tesser...@googlegroups.com 
>> .
>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>> To view this discussion on the web visit 
>> https://groups.google.com/d/msgid/tesseract-ocr/f1782fd1-97a1-40db-8ba0-f003052f39ae%40googlegroups.com
>>  
>> 
>> .
>> For more options, visit https://groups.google.com/d/optout.
>>
>
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/14d1eb0f-7881-4d71-82ba-25e85f8867fa%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


Re: [tesseract-ocr] train tesseract OCR 4.0

2017-03-02 Thread ShreeDevi Kumar
screenshot of warning  means that your image does not have resolution info.
Your OCR output file should have been created.

Training 4.0 is not easy. Please see
https://github.com/tesseract-ocr/tesseract/wiki/4.0-with-LSTM

ShreeDevi

भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

On Fri, Mar 3, 2017 at 12:17 PM, Saurabh Srivastav 
wrote:

> how to train tesseract 4.0. Please help me..
>
> thanks,
> Saurabh Srivastav
>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to tesseract-ocr+unsubscr...@googlegroups.com.
> To post to this group, send email to tesseract-ocr@googlegroups.com.
> Visit this group at https://groups.google.com/group/tesseract-ocr.
> To view this discussion on the web visit https://groups.google.com/d/
> msgid/tesseract-ocr/f1782fd1-97a1-40db-8ba0-f003052f39ae%
> 40googlegroups.com
> 
> .
> For more options, visit https://groups.google.com/d/optout.
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduU1SPaExBDbRd9euitkCFpXo3v8tpShnpuXU8g%3DivGBhQ%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.