Tesseract4 LSTM training is line based.

On Thu 21 Jun, 2018, 12:25 PM chandra churh chatterjee, <
[email protected]> wrote:

> Excuse me @Shree Devi Kumar can you please tell me whether data for
> training tesseract 4.0 would be better if the data has images which have
> paragraphed hand written texts
> or single character based texts as follows
>
> On Wed, Jun 20, 2018 at 9:00 PM Shree Devi Kumar <[email protected]>
> wrote:
>
>> You will have better control on training if you use tesstrain.sh provided
>> with tesseract.
>>
>> On Wed, Jun 20, 2018 at 8:52 PM Navaneetha Bitla <[email protected]>
>> wrote:
>>
>>> http://www.1001fonts.com/handwritten-fonts.html.
>>>
>>> the above link has 1900+ fonts from that site i have downloaded the ttf
>>> files of fonts and converted to tiff files online.
>>>
>>> then i have trained the tiff files(fonts) using serak trainer.
>>>
>>>
>>> If you got the accuracy just forward the results so everyone can konw
>>> and will follw you.
>>>
>>> Thank you
>>>
>>> On Wed, Jun 20, 2018 at 3:13 PM, James Q <[email protected]>
>>> wrote:
>>>
>>>> I'm going to be using tesseract 4 and using the tesstrain.sh script. If
>>>> I come across things that improve accuracy though I will let you know.
>>>>
>>>> Where did you find 1300 handwriting fonts?
>>>>
>>>> On Tuesday, June 19, 2018 at 5:19:54 PM UTC+1, Navaneetha Bitla wrote:
>>>>>
>>>>> serak trainer using training tesseract 3.5.
>>>>>
>>>>>
>>>>>
>>>>> On Tue, Jun 19, 2018 at 9:29 PM, James Q <[email protected]>
>>>>> wrote:
>>>>>
>>>>>> Hi Navaneetha
>>>>>> I am also looking to start training tesseract using handwritten fonts
>>>>>> and am about to start setting up my training environment. Are you 
>>>>>> training
>>>>>> tesseract 4 by following the guide at
>>>>>> https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00
>>>>>> ?
>>>>>>
>>>>>> If so are you fine tuning the existing english model, retraining just
>>>>>> the top layer(s) or training from scratch with your additional fonts?
>>>>>>
>>>>>> Thanks
>>>>>> Jim
>>>>>>
>>>>>> On Tuesday, June 19, 2018 at 10:30:30 AM UTC+1, Navaneetha Bitla
>>>>>> wrote:
>>>>>>>
>>>>>>> Hi, this is Navaneetha
>>>>>>>
>>>>>>> i'm working in hand written character recognition project.
>>>>>>>
>>>>>>> I have trained 1300 different hand written fonts of english and
>>>>>>> moved the files into tessdata directory.
>>>>>>>
>>>>>>> tested tesseract using the below commands:
>>>>>>>
>>>>>>> $convert -density 300 input.png -depth 8 -strip -background white
>>>>>>> -alpha off out.tiff
>>>>>>>
>>>>>>>  $tesseract out.tiff eng
>>>>>>>
>>>>>>> The input.png is of Alanis Handa font and i have trained this font
>>>>>>> but i'm not getting atleast 40% accuracy.
>>>>>>>
>>>>>>> Can someone help me.
>>>>>>>
>>>>>>>
>>>>>>> Thanks in advance.
>>>>>>>
>>>>>> --
>>>>>> You received this message because you are subscribed to the Google
>>>>>> Groups "tesseract-ocr" group.
>>>>>> To unsubscribe from this group and stop receiving emails from it,
>>>>>> send an email to [email protected].
>>>>>> To post to this group, send email to [email protected].
>>>>>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>>>>>> To view this discussion on the web visit
>>>>>> https://groups.google.com/d/msgid/tesseract-ocr/253906ac-fedf-4364-ad70-e745b8786c0d%40googlegroups.com
>>>>>> <https://groups.google.com/d/msgid/tesseract-ocr/253906ac-fedf-4364-ad70-e745b8786c0d%40googlegroups.com?utm_medium=email&utm_source=footer>
>>>>>> .
>>>>>>
>>>>>> For more options, visit https://groups.google.com/d/optout.
>>>>>>
>>>>>
>>>>> --
>>>> You received this message because you are subscribed to the Google
>>>> Groups "tesseract-ocr" group.
>>>> To unsubscribe from this group and stop receiving emails from it, send
>>>> an email to [email protected].
>>>> To post to this group, send email to [email protected].
>>>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>>>> To view this discussion on the web visit
>>>> https://groups.google.com/d/msgid/tesseract-ocr/29a1bc53-d127-407b-8611-0652821a0707%40googlegroups.com
>>>> <https://groups.google.com/d/msgid/tesseract-ocr/29a1bc53-d127-407b-8611-0652821a0707%40googlegroups.com?utm_medium=email&utm_source=footer>
>>>> .
>>>>
>>>> For more options, visit https://groups.google.com/d/optout.
>>>>
>>>
>>> --
>>> You received this message because you are subscribed to the Google
>>> Groups "tesseract-ocr" group.
>>> To unsubscribe from this group and stop receiving emails from it, send
>>> an email to [email protected].
>>> To post to this group, send email to [email protected].
>>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>>> To view this discussion on the web visit
>>> https://groups.google.com/d/msgid/tesseract-ocr/CABbi8QfEe2r%2BynHHEGfr8_b-x5KOf2yJ1xr%2Be7e1sDCKxqUFXA%40mail.gmail.com
>>> <https://groups.google.com/d/msgid/tesseract-ocr/CABbi8QfEe2r%2BynHHEGfr8_b-x5KOf2yJ1xr%2Be7e1sDCKxqUFXA%40mail.gmail.com?utm_medium=email&utm_source=footer>
>>> .
>>> For more options, visit https://groups.google.com/d/optout.
>>>
>>
>>
>> --
>>
>> ____________________________________________________________
>> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>>
>> --
>> You received this message because you are subscribed to the Google Groups
>> "tesseract-ocr" group.
>> To unsubscribe from this group and stop receiving emails from it, send an
>> email to [email protected].
>> To post to this group, send email to [email protected].
>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>> To view this discussion on the web visit
>> https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduU4w%2BjPakoNOdzq6QyS3nF9rAp9gHSPUkKddioZTXsgyw%40mail.gmail.com
>> <https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduU4w%2BjPakoNOdzq6QyS3nF9rAp9gHSPUkKddioZTXsgyw%40mail.gmail.com?utm_medium=email&utm_source=footer>
>> .
>> For more options, visit https://groups.google.com/d/optout.
>>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> To post to this group, send email to [email protected].
> Visit this group at https://groups.google.com/group/tesseract-ocr.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/CAD_EDkYpHgUU7O%3DRRTP--3-QLbSQntRgbuTeH5vPcW_gStF-zQ%40mail.gmail.com
> <https://groups.google.com/d/msgid/tesseract-ocr/CAD_EDkYpHgUU7O%3DRRTP--3-QLbSQntRgbuTeH5vPcW_gStF-zQ%40mail.gmail.com?utm_medium=email&utm_source=footer>
> .
> For more options, visit https://groups.google.com/d/optout.
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduUWv1HE9aCFO--LBdougpNEg46VZWv2fBS%3DZ83e%3DLfo9Q%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to