Tesseract4 LSTM training is line based. On Thu 21 Jun, 2018, 12:25 PM chandra churh chatterjee, < [email protected]> wrote:
> Excuse me @Shree Devi Kumar can you please tell me whether data for > training tesseract 4.0 would be better if the data has images which have > paragraphed hand written texts > or single character based texts as follows > > On Wed, Jun 20, 2018 at 9:00 PM Shree Devi Kumar <[email protected]> > wrote: > >> You will have better control on training if you use tesstrain.sh provided >> with tesseract. >> >> On Wed, Jun 20, 2018 at 8:52 PM Navaneetha Bitla <[email protected]> >> wrote: >> >>> http://www.1001fonts.com/handwritten-fonts.html. >>> >>> the above link has 1900+ fonts from that site i have downloaded the ttf >>> files of fonts and converted to tiff files online. >>> >>> then i have trained the tiff files(fonts) using serak trainer. >>> >>> >>> If you got the accuracy just forward the results so everyone can konw >>> and will follw you. >>> >>> Thank you >>> >>> On Wed, Jun 20, 2018 at 3:13 PM, James Q <[email protected]> >>> wrote: >>> >>>> I'm going to be using tesseract 4 and using the tesstrain.sh script. If >>>> I come across things that improve accuracy though I will let you know. >>>> >>>> Where did you find 1300 handwriting fonts? >>>> >>>> On Tuesday, June 19, 2018 at 5:19:54 PM UTC+1, Navaneetha Bitla wrote: >>>>> >>>>> serak trainer using training tesseract 3.5. >>>>> >>>>> >>>>> >>>>> On Tue, Jun 19, 2018 at 9:29 PM, James Q <[email protected]> >>>>> wrote: >>>>> >>>>>> Hi Navaneetha >>>>>> I am also looking to start training tesseract using handwritten fonts >>>>>> and am about to start setting up my training environment. Are you >>>>>> training >>>>>> tesseract 4 by following the guide at >>>>>> https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00 >>>>>> ? >>>>>> >>>>>> If so are you fine tuning the existing english model, retraining just >>>>>> the top layer(s) or training from scratch with your additional fonts? >>>>>> >>>>>> Thanks >>>>>> Jim >>>>>> >>>>>> On Tuesday, June 19, 2018 at 10:30:30 AM UTC+1, Navaneetha Bitla >>>>>> wrote: >>>>>>> >>>>>>> Hi, this is Navaneetha >>>>>>> >>>>>>> i'm working in hand written character recognition project. >>>>>>> >>>>>>> I have trained 1300 different hand written fonts of english and >>>>>>> moved the files into tessdata directory. >>>>>>> >>>>>>> tested tesseract using the below commands: >>>>>>> >>>>>>> $convert -density 300 input.png -depth 8 -strip -background white >>>>>>> -alpha off out.tiff >>>>>>> >>>>>>> $tesseract out.tiff eng >>>>>>> >>>>>>> The input.png is of Alanis Handa font and i have trained this font >>>>>>> but i'm not getting atleast 40% accuracy. >>>>>>> >>>>>>> Can someone help me. >>>>>>> >>>>>>> >>>>>>> Thanks in advance. >>>>>>> >>>>>> -- >>>>>> You received this message because you are subscribed to the Google >>>>>> Groups "tesseract-ocr" group. >>>>>> To unsubscribe from this group and stop receiving emails from it, >>>>>> send an email to [email protected]. >>>>>> To post to this group, send email to [email protected]. >>>>>> Visit this group at https://groups.google.com/group/tesseract-ocr. >>>>>> To view this discussion on the web visit >>>>>> https://groups.google.com/d/msgid/tesseract-ocr/253906ac-fedf-4364-ad70-e745b8786c0d%40googlegroups.com >>>>>> <https://groups.google.com/d/msgid/tesseract-ocr/253906ac-fedf-4364-ad70-e745b8786c0d%40googlegroups.com?utm_medium=email&utm_source=footer> >>>>>> . >>>>>> >>>>>> For more options, visit https://groups.google.com/d/optout. >>>>>> >>>>> >>>>> -- >>>> You received this message because you are subscribed to the Google >>>> Groups "tesseract-ocr" group. >>>> To unsubscribe from this group and stop receiving emails from it, send >>>> an email to [email protected]. >>>> To post to this group, send email to [email protected]. >>>> Visit this group at https://groups.google.com/group/tesseract-ocr. >>>> To view this discussion on the web visit >>>> https://groups.google.com/d/msgid/tesseract-ocr/29a1bc53-d127-407b-8611-0652821a0707%40googlegroups.com >>>> <https://groups.google.com/d/msgid/tesseract-ocr/29a1bc53-d127-407b-8611-0652821a0707%40googlegroups.com?utm_medium=email&utm_source=footer> >>>> . >>>> >>>> For more options, visit https://groups.google.com/d/optout. >>>> >>> >>> -- >>> You received this message because you are subscribed to the Google >>> Groups "tesseract-ocr" group. >>> To unsubscribe from this group and stop receiving emails from it, send >>> an email to [email protected]. >>> To post to this group, send email to [email protected]. >>> Visit this group at https://groups.google.com/group/tesseract-ocr. >>> To view this discussion on the web visit >>> https://groups.google.com/d/msgid/tesseract-ocr/CABbi8QfEe2r%2BynHHEGfr8_b-x5KOf2yJ1xr%2Be7e1sDCKxqUFXA%40mail.gmail.com >>> <https://groups.google.com/d/msgid/tesseract-ocr/CABbi8QfEe2r%2BynHHEGfr8_b-x5KOf2yJ1xr%2Be7e1sDCKxqUFXA%40mail.gmail.com?utm_medium=email&utm_source=footer> >>> . >>> For more options, visit https://groups.google.com/d/optout. >>> >> >> >> -- >> >> ____________________________________________________________ >> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com >> >> -- >> You received this message because you are subscribed to the Google Groups >> "tesseract-ocr" group. >> To unsubscribe from this group and stop receiving emails from it, send an >> email to [email protected]. >> To post to this group, send email to [email protected]. >> Visit this group at https://groups.google.com/group/tesseract-ocr. >> To view this discussion on the web visit >> https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduU4w%2BjPakoNOdzq6QyS3nF9rAp9gHSPUkKddioZTXsgyw%40mail.gmail.com >> <https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduU4w%2BjPakoNOdzq6QyS3nF9rAp9gHSPUkKddioZTXsgyw%40mail.gmail.com?utm_medium=email&utm_source=footer> >> . >> For more options, visit https://groups.google.com/d/optout. >> > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > To post to this group, send email to [email protected]. > Visit this group at https://groups.google.com/group/tesseract-ocr. > To view this discussion on the web visit > https://groups.google.com/d/msgid/tesseract-ocr/CAD_EDkYpHgUU7O%3DRRTP--3-QLbSQntRgbuTeH5vPcW_gStF-zQ%40mail.gmail.com > <https://groups.google.com/d/msgid/tesseract-ocr/CAD_EDkYpHgUU7O%3DRRTP--3-QLbSQntRgbuTeH5vPcW_gStF-zQ%40mail.gmail.com?utm_medium=email&utm_source=footer> > . > For more options, visit https://groups.google.com/d/optout. > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduUWv1HE9aCFO--LBdougpNEg46VZWv2fBS%3DZ83e%3DLfo9Q%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.

