Excuse me @Shree Devi Kumar can you please tell me whether data for training tesseract 4.0 would be better if the data has images which have paragraphed hand written texts or single character based texts as follows
On Wed, Jun 20, 2018 at 9:00 PM Shree Devi Kumar <[email protected]> wrote: > You will have better control on training if you use tesstrain.sh provided > with tesseract. > > On Wed, Jun 20, 2018 at 8:52 PM Navaneetha Bitla <[email protected]> > wrote: > >> http://www.1001fonts.com/handwritten-fonts.html. >> >> the above link has 1900+ fonts from that site i have downloaded the ttf >> files of fonts and converted to tiff files online. >> >> then i have trained the tiff files(fonts) using serak trainer. >> >> >> If you got the accuracy just forward the results so everyone can konw and >> will follw you. >> >> Thank you >> >> On Wed, Jun 20, 2018 at 3:13 PM, James Q <[email protected]> >> wrote: >> >>> I'm going to be using tesseract 4 and using the tesstrain.sh script. If >>> I come across things that improve accuracy though I will let you know. >>> >>> Where did you find 1300 handwriting fonts? >>> >>> On Tuesday, June 19, 2018 at 5:19:54 PM UTC+1, Navaneetha Bitla wrote: >>>> >>>> serak trainer using training tesseract 3.5. >>>> >>>> >>>> >>>> On Tue, Jun 19, 2018 at 9:29 PM, James Q <[email protected]> >>>> wrote: >>>> >>>>> Hi Navaneetha >>>>> I am also looking to start training tesseract using handwritten fonts >>>>> and am about to start setting up my training environment. Are you training >>>>> tesseract 4 by following the guide at >>>>> https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00 >>>>> ? >>>>> >>>>> If so are you fine tuning the existing english model, retraining just >>>>> the top layer(s) or training from scratch with your additional fonts? >>>>> >>>>> Thanks >>>>> Jim >>>>> >>>>> On Tuesday, June 19, 2018 at 10:30:30 AM UTC+1, Navaneetha Bitla wrote: >>>>>> >>>>>> Hi, this is Navaneetha >>>>>> >>>>>> i'm working in hand written character recognition project. >>>>>> >>>>>> I have trained 1300 different hand written fonts of english and moved >>>>>> the files into tessdata directory. >>>>>> >>>>>> tested tesseract using the below commands: >>>>>> >>>>>> $convert -density 300 input.png -depth 8 -strip -background white >>>>>> -alpha off out.tiff >>>>>> >>>>>> $tesseract out.tiff eng >>>>>> >>>>>> The input.png is of Alanis Handa font and i have trained this font >>>>>> but i'm not getting atleast 40% accuracy. >>>>>> >>>>>> Can someone help me. >>>>>> >>>>>> >>>>>> Thanks in advance. >>>>>> >>>>> -- >>>>> You received this message because you are subscribed to the Google >>>>> Groups "tesseract-ocr" group. >>>>> To unsubscribe from this group and stop receiving emails from it, send >>>>> an email to [email protected]. >>>>> To post to this group, send email to [email protected]. >>>>> Visit this group at https://groups.google.com/group/tesseract-ocr. >>>>> To view this discussion on the web visit >>>>> https://groups.google.com/d/msgid/tesseract-ocr/253906ac-fedf-4364-ad70-e745b8786c0d%40googlegroups.com >>>>> <https://groups.google.com/d/msgid/tesseract-ocr/253906ac-fedf-4364-ad70-e745b8786c0d%40googlegroups.com?utm_medium=email&utm_source=footer> >>>>> . >>>>> >>>>> For more options, visit https://groups.google.com/d/optout. >>>>> >>>> >>>> -- >>> You received this message because you are subscribed to the Google >>> Groups "tesseract-ocr" group. >>> To unsubscribe from this group and stop receiving emails from it, send >>> an email to [email protected]. >>> To post to this group, send email to [email protected]. >>> Visit this group at https://groups.google.com/group/tesseract-ocr. >>> To view this discussion on the web visit >>> https://groups.google.com/d/msgid/tesseract-ocr/29a1bc53-d127-407b-8611-0652821a0707%40googlegroups.com >>> <https://groups.google.com/d/msgid/tesseract-ocr/29a1bc53-d127-407b-8611-0652821a0707%40googlegroups.com?utm_medium=email&utm_source=footer> >>> . >>> >>> For more options, visit https://groups.google.com/d/optout. >>> >> >> -- >> You received this message because you are subscribed to the Google Groups >> "tesseract-ocr" group. >> To unsubscribe from this group and stop receiving emails from it, send an >> email to [email protected]. >> To post to this group, send email to [email protected]. >> Visit this group at https://groups.google.com/group/tesseract-ocr. >> To view this discussion on the web visit >> https://groups.google.com/d/msgid/tesseract-ocr/CABbi8QfEe2r%2BynHHEGfr8_b-x5KOf2yJ1xr%2Be7e1sDCKxqUFXA%40mail.gmail.com >> <https://groups.google.com/d/msgid/tesseract-ocr/CABbi8QfEe2r%2BynHHEGfr8_b-x5KOf2yJ1xr%2Be7e1sDCKxqUFXA%40mail.gmail.com?utm_medium=email&utm_source=footer> >> . >> For more options, visit https://groups.google.com/d/optout. >> > > > -- > > ____________________________________________________________ > भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > To post to this group, send email to [email protected]. > Visit this group at https://groups.google.com/group/tesseract-ocr. > To view this discussion on the web visit > https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduU4w%2BjPakoNOdzq6QyS3nF9rAp9gHSPUkKddioZTXsgyw%40mail.gmail.com > <https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduU4w%2BjPakoNOdzq6QyS3nF9rAp9gHSPUkKddioZTXsgyw%40mail.gmail.com?utm_medium=email&utm_source=footer> > . > For more options, visit https://groups.google.com/d/optout. > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAD_EDkYpHgUU7O%3DRRTP--3-QLbSQntRgbuTeH5vPcW_gStF-zQ%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.

