Recommendation from Ray is to use tessdata_fast
On Sat, Mar 3, 2018 at 11:27 PM, Dusayanta Prasad <[email protected] > wrote: > Which produces the better result- tessdata_fast or tessdata_best? > > On Saturday, March 3, 2018 at 6:26:58 PM UTC+5:30, shree wrote: >> >> The exact directory will depend both on the type of training data, and >> your Linux distribtion. Possibilities are /usr/share/tesseract-ocr/t >> essdata or /usr/share/tessdata or /usr/share/tesseract-ocr/4.00/tessdata. >> >> ShreeDevi >> ____________________________________________________________ >> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com >> >> On Sat, Mar 3, 2018 at 6:24 PM, ShreeDevi Kumar <[email protected]> >> wrote: >> >>> Also check >>> >>> tesseract --list-langs >>> >>> ShreeDevi >>> ____________________________________________________________ >>> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com >>> >>> On Sat, Mar 3, 2018 at 6:22 PM, ShreeDevi Kumar <[email protected]> >>> wrote: >>> >>>> ls -l /home/dusayanta/tesseract/tessdata/eng.traineddata >>>> >>>> combine_tessdata -d /home/dusayanta/tesseract/tessdata/eng.traineddata >>>> >>>> >>>> ShreeDevi >>>> ____________________________________________________________ >>>> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com >>>> >>>> On Sat, Mar 3, 2018 at 5:57 PM, ShreeDevi Kumar <[email protected]> >>>> wrote: >>>> >>>>> No, I had not pre-processed the iame. >>>>> >>>>> I used tessdata_fast NOT tessdata_best. >>>>> >>>>> ShreeDevi >>>>> ____________________________________________________________ >>>>> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com >>>>> >>>>> On Sat, Mar 3, 2018 at 3:59 PM, Dusayanta Prasad <[email protected] >>>>> > wrote: >>>>> >>>>>> Please tell me one more thing. Before feeding the image to tesseract >>>>>> do you perform any kind of pre-processing like binarising the image or >>>>>> something like that? >>>>>> I didn't get the same result as yours even after trying Tesseract 4 >>>>>> with eng tessdata_best. >>>>>> >>>>>> On Saturday, March 3, 2018 at 3:38:07 PM UTC+5:30, shree wrote: >>>>>>> >>>>>>> Sure, if you are comfortable building software on Linux. You have to >>>>>>> make sure you have all the dependencies etc. >>>>>> >>>>>> -- >>>>>> You received this message because you are subscribed to the Google >>>>>> Groups "tesseract-ocr" group. >>>>>> To unsubscribe from this group and stop receiving emails from it, >>>>>> send an email to [email protected]. >>>>>> To post to this group, send email to [email protected]. >>>>>> Visit this group at https://groups.google.com/group/tesseract-ocr. >>>>>> To view this discussion on the web visit >>>>>> https://groups.google.com/d/msgid/tesseract-ocr/cdb166e1-456 >>>>>> 4-49b1-b661-ffa0f6f2e1f5%40googlegroups.com >>>>>> <https://groups.google.com/d/msgid/tesseract-ocr/cdb166e1-4564-49b1-b661-ffa0f6f2e1f5%40googlegroups.com?utm_medium=email&utm_source=footer> >>>>>> . >>>>>> For more options, visit https://groups.google.com/d/optout. >>>>>> >>>>> >>>>> >>>> >>> >> -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > To post to this group, send email to [email protected]. > Visit this group at https://groups.google.com/group/tesseract-ocr. > To view this discussion on the web visit https://groups.google.com/d/ > msgid/tesseract-ocr/d2752964-b4d1-496a-8260-d60428935b68% > 40googlegroups.com > <https://groups.google.com/d/msgid/tesseract-ocr/d2752964-b4d1-496a-8260-d60428935b68%40googlegroups.com?utm_medium=email&utm_source=footer> > . > For more options, visit https://groups.google.com/d/optout. > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduVbgRfq_xyw-NLuVmDSq%2BH7-0755_qW1d%3D0TXDVbr3z0g%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.

