> tessdata repo supports both --oem 0 and --oem 1, but the files are older and may NOT be fully compatible with current code.
The results may vary depending on the language and the --oem value used. I
have NOT tested this much, since the newer traineddata files give better
accuracy for Indian languages.

ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

On Sat, Mar 3, 2018 at 9:42 AM, ShreeDevi Kumar <shreesh...@gmail.com> wrote:

> Hi Simon,
>
> If you are planning to package using 4.00alpha from master branch, please
> use traineddata files from tessdata_fast. These are the files that have
> been shipped for Ubuntu 18.04 and included in Debian. See
> https://github.com/tesseract-ocr/tesseract/wiki for some links.
>
> You can update the wiki page re cygwin.
>
> FYI - tessdata repo supports both --oem 0 and --oem 1, but the files are
> older and may NOT be fully compatible with current code.
>
> tessdata_best has files which can be used for further finetune/plusminus
> type training.
>
> *tessdata_fast has faster integer models and is the recommended one to
> use for OCR.*
>
> ShreeDevi
>
> On Sat, Mar 3, 2018 at 2:04 AM, Simon Eigeldinger <
> simon.eigeldin...@vol.at> wrote:
>
>> Hi all,
>>
>> I just looked at the git commits for tesseract and read that there have
>> been changes to the OCR modes.
>> Are the 3 tessdata sets still valid?
>> tessdata_fast and tessdata_best have been updated, so I guess those
>> reflect the latest developments, but tessdata hasn't had an update since
>> September.
>> Is that 3rd set still usable, or should that one not be used anymore?
>> On the wiki
>> https://github.com/tesseract-ocr/tesseract/wiki/Data-Files
>> it's still listed as usable.
>>
>> Any suggestions?
>>
>> Greetings and thanks,
>> Simon
>>
>> --
>> You received this message because you are subscribed to the Google Groups
>> "tesseract-ocr" group.
>> To unsubscribe from this group and stop receiving emails from it, send an
>> email to tesseract-ocr+unsubscr...@googlegroups.com.
>> To post to this group, send email to tesseract-ocr@googlegroups.com.
>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>> To view this discussion on the web visit
>> https://groups.google.com/d/msgid/tesseract-ocr/3c4c0b75-b411-3227-26e1-d1d2485b9572%40vol.at.
>> For more options, visit https://groups.google.com/d/optout.
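
For anyone scripting this, the pairing of data set and engine mode discussed in this thread could be sketched roughly as below. This is only an illustration, not part of tesseract itself: the `SUPPORTED_OEMS` table, the `build_command` helper and the `data_root` directory layout (one subdirectory per cloned data repo) are hypothetical names made up for the example. The thread only states that tessdata supports both --oem 0 and --oem 1; the assumption that tessdata_fast and tessdata_best carry LSTM models only (--oem 1) is added here and should be checked against the current wiki.

```python
from pathlib import Path

# Engine modes each traineddata set is assumed to support.
# 0 = legacy engine, 1 = LSTM engine.
SUPPORTED_OEMS = {
    "tessdata": {0, 1},     # older files; both modes, per the thread
    "tessdata_fast": {1},   # integer models, recommended for OCR (assumed LSTM only)
    "tessdata_best": {1},   # base for fine-tuning (assumed LSTM only)
}


def build_command(image, out_base, lang, oem, data_set, data_root):
    """Return a tesseract argument list, rejecting unsupported oem/data pairs.

    data_root is a hypothetical local directory that contains one
    subdirectory per data repo (tessdata, tessdata_fast, tessdata_best).
    """
    if data_set not in SUPPORTED_OEMS:
        raise ValueError(f"unknown data set: {data_set}")
    if oem not in SUPPORTED_OEMS[data_set]:
        raise ValueError(f"{data_set} does not support --oem {oem}")
    return [
        "tesseract", str(image), str(out_base),
        "-l", lang,
        "--oem", str(oem),
        "--tessdata-dir", str(Path(data_root) / data_set),
    ]
```

The returned list can be handed to `subprocess.run` as-is. Asking for `--oem 0` together with `tessdata_fast` raises a `ValueError` instead of letting the run fail later.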