On Mon, May 20, 2013 at 4:08 PM, <[email protected]> wrote: > > > El dilluns 20 de maig de 2013 13:29:38 UTC, jimregan va escriure: > >> On Saturday, 18 May 2013 12:51:54 UTC+1, [email protected] wrote: >> >>> Hi, >>> >>> >> Hi Fran. >> >> >>> If I wanted to integrate a "spellchecker" (or wordlist) other than the >>> DAWG one that is bundled with Tesseract, how might I go about it ? >>> >>> >> There was a version of Tesseract that did this, using OpenFST, in one of >> the Android trees (I think the original AOSP tree), but you'd have to dig >> through old revisions to find it. >> >> > > Looks like it is this one: > > > https://android.googlesource.com/platform/external/tesseract/+/d544c9231465999ad600ec13614b4d69d351798d/ > > The date is 3 years and 10 months ago. Have any substantial improvements > in OCR quality been made in the main branch since then ? e.g. If I just > work with this branch will I get much worse results than using HEAD ? > > It looks like 2.04 version... maybe it is even older... e.g. aspirin directory was empty in r12[1] why there[2] are files...
[1] https://code.google.com/p/tesseract-ocr/source/browse/trunk/?r=12#trunk%2Faspirin%253Fstate%253Dclosed [2] https://android.googlesource.com/platform/external/tesseract/+/d544c9231465999ad600ec13614b4d69d351798d/aspirin/ -- -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en --- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. For more options, visit https://groups.google.com/groups/opt_out.

