Re: [tesseract-ocr] recognising roman with sanskrit diacritics

2018-07-02 Thread yajva
> Attached is the OCRed output for pages 13-24 of dark pdf with it.
>
> I am still training a different variation.
>
> On Wed, Jun 27, 2018 at 6:46 PM Shree Devi Kumar wrote:
>> ok. I will take a look.
>>
>> On Wed, Jun 27, 2018 at 5:04 PM y

Re: [tesseract-ocr] recognising roman with sanskrit diacritics

2018-06-27 Thread yajva
87635c1.exe
>
> I'd be also interested in testing of the tessdata manager, which should
> now also properly handle script tessdatas
>
> On Tue 26 Jun, 2018, 10:59 PM yajva wrote:
>> The doc is diff ver of the same text. Here's the doc used for the first.
>> png

Re: [tesseract-ocr] recognising roman with sanskrit diacritics

2018-06-21 Thread yajva
one more correction.
On Thursday, June 21, 2018 at 11:34:00 PM UTC+5:30, yajva wrote:
>
> done
>
> On Wednesday, June 20, 2018 at 9:05:01 PM UTC+5:30, shree wrote:
>>
>> I am attaching the OCRed text. Please correct it so that I can use as
>> groundtruth fo

Re: [tesseract-ocr] recognising roman with sanskrit diacritics

2018-06-21 Thread yajva
one a training for sanskrit for both devanagari and IAST but it
>> does not include cedilla for Sh
>>
>> I will add it and let you know.
>>
>> On Wed 20 Jun, 2018, 1:17 AM yajva wrote:
>>> I have tried Google OCR for recognizi

Re: [tesseract-ocr] recognising roman with sanskrit diacritics

2018-07-12 Thread yajva
eng+iast-plus-3600 => no diacritics at all
Latin+iast-plus-3600 => only macrons none other
On Thursday, July 12, 2018 at 1:12:25 AM UTC+5:30, shree wrote:
>
> What about ocr with
>
> eng+iast
>
> On Wed 11 Jul, 2018, 7:44 PM yajva wrote:
>>