Please test with traineddata file from
https://github.com/Shreeshrii/tessdata_sanskrit/tree/master/iast-plus1

Need to check that is it not overfitted.

Please share a couple more images which I can use for testing.


On Thu, Jun 21, 2018 at 11:38 PM yajva <nsvnarasi...@gmail.com> wrote:

> one more correction.
>
>
> On Thursday, June 21, 2018 at 11:34:00 PM UTC+5:30, yajva wrote:
>>
>> done
>>
>> On Wednesday, June 20, 2018 at 9:05:01 PM UTC+5:30, shree wrote:
>>>
>>> I am attaching the OCRed text. Please correct it so that  I can use as
>>> groundtruth for further training and testing.
>>>
>>> On Wed, Jun 20, 2018 at 3:15 PM Shree Devi Kumar <shree...@gmail.com>
>>> wrote:
>>>
>>>> I had done a training for sanskrit for both devanagari and IAST but it
>>>> does not include cedilla for Sh
>>>>
>>>> I will add it and let you know.
>>>>
>>>> On Wed 20 Jun, 2018, 1:17 AM yajva, <nsvnar...@gmail.com> wrote:
>>>>
>>>>> I have tried Google OCR for recognizing Sanskrit text in Roman with
>>>>> diacritics (IAST). It recognizes above macron but not dots below also
>>>>> joining grave and accent. Is there any traineddata available for tesseract
>>>>> that can do this with good accuracy ? Attached a sample page that I am
>>>>> interested in.
>>>>>
>>>>> --
>>>>> You received this message because you are subscribed to the Google
>>>>> Groups "tesseract-ocr" group.
>>>>> To unsubscribe from this group and stop receiving emails from it, send
>>>>> an email to tesseract-oc...@googlegroups.com.
>>>>> To post to this group, send email to tesser...@googlegroups.com.
>>>>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>>>>> To view this discussion on the web visit
>>>>> https://groups.google.com/d/msgid/tesseract-ocr/aef0797b-8df3-4db7-9a3b-02f62d2e5a28%40googlegroups.com
>>>>> <https://groups.google.com/d/msgid/tesseract-ocr/aef0797b-8df3-4db7-9a3b-02f62d2e5a28%40googlegroups.com?utm_medium=email&utm_source=footer>
>>>>> .
>>>>> For more options, visit https://groups.google.com/d/optout.
>>>>>
>>>>
>>>
>>> --
>>>
>>> ____________________________________________________________
>>> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>>>
>> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to tesseract-ocr+unsubscr...@googlegroups.com.
> To post to this group, send email to tesseract-ocr@googlegroups.com.
> Visit this group at https://groups.google.com/group/tesseract-ocr.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/a7bdf637-7f17-4eb3-8fa8-297018633bfa%40googlegroups.com
> <https://groups.google.com/d/msgid/tesseract-ocr/a7bdf637-7f17-4eb3-8fa8-297018633bfa%40googlegroups.com?utm_medium=email&utm_source=footer>
> .
> For more options, visit https://groups.google.com/d/optout.
>


-- 

____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduXBbHz9FY6hHMitiDAJtckxnyBXYfX7ZJLQhLxgGb2gAg%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to