I am attaching the OCRed text. Please correct it so that I can use as groundtruth for further training and testing.
On Wed, Jun 20, 2018 at 3:15 PM Shree Devi Kumar <[email protected]> wrote: > I had done a training for sanskrit for both devanagari and IAST but it > does not include cedilla for Sh > > I will add it and let you know. > > On Wed 20 Jun, 2018, 1:17 AM yajva, <[email protected]> wrote: > >> I have tried Google OCR for recognizing Sanskrit text in Roman with >> diacritics (IAST). It recognizes above macron but not dots below also >> joining grave and accent. Is there any traineddata available for tesseract >> that can do this with good accuracy ? Attached a sample page that I am >> interested in. >> >> -- >> You received this message because you are subscribed to the Google Groups >> "tesseract-ocr" group. >> To unsubscribe from this group and stop receiving emails from it, send an >> email to [email protected]. >> To post to this group, send email to [email protected]. >> Visit this group at https://groups.google.com/group/tesseract-ocr. >> To view this discussion on the web visit >> https://groups.google.com/d/msgid/tesseract-ocr/aef0797b-8df3-4db7-9a3b-02f62d2e5a28%40googlegroups.com >> <https://groups.google.com/d/msgid/tesseract-ocr/aef0797b-8df3-4db7-9a3b-02f62d2e5a28%40googlegroups.com?utm_medium=email&utm_source=footer> >> . >> For more options, visit https://groups.google.com/d/optout. >> > -- ____________________________________________________________ भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduWn1aLC%2Bt5EcruM8X3isE9WPgTzJow4rbF-23gUSHEufA%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.
Çrīgaheçāya nam a ḥ. 1. A thāto Gobhiloktānām anyeshālṁ caiva kārmāṇām aspashṭānāṁ vidhiṁ samyag dārçayishye pradīpavat | 1. | Trīvṛā īūirdāhvavṛtali, kāryaṁ tanṭutṭrayam adhovṛtam trivṛt tac copaāvītalṁ syāt tasyaiko granthir ishyate | 2. | Pṛṣhṭhavaṁçe ca nābhyāṁ ca dhṛtaṁ yad vindate kaṭim tad dhāryam upavītaṁ syān nātolambali na cocchritam | 83. | Sadopavītinā bhāvyaṁ sadā baddhaçikhena cā viçikho vyupavītaç ca yat karoti na tat kṛtam | 4. | Triḥ prāçyāpo dvir unmṛjya mukham etāny upaspṛçect āsyanāsākṣhikarṇāṁç ca nābhivakṣhahcçiroṁsakān | 5. | Aṅgushṭhena pradeçinyā ghrāṇaṁ caivam upaspṛçet / aṅgushṭhānāmikābhyāṁ ca cakṣhuḥ çrotṛa punaḥ punaḥ | 6. | Kanishṭhāṅgushṭhayor nābhiṁ hṛdayaṁ tu talena vai sarvābhis tu giraḥ paçcād bāhū cāgreṇa saṁspṛçet | 7. | Yatropadiçyāte karma kartur aṅgaṁ na tūcyate * dakṣhiṇas tatra vijñeyaḥ karmaṇāṁ pāragaḥ karaḥ | 8. | Yatra diṅniyamo na syāj japahomādikarmasu tisras tatra diçaḥ proktā aindrīsaumyāparājitā]ḥ | 9. | Tishṭhāann āsīnaḥ prahvo vā niyamo yatra nedṛçaḥ tadāsīnena kartavyaṁ na prahveṇa na tishṭhatā | 10. |

