Just started using tesseract, and want to do exactly what ShreeShrii did here to get kamakoti-san_latn_1.txt <https://github.com/tesseract-ocr/langdata/files/1284401/kamakoti-san_latn_1.txt> ':
https://github.com/tesseract-ocr/langdata/pull/4 However, what value do I need to use for the "-l" option to do this? Or, do I need to install some additional language? I'm on macos, and installed tesseract using 'brew install tesseract'. $ tesseract --version tesseract 4.0.0-beta.3 leptonica-1.76.0 libjpeg 9c : libpng 1.6.34 : libtiff 4.0.9 : zlib 1.2.11 Found AVX2 Found AVX Found SSE I suspect that my tesseract setup is different, because, using the Latin option, tesseract -l lat --oem 1 --psm 3 I get the following output text drstva devam mahakalam kalikangam mahaprabhum | bhargavah patito bhumau dandavatsurapujite || bhargava uvaca kalyantakalagnisamanabhasam caturbhujam kalikayopajustam | kapalakhatvangavarabhayadhya- karam mahakalamanantamide || namah paramarüpaya paramalasurupine | niyatipraptadehaya tattvarupaya te namah || namah paramarüpaya paramarthaikarupine | viyanmayasvarupaya bhairavaya namo.astute || OM namah parameésaya paratattvarthadaráine | viyanmayadyadhisaya dhivicitraya $ambhave || triloke$aya güdhaya suksmayavyaktarupine | parakasthadirupaya paraya $ambhave namah || OM namah kalikankaya kalatjananibhaya te | jagatsamharakartre ca mahakalaya te namah || nama ugraya devaya bhimaya bhayadayine | mahabhayavinasaya srstisamharakarine || namah paraparanandasvarupaya mahatmane | paraprakasarüpaya praka$anam praka$sine || OM namo dhyanagamyaya yogihrtpadmavasine | vedatantrarthagamyaya vedatantrarthadarsine || vedagamaparamar$aparamanandadayine | tantravedantavedyaya $ambhave vibhave namah || dhiyam pracodakam yattu paramam jyotiruttamam | tatprerakaya devaya paramajyotise namah || gunaérayaya devaya nirgunaya kapardine | atisthulaya devaya hyatisuksmaya te namah || trigunaya tryadhisaya saktitritayasaline | namastrijyotise tubhyam tryaksaya ca trimürtaye || which is not the same as the text in kamakoti-Latin.txt <https://github.com/tesseract-ocr/langdata/files/1284400/kamakoti-Latin.txt> that ShreeShrii obtained. Help much appreciated, thanks. -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/9731f965-c355-4c1b-b4c4-60606dd99654%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.

