Just started using tesseract, and want to do exactly what ShreeShrii did 
here to get kamakoti-san_latn_1.txt 
<https://github.com/tesseract-ocr/langdata/files/1284401/kamakoti-san_latn_1.txt>
':

https://github.com/tesseract-ocr/langdata/pull/4

However, what value do I need to use for the "-l" option to do this?  Or, 
do I need to install some additional language?  

I'm on macos, and installed tesseract using 'brew install tesseract'.  

$ tesseract --version
tesseract 4.0.0-beta.3
 leptonica-1.76.0
  libjpeg 9c : libpng 1.6.34 : libtiff 4.0.9 : zlib 1.2.11
 Found AVX2
 Found AVX
 Found SSE


I suspect that my tesseract setup is different, because, using the Latin 
option, 

tesseract -l lat --oem 1 --psm 3

I get the following output text

drstva devam mahakalam kalikangam mahaprabhum |

bhargavah patito bhumau dandavatsurapujite ||

bhargava uvaca

kalyantakalagnisamanabhasam

caturbhujam kalikayopajustam |

kapalakhatvangavarabhayadhya-

karam mahakalamanantamide ||

namah paramarüpaya paramalasurupine |

niyatipraptadehaya tattvarupaya te namah ||

namah paramarüpaya paramarthaikarupine |

viyanmayasvarupaya bhairavaya namo.astute ||

OM namah parameésaya paratattvarthadaráine |

viyanmayadyadhisaya dhivicitraya $ambhave ||

triloke$aya güdhaya suksmayavyaktarupine |

parakasthadirupaya paraya $ambhave namah ||

OM namah kalikankaya kalatjananibhaya te |

jagatsamharakartre ca mahakalaya te namah ||

nama ugraya devaya bhimaya bhayadayine |

mahabhayavinasaya srstisamharakarine ||

namah paraparanandasvarupaya mahatmane |

paraprakasarüpaya praka$anam praka$sine ||

OM namo dhyanagamyaya yogihrtpadmavasine |

vedatantrarthagamyaya vedatantrarthadarsine ||

vedagamaparamar$aparamanandadayine |

tantravedantavedyaya $ambhave vibhave namah ||

dhiyam pracodakam yattu paramam jyotiruttamam |

tatprerakaya devaya paramajyotise namah ||

gunaérayaya devaya nirgunaya kapardine |

atisthulaya devaya hyatisuksmaya te namah ||

trigunaya tryadhisaya saktitritayasaline |

namastrijyotise tubhyam tryaksaya ca trimürtaye ||


which is not the same as the text in kamakoti-Latin.txt 
<https://github.com/tesseract-ocr/langdata/files/1284400/kamakoti-Latin.txt> 
that 
ShreeShrii obtained.

Help much appreciated, thanks.

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/9731f965-c355-4c1b-b4c4-60606dd99654%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to