here is process to create a new Ttraineddata file: t https://tesseract-ocr.github.io/tessdoc/TrainingTesseract-4.00
On Monday, 1 June 2020 12:24:11 UTC+5:30, Prasanta Hembram wrote: > > Their is no tessdata for Santali language :- > > 1. https://github.com/tesseract-ocr/tessdata > 2. https://github.com/tesseract-ocr/langdata > > Though some made it it is an Indic script, > https://github.com/indic-ocr/tessdata/tree/master/sat .. But how i tried > to contact them but no replay. My question is how to create a tessdata and > langdata for this language and how to add this in above repository. I am > new to coding i tried to make sat.traineddata in jtessboxeditor but getting > many errors but i think i can crate one. If someone can help me creating > langdata how to do it. And after creating all those files how can i upload > it to above repository. If anyone can make one i would be thankful > otherwise anyone can guide me to create one. The attached image is having > all script except Ol chiki numbers. > https://en.wikipedia.org/wiki/Ol_Chiki_script > > Thanks. > > <https://github.com/tesseract-ocr/tessdata> > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/3c417a45-1669-4e13-856a-eaa0a34ea341%40googlegroups.com.