Are you planning to fine-tune for a specific font, or do you want to improve the overall accuracy of the best model?
On Tuesday, September 26, 2023 at 6:35:38 PM UTC+3 Des Bw wrote:
> I am also training for Amharic.
> I am pretty sure you are using Windows. I had exactly the same problem
> with it. I think it is related to Unicode handling, but I was not able to
> solve the issue. I have now installed Ubuntu alongside it, and everything
> works fine.
>
> On Tuesday, September 26, 2023 at 12:25:40 PM UTC+3 [email protected]
> wrote:
>
>> I am new to Tesseract and I have tried to train a Tesseract model for
>> the Amharic language.
>>
>> Once it starts printing errors like this, it never stops:
>> Can't encode transcription: 'ህ' in language '' Encoding of string failed!
>> Failure bytes: ffffffe1 ffffff8d ffffffad
>>
>> Is anybody aware of this problem, and how can I fine-tune amh.traineddata?
>> I have followed this tutorial: GitHub - livezingy/tesstrain-win: Train
>> Tesseract LSTM with make on Windows
>> <https://github.com/livezingy/tesstrain-win/tree/master>
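
In case it helps narrow things down: the "Failure bytes" in the quoted log look like UTF-8 bytes printed as sign-extended hex values. Below is a minimal Python sketch for checking which character they correspond to. The helper name `decode_failure_bytes` is my own and is not part of Tesseract, and the sketch assumes every token in the log is one sign-extended byte.

```python
import unicodedata


def decode_failure_bytes(dump: str) -> str:
    """Decode a log fragment such as 'ffffffe1 ffffff8d ffffffad' to text.

    Assumption: each whitespace-separated token is a single byte that was
    printed as a sign-extended 32-bit hex value.
    """
    # Keep only the low 8 bits of every token to recover the raw bytes.
    raw = bytes(int(token, 16) & 0xFF for token in dump.split())
    # Raises UnicodeDecodeError if the bytes are not valid UTF-8.
    return raw.decode("utf-8")


if __name__ == "__main__":
    text = decode_failure_bytes("ffffffe1 ffffff8d ffffffad")
    for ch in text:
        # Print the code point and its Unicode name for each character.
        print(f"U+{ord(ch):04X}", unicodedata.name(ch, "<unnamed>"))
```

For the bytes in the quoted log this yields code point U+136D, which lies in the Ethiopic block. If that character is missing from the unicharset of the model being fine-tuned, that may be why the encoding fails, but this is a guess on my part and not something the log confirms.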

