[tesseract-ocr] Re: Can't encode transcription

Des Bw Tue, 26 Sep 2023 08:38:28 -0700

Are you planning to fine-tune for a specific font, or do you want to improve 
the overall accuracy of the best model?


On Tuesday, September 26, 2023 at 6:35:38 PM UTC+3 Des Bw wrote:

> I am also training for Amharic. 
> I am fairly sure you are using Windows; I had exactly the same problem 
> there. I think it is related to Unicode handling, but I was not able to 
> solve it. I have now installed Ubuntu alongside Windows, and everything 
> works fine. 
>
> On Tuesday, September 26, 2023 at 12:25:40 PM UTC+3 [email protected] 
> wrote:
>
>> I am new to Tesseract and have tried to train a Tesseract model for the 
>> Amharic language.
>>  
>> Training never stops, and it starts with output like this:
>> Can't encode transcription: 'ህ' in language '' Encoding of string failed! 
>> Failure bytes: ffffffe1 ffffff8d ffffffad
>>
>>
>> Is anybody aware of this problem, and how can I fine-tune amh.traineddata? 
>> I followed this tutorial: GitHub - livezingy/tesstrain-win: Train 
>> Tesseract LSTM with make on Windows 
>> <https://github.com/livezingy/tesstrain-win/tree/master>
>>
>
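
A note on the "Failure bytes" line quoted above: the values look like sign-extended bytes, so keeping the low 8 bits of each (e1 8d ad) and decoding them as UTF-8 should show which character could not be encoded. The sketch below is only a rough illustration of that idea in Python. The function name decode_failure_bytes is made up for this example and is not part of Tesseract, and it assumes the dump is a space-separated list of hex values as in the quoted log.

```python
import unicodedata


def decode_failure_bytes(dump: str) -> str:
    """Decode a 'Failure bytes' dump (hypothetical helper, not a Tesseract API).

    Assumes each token is one sign-extended byte written in hex,
    for example 'ffffffe1', and that the bytes together are UTF-8.
    """
    # Keep only the low 8 bits of each token to undo the sign extension.
    raw = bytes(int(token, 16) & 0xFF for token in dump.split())
    return raw.decode("utf-8")


if __name__ == "__main__":
    text = decode_failure_bytes("ffffffe1 ffffff8d ffffffad")
    for ch in text:
        # Print the code point and its Unicode name for each decoded character.
        print(f"U+{ord(ch):04X}", unicodedata.name(ch, "UNKNOWN"), ch)
```

For the bytes in the quoted log this yields the single code point U+136D, which is not the same character as the 'ህ' (U+1205) shown in the message, so the failing transcription line may contain more than that one character. If, as is often reported for this error, the cause is a character missing from the model's character set, checking the ground-truth text for that code point would be a reasonable next step; that cause is an assumption here, not something the log confirms.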
