Hi, I am trying to fine tuning process for arabic diacritics. I just add one char shaddad ّ to arbitrary word. I got first this error:
Normalization failed for string 'َّس' then Can't encode transcription: 'روص :ليجستلا هذه ،ةطساوب مالَّسلا ميوقتلا ال« ىلوألا' in language '' In the wiki page, I read Encoding of string failed! results when the text string for a training image cannot be encoded using the given unicharset. but How I can fix it? Thanks in advance, Fahad -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/2dee0b0c-a892-4664-9a05-f9e2b266cce0%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.

