To quote the Tesseract Wiki <https://github.com/tesseract-ocr/tesseract/wiki/Training-Tesseract#background-and-limitations> :
Don't try to train Tesseract versions earlier than 4.0 for Arabic (same for > Persian, Urdu, etc.). It's hopeless. For 4.0 only train with the LSTM > method > <https://github.com/tesseract-ocr/tesseract/wiki/4.0-with-LSTM#training-tesseract-lstm-engine> > . > On Monday, March 21, 2016 at 4:19:37 AM UTC-4, RJ wrote: > > > <https://lh3.googleusercontent.com/-rGnp-XhRVzM/Vu-svW0L3UI/AAAAAAAAAgc/Yzp6-ZgoCIo7iyoPQRWB_h7ZAGeq_lHKQ/s1600/Peace.png> > Hello All, > > I am using tesseract 3.02 for Arabic language. I using command line > options to read the image. > *tesseract.exe "D:\Peace.png" D:\output.txt -l ara -psm 7* > > > <https://lh3.googleusercontent.com/-rGnp-XhRVzM/Vu-svW0L3UI/AAAAAAAAAgc/Yzp6-ZgoCIo7iyoPQRWB_h7ZAGeq_lHKQ/s1600/Peace.png> > But i got output ( ال*نللا ثم ) *different to the input image. Is there > any configuration required? > > > Thanks in advance > RJ > > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/67ad7493-f1b8-4df9-a5fb-3993d3a1bbf8%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.

