Re: [tesseract-ocr] Re: problem detected using tesseract4 & arabic data

2017-03-29 Thread El Fakir Zakaria
thank you for your concern over this matter, your work is really important and much appreciated. 2017-03-29 23:34 GMT+01:00 Ray Smith : > Thanks for spotting this! > I understand why it makes this error, but it will take some thought to fix > it properly! > It is using a

[tesseract-ocr] Re: problem detected using tesseract4 & arabic data

2017-03-29 Thread Ray Smith
Thanks for spotting this! I understand why it makes this error, but it will take some thought to fix it properly! It is using a sort by x-position to re-order the boxes for RTL language training, but that doesn't work in the case of heavily kerned characters like ل in your example. It needs to