Hello,
Im using Tesseract OCR for Urdu Nastalique script..and there are many similar characters that are being misrecognized...im sure there must be some flags in Tesseract that would solve my problem someday...An example of the misrecognition is given below, where Tesseract returns the character on the right side, as a recognition output of the character on the left side: <https://lh3.googleusercontent.com/-pOaY13abZs8/UMg3xu4kpjI/AAAAAAAAAB0/Q1Az-HsMU5M/s1600/G_HFL_%D8%B3%D8%B9%D8%A8%DB%81_368_F14_CC_1.bmp><https://lh6.googleusercontent.com/-DhzQOI7DQ8w/UMg3-jDbd6I/AAAAAAAAAB8/5_Wxe_Ueuss/s1600/G_HFL_%D9%85%D8%B9%D9%85%DB%81_4316_F14_L_1_CC_1.bmp> Can anyone please guide me about the flags that can be altered in order to overcome this problem?? -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en

