Re: [tesseract-ocr] Tesseract-OCR Training Arabic text & numbers

2020-07-14 Thread Eliyaz L
Hi sorry to bother, just a follow up. i tried the latest tesseract its working fine with the arabic text and numbers but the only issue is with arabic date, so if the issue is still open, can i prepare dataset and train a separate custom model for only numbers and date. if possible then pls

Re: [tesseract-ocr] Tesseract-OCR Training Arabic text & numbers

2020-07-14 Thread Shree Devi Kumar
@Eliyaz I do not know Arabic or any other RTL. I suggest you try running training with the latest code and tesstrain. You may have to experiment to get the best result. I will try to do a test run with the data you provided, does it include numbers and dates? On Tue, Jul 14, 2020, 13:18 Eliyaz L

Re: [tesseract-ocr] Tesseract makes different predictions on seemingly equal images. How to make it more robust?

2020-07-14 Thread Zdenko Podobny
Try to use the latest version of tesseract. Zdenko ut 14. 7. 2020 o 16:04 MysteriousGuy napĂ­sal(a): > I am using Tesseract to extract text from images attached. For some > reason, even though the images are nearly identical, tesseract makes a > mistake in one of them: for 'bad.png' the output

[tesseract-ocr] Tesseract makes different predictions on seemingly equal images. How to make it more robust?

2020-07-14 Thread MysteriousGuy
I am using Tesseract to extract text from images attached. For some reason, even though the images are nearly identical, tesseract makes a mistake in one of them: for 'bad.png' the output is ELHADIJ, whereas for 'good.png' it is ELHADJ Here is what I have and done: - tesseract version:

[tesseract-ocr] tessaract ocr on capcha images--how to perform well?

2020-07-14 Thread Omar Hasan
Hello! I am trying to run ocr on capcha images. well, for normal images tessaract performs well, but for images below attachments, it performs bad, i mean can not properly recognize. can you please help me? I am a beginner. Note that, I have already used some opencv tools like grayscale,