ubuntu@tesseract-ocr:~/TEST$ tesseract fixed.png - --psm 6 --dpi 300 --tessdata-dir ~/tessdata_fast PASS wee ubuntu@tesseract-ocr:~/TEST$ tesseract fixed.png - --psm 6 --dpi 300 --tessdata-dir ~/tessdata_best PASS AYE ubuntu@tesseract-ocr:~/TEST$ tesseract fixed.png - --psm 6 --dpi 300 --tessdata-dir ~/tessdata PASS A\ 8 ubuntu@tesseract-ocr:~/TEST$ tesseract fixed.png - --psm 6 --dpi 300 --tessdata-dir ~/tessdata --oem 1 PASS AYE ubuntu@tesseract-ocr:~/TEST$ tesseract fixed.png - --psm 6 --dpi 300 --tessdata-dir ~/tessdata --oem 0 PASS FAIL
Looks like `neural net tesseract` performs worse than `base tesseract` in this case. On Mon, Apr 29, 2019 at 10:41 PM Jason <[email protected]> wrote: > Apologies for such a simple question but this is a super simple test case > and I don't understand why it isn't working. This simple image contains the > words "PASS" and "FAIL". "PASS" is recognized but "FAIL" comes out as > "wee". What can I do to get it to detect "FAIL" properly? > > I'm using the demo CPP code, ideally I would like to provide an "approved" > word list or be able to remove "wee" and hopefully, it'll match to "FAIL". > I saw > https://github.com/tesseract-ocr/tesseract/blob/master/doc/tesseract.1.asc#config-files-and-augmenting-with-user-data > but > that looks like it's just for the command line utility? How would I go > about that int he C++ API? > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > To post to this group, send email to [email protected]. > Visit this group at https://groups.google.com/group/tesseract-ocr. > To view this discussion on the web visit > https://groups.google.com/d/msgid/tesseract-ocr/71bab02c-ba21-49dc-8e99-710d52075207%40googlegroups.com > <https://groups.google.com/d/msgid/tesseract-ocr/71bab02c-ba21-49dc-8e99-710d52075207%40googlegroups.com?utm_medium=email&utm_source=footer> > . > For more options, visit https://groups.google.com/d/optout. > -- ____________________________________________________________ भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduUVDACpXsnqnF3JNTfq5vJfA4jTd1sXO0qs9BteNpKeWw%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.

