If all your images are in this bold thick font, fine tuning for impact may
help with some of the recognition errors.

On Tue, 27 Aug 2019, 14:42 Stephane Charette, <[email protected]>
wrote:

> I have a large number of images that contain a single line of alphanumeric
> data.  My scans so far have not been great, and I could use some assistance.
>
> Several vars are turned off as recommended in the docs:
>
>     key.push_back("load_system_dawg");
>     val.push_back("false");
>     key.push_back("load_freq_dawg");
>     val.push_back("false");
>
>
> These are set at initialization:
>
>     tess->Init(nullptr, "eng", tesseract::OEM_DEFAULT, nullptr, 0, &key, 
> &val, false);
>     tess->SetPageSegMode(tesseract::PageSegMode::PSM_SINGLE_LINE);
>
>
> Some images are close, such as this one:
>
> [image: "32 EC 5"]
> ...which is interpreted as "SZ2EC 3".
>
> Other like this one return a blank string:
>
> [image: "30 B 9"]
> And then I have some like this one which is so close, but Tesseract
> removes the spaces between the letters, so this example results in "1201":
>
> [image: "12 O 1"]
> I've posted my full .cpp test file and more example images showing the
> problem on StackOverflow:
> https://stackoverflow.com/questions/57670769/how-to-get-tesseract-to-recognize-these-alphanumeric-strings
>
> Thanks,
>
> Stéphane
>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/f721e105-d0d6-4322-b9c5-6c5f2d487d06%40googlegroups.com
> <https://groups.google.com/d/msgid/tesseract-ocr/f721e105-d0d6-4322-b9c5-6c5f2d487d06%40googlegroups.com?utm_medium=email&utm_source=footer>
> .
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduWHBWa9icUifJATc5dWAEWoF_c90%3Dixgj%2BLJeKXZ2cCRw%40mail.gmail.com.

Reply via email to