Re: [Mayan EDMS: 612] Most reliable OCR?

Mario César Wed, 07 Aug 2013 07:07:54 -0700

2013/8/7 John Wells <[email protected]>

> I'm evaluating Mayan and like what I see so far. However, I'm finding
> Tesseract's OCR to be pretty inaccurate. What, in your experience, is the
> most reliable OCR engine out there? We're willing to buy a commercial
> engine if necessary. I've googled a bit and have a call set up with ABBYY,
> but would love to have some input on what others have used, or if there's a
> way to make tesseract more reliable.
>


Tesseracts is the best opensource OCR library I had work with. Others are
just to simple or output unusable results.

I will recommend you that learn about Training
https://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3 In my case
I had a lot of common formatted documents, the first try was unacceptable,
but after doing some training work the results where pretty much 90%
accurate.

-- 

--- 
You received this message because you are subscribed to the Google Groups 
"Mayan EDMS" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/groups/opt_out.

Re: [Mayan EDMS: 612] Most reliable OCR?

Reply via email to