I am using Tesseract version 3.02 for development on Windows. When I run 
OCR on an image, I get different results from my own code, which calls the 
Tesseract API, than from running Tesseract on the command line.

Sample output from the command line (this is the correct one):
“t3DMarkLogic"
Tony Agresta
Worldwide VP, Field Operations
MarkLogic Corporation -
7950 Jones Branch Drive +1 703 854 8531 Phone
Suite 200 +1 703 854 8510 Fax
McLean, VA 22107 +1 443 253 6810 Mobile
- www.marklogic.com tony.agresta[at]marklogic.com

Sample output from the Tesseract API code (contains a lot of junk 
characters):
_ - _ _ V, f’ ._,«.-f A‘ . V ,//7-fir     /1”" /’..i../¢_'7*"'
 ca : .- '  8 4
‘D MarkLogic"   1
Tony Agresta . 5
Worldwide VP, Field Operations 1
* MarkLogic Corporation - V 
’ - 7950 Jones Branch Drive +1 703 854 8531 Phone
Suite 200 +1 703 854 8510 Fax
McLean, VA 22107 +1 443 253 6810 Mobile
- www.marklogic.com tony.agresta[at]marklogic.com
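
To show exactly where the two outputs diverge, a line-based diff can be 
used. The following is only a minimal sketch using Python's standard 
difflib module. The lists hold shortened excerpts of the two outputs above 
(not the full text), and the names cli_output, api_output and 
differing_lines are made up for this illustration.

```python
import difflib

# Shortened excerpts of the two outputs quoted above (not the full text).
cli_output = [
    "Tony Agresta",
    "Worldwide VP, Field Operations",
    "Suite 200 +1 703 854 8510 Fax",
]
api_output = [
    " ca : .- '  8 4",
    "Tony Agresta . 5",
    "Worldwide VP, Field Operations 1",
    "Suite 200 +1 703 854 8510 Fax",
]


def differing_lines(expected, actual):
    """Return only the added ('+ ') and removed ('- ') lines of a line diff."""
    return [
        line
        for line in difflib.ndiff(expected, actual)
        if line.startswith(("+ ", "- "))
    ]


if __name__ == "__main__":
    # Lines that are identical in both outputs are not printed.
    for line in differing_lines(cli_output, api_output):
        print(line)
```

In this excerpt the differences are an extra junk line at the top and stray 
trailing characters on otherwise correct lines; lines that match exactly, 
such as the "Suite 200" line, do not appear in the result.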


Why are the two outputs different? And what should I change so that the 
output from my code matches the output produced for the same image from the 
command line?

-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en
