I am using Tesseract 3.02 for development on Windows. When I run OCR on an image, the code I wrote against the Tesseract API in my development environment gives different results from running Tesseract on the same image from the command line.
Sample output from the command line (this is the correct one):

“t3DMarkLogic" Tony Agresta Worldwide VP, Field Operations MarkLogic Corporation - 7950 Jones Branch Drive +1 703 854 8531 Phone Suite 200 +1 703 854 8510 Fax McLean, VA 22107 +1 443 253 6810 Mobile - www.marklogic.com tony.agresta[at]marklogic.com

Sample output from my Tesseract API code (contains a lot of junk characters):

_ - _ _ V, f’ ._,«.-f A‘ . V ,//7-fir /1”" /’..i../¢_'7*"' ca : .- ' 8 4 ‘D MarkLogic" 1 Tony Agresta . 5 Worldwide VP, Field Operations 1 * MarkLogic Corporation - V ’ - 7950 Jones Branch Drive +1 703 854 8531 Phone Suite 200 +1 703 854 8510 Fax McLean, VA 22107 +1 443 253 6810 Mobile - www.marklogic.com tony.agresta[at]marklogic.com

Why are the two outputs different? And what should I change so that the output from my code matches what the command line produces for the same image?

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to [email protected]
For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en
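To pin down exactly where the two results diverge, a word-level diff can help. The following is only a minimal sketch in Python, not part of the original code: the function name `junk_tokens` and the variable names are made up for illustration, and the two strings are shortened excerpts of the outputs quoted in this post. It uses the standard-library `difflib` module to list the words that appear in the API output but have no counterpart in the command-line output.

```python
import difflib


def junk_tokens(reference, candidate):
    """Return the words of `candidate` that have no counterpart in `reference`.

    Both arguments are plain strings; they are compared word by word.
    (Hypothetical helper, named here only for illustration.)
    """
    ref_words = reference.split()
    cand_words = candidate.split()
    matcher = difflib.SequenceMatcher(a=ref_words, b=cand_words, autojunk=False)

    extra = []
    for tag, _i1, _i2, j1, j2 in matcher.get_opcodes():
        # "insert" and "replace" mark words present only on the candidate side.
        if tag in ("insert", "replace"):
            extra.extend(cand_words[j1:j2])
    return extra


# Shortened excerpts of the two outputs quoted in the post.
cli_output = "Tony Agresta Worldwide VP, Field Operations MarkLogic Corporation"
api_output = "1 Tony Agresta . 5 Worldwide VP, Field Operations 1 * MarkLogic Corporation"

print(junk_tokens(cli_output, api_output))  # -> ['1', '.', '5', '1', '*']
```

The diff only locates the stray tokens; it says nothing about what causes them. Running it on the full outputs shows whether the junk sits at the start of the text, between lines, or both, which is useful detail to include when asking for help.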

