[tesseract-ocr] Unable to read correct text from an Image

2016-11-07 Thread Gaurav Sharma
Hi, I am trying to read the text from an Image(Format: PNG). I am not getting the accurate text from that image. Please find the attahced images which contains the Graph Image and screenshot for extracted text from graph image. Note: it reads digit 8 as 3. Can someone please help to get the

[tesseract-ocr] Unable to get the correct text from an PNG image

2016-11-07 Thread Gaurav Sharma
Hi, I am trying to get the text from an Image (Format: PNG). I am able to get the text but its not accurate as per shown in image. Please find the attahced graph image (NewGraph.png) and extracted text from image(Extracted_Text_From_Image.png). Note: Input image is a graph image. In this

[tesseract-ocr] Script Detection

2016-11-07 Thread rkvsraman
Hello, I tried to detect the script of the above bengali image with command tesseract ben.png bensc - -psm 0 and i get following output in bensc.osd which detects the the script as Latin. Page number: 0 Orientation in degrees: 90 Rotate: 270 Orientation confidence: 1.48 Script: Latin

[tesseract-ocr] Match text output to uzn

2016-11-07 Thread blubzel
Hi, i am using tesseract to extract data from tables in documents. Therefor i specify the zones for all cells in an uzn file. I can match the extracted data to the individual cells, if there are no empty cells. But if there are empty cells they are not represented in the output text file. So