Hello,
I am trying to use the Tessnet2 OCR library in a C# program, but the
results are *very* far off.
The following is my code, from Program.cs in a C# console project:
static void Main(string[] args)
{
    try
    {
        Bitmap image = new Bitmap(@"C:\Users\hp\Desktop\eurotext.tif");
        var ocr = new Tesseract();
        // When I tried to add SetVariable(), it still gave a wrong output
        ocr.Init(@"C:\Program Files (x86)\Tesseract-OCR", "eng", true);
        var result = ocr.DoOCR(image, Rectangle.Empty);
        foreach (Word word in result)
            Console.WriteLine("{0} : {1}", word.Confidence, word.Text);
        Console.ReadLine();
    }
    catch (Exception exception)
    {
        Console.WriteLine("Error");
    }
}
For this image (a large binary 300 dpi image):
<https://lh5.googleusercontent.com/-oGnFgwbDp_I/VKfMS3sWRgI/AAAAAAAAADg/svhk8KIYwqI/s1600/eurotext.tif>
This is the Tessnet2 output:
<https://lh5.googleusercontent.com/-EaZhrqE93QM/VKfM-EXdPxI/AAAAAAAAADo/WyJFmMS4pJM/s1600/Untitled.png>
I have searched everywhere for a way to improve the accuracy of the OCR,
but with no luck.
I am a beginner in this topic, so please bear with me if the solution to
this problem turns out to be trivial.
Thanks!