[tesseract-ocr] Re: Simple Tesseract OCR in .NET 4+?

2017-02-28 Thread Cory Blissitte
Building on Quan Nguyens suggestion:

CharlesWs Tesseract wrapper project is a pretty easy thing to work with.  
In its simplest form you can do the following provided the image file is 
saved to your filesystem:

var engine = new 
TesseractEngine(Path.Combine(AppDomain.CurrentDomain.BaseDirectory, 
@"tessdata"), "eng",
EngineMode.TesseractOnly)
{
DefaultPageSegMode = PageSegMode.AutoOsd
};

var pageOutput = engine.Process(Pix.LoadFromFile(fileName));

var hOcr = pageOutput.GetHOCRText(0);
var imageText= pageOutput.GetText();

The hOcr string is an HTML document that contains the text and placement of 
that text on the page (most useful for incorporation into searchable PDFs.  
The imageText string is just the recognized text from the image.


Cory

On Monday, February 27, 2017 at 9:37:47 PM UTC-10, Cetor Notorious wrote:
>
> Hi everybody,
>
> I was wondering if anyone had a tutorial / example code that is really 
> simple.
> It just needs to recognize text from a webimage, and return the recognized 
> text.
>
> I would like to make it where I can have this entire piece in one DLL so 
> it's easy to use.
>
> Is anyone able to help me?
>
> Have a wonderful day :)
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/c70cf56f-5672-4809-bd22-7a96a0f60455%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


[tesseract-ocr] Re: Simple Tesseract OCR in .NET 4+?

2017-02-28 Thread Quan Nguyen
Check out .NET wrapper for Tesseract:

https://github.com/charlesw/tesseract

On Tuesday, February 28, 2017 at 1:37:47 AM UTC-6, Cetor Notorious wrote:
>
> Hi everybody,
>
> I was wondering if anyone had a tutorial / example code that is really 
> simple.
> It just needs to recognize text from a webimage, and return the recognized 
> text.
>
> I would like to make it where I can have this entire piece in one DLL so 
> it's easy to use.
>
> Is anyone able to help me?
>
> Have a wonderful day :)
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/01457812-b969-405d-8c41-65422e1e945a%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.