[tesseract-ocr] Re: Training tesseract on non-letter symbols

Tom Morris Tue, 10 May 2016 08:53:27 -0700

This sounds more like a job for OpenCV or some other machine vision library.


Tom

On Tuesday, May 10, 2016 at 3:13:36 AM UTC-4, Piotr Gryta wrote:
>
> Hi everyone,
> I am devoloping a Java application to vectorize a raster image. One of the 
> steps is symbol recognition and I was hoping to train Tesseract to find 
> them and return their pixel coordinates. 
> My question is: 
> 1) Is it possible to make a dictionary of symbols to avoid detection of 
> letters contained in English dictionary?
> 2) What steps should I perform?
> I managed to make a box files for my training image, but later I get an 
> Empty page! error.
> I am glad for any suggestion,
> Piotrek
>
> Here is a sample image of tree sybols which I would like to train to check 
> if it works:
> https://gyazo.com/85a1db80f92f2df44625875bcf20d37d
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/3813ef66-cb7f-4d66-8c9a-bba07c43966f%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

[tesseract-ocr] Re: Training tesseract on non-letter symbols

Reply via email to