Thanks for the advice, Sven! I'll see what I can do.

Kind regards,

Terrence

On Jan 10, 9:45 am, Sven Pedersen <[email protected]> wrote:
> You should try cube (multiple language) "-l eng+deu" with the European
> scripts to see if a combination works well, but honestly unless this list
> is very long it would be easier to OCR it and correct it by hand for
> scannos (OCR equiv. of typos). Otherwise, you'll need to train starting
> with English -- not from scratch. Follow the FAQ and let us know if you
> have problems.
> -_Sven
>
> On Wed, Jan 9, 2013 at 10:57 PM, TerrenceW <[email protected]
>
>
>
>
>
>
>
>
>
> > wrote:
> > I have an image containing English words with their phonetic
> > equivalents printed alongside (a pronunciation guide.) It's actually
> > for a spelling bee. The organizers of the bee would very much like to
> > get this image as a textual list (so they can manipulate it, pull
> > words from it randomly, and so on) and they came to me for help. I've
> > used Tesseract before, but only to recognize English and French text.
>
> > The pronunciation guide contains text with lots of umlauts, upside
> > down letter e's, etc. More specialized characters than are contained
> > in French (which is one thing I tried, in an effort to improve the
> > recognition.) Does anyone have any advice for recognizing characters
> > like this? Should I start training Tesseract to recognize them? I've
> > never done any training before, which is why I'm a bit reluctant.
>
> > Thanks in advance!
>
> > Terrence
>
> > --
> > You received this message because you are subscribed to the Google
> > Groups "tesseract-ocr" group.
> > To post to this group, send email to [email protected]
> > To unsubscribe from this group, send email to
> > [email protected]
> > For more options, visit this group at
> >http://groups.google.com/group/tesseract-ocr?hl=en
>
> --
> ``All that is gold does not glitter,
>   not all those who wander are lost;
> the old that is strong does not wither,
>   deep roots are not reached by the frost.
> From the ashes a fire shall be woken,
>   a light from the shadows shall spring;
> renewed shall be blade that was broken,
>   the crownless again shall be king.”

-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

Reply via email to