That particular dictionary has already been OCRed with Abbyy Fine Reader: http://archive.org/stream/everymansenglish00jone/everymansenglish00jone_djvu.txt
Although not perfect, a little cleanup would render that text quite usable. --Sven On Wed, Jan 16, 2013 at 8:44 AM, Sven Pedersen <[email protected]>wrote: > You would need to train tesseract to recognize those symbols. The web page > outlines how to do that. > --Sven > > > On Tue, Jan 15, 2013 at 6:43 PM, <[email protected]> wrote: > >> Is Tesseract-OCR capable of recognizing phonetic symbols? I would like to >> extract the phonetic transcriptions of the following (out of copyright) >> document >> http://archive.org/stream/everymansenglish00jone#page/2/mode/2up >> >> Regards, >> >> - Olumide >> >> -- >> You received this message because you are subscribed to the Google >> Groups "tesseract-ocr" group. >> To post to this group, send email to [email protected] >> To unsubscribe from this group, send email to >> [email protected] >> For more options, visit this group at >> http://groups.google.com/group/tesseract-ocr?hl=en >> > > > > -- > ``All that is gold does not glitter, > not all those who wander are lost; > the old that is strong does not wither, > deep roots are not reached by the frost. > From the ashes a fire shall be woken, > a light from the shadows shall spring; > renewed shall be blade that was broken, > the crownless again shall be king.” > -- ``All that is gold does not glitter, not all those who wander are lost; the old that is strong does not wither, deep roots are not reached by the frost. >From the ashes a fire shall be woken, a light from the shadows shall spring; renewed shall be blade that was broken, the crownless again shall be king.” -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en

