>From what I see, there is no traineddata for the Roman latin alphabet. Essentially, the current eng.traineddata's shortcoming is its lack of the macron diacritic.
Is it possible to add the macron glyphs to the already-existing eng.traineddata? (the Ā, ā, Ē, ē, Ō, ō, Ū, ū) ------------------- On a tangential issue: it's almost comical how, in practice, there is no easy way to "google" for information on the Roman Empire (and Catholic Church) Latin language and alphabet and glyphs, because the other conventional use of the phrase "latin alphabet" (referring to the modern latinate derivatives and descendants) gets in the way! I think a new convention and descriptor needs to be established, that uniquely refers to and denotes the alphabet used by ancient Romans (the "real" Latin!)... (but then, again, it differs mostly with the macron :-) -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en

