>From what I see, there is no traineddata for the Roman latin
alphabet.  Essentially, the current eng.traineddata's shortcoming is
its lack of the macron diacritic.

Is it possible to add the macron glyphs to the already-existing
eng.traineddata?  (the Ā, ā, Ē, ē, Ō, ō, Ū, ū)

-------------------

On a tangential issue: it's almost comical how, in practice, there is
no easy way to "google" for information on the Roman Empire (and
Catholic Church) Latin language and alphabet and glyphs, because the
other conventional use of the phrase "latin alphabet" (referring to
the modern latinate derivatives and descendants) gets in the way!

I think a new convention and descriptor needs to be established, that
uniquely refers to and denotes the alphabet used by ancient Romans
(the "real" Latin!)... (but then, again, it differs mostly with the
macron :-)



-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

Reply via email to