At http://www.elspell.gr/myspell there is OpenOffice Greek Dictionary v0.9 <http://elspell.math.upatras.gr/files/ooffice/el_GR-0.9.zip> with 800.000 greek words encoded with windows-1253, under MPL 1.1/GPL 2.0/LGPL 2.1 License.
Polytonic characters aren't used after 1982 and we don't have wordlists for them. Only sources like the Bible have polytonic words but they don't belong to modern greek. The maintainer of tesseract-ocr-grc uses a wordlist based on ancient greek polytonic texts. The greek polytonic unicode characters U+1F00 to U+1FFC aren't useful in the packet tesseract-ocr-ell, and they may confuse ocr recognition. On the opposite side tesseract-ocr-grc must have the polytonic characters and not the monotonic greek characters U+0386 to U+03CE. -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To post to this group, send email to tesseract-ocr@googlegroups.com. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/0c016239-d530-4ae7-9328-b1787d91d15f%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.