> > A couple of suggestions, therefore: > > One option: change verbiage from > > > tesseract-2.01.<lang>.tar.gz contains the language data files for > > <lang>. You need at least one of these or tesseract will not work. > > > to > > > tesseract-2.01.<lang>.tar.gz contains the language data files for > > <lang>. You need at least one of these or tesseract will not work. > > Note that this is labeled as v2.01 not v2.04 > > That's a bit redundant, don't you think? I understand where the > confusion crept in, but I don't have a better suggestion, and yours is > not, IMO, any less confusing. > > Care to try again?
How about: > > tesseract-2.01.<lang>.tar.gz (Note that this is from a previous release) > > contains the language data files for > This raises another issue - thanks - which merits its own topic, I think. > > > Third option (my preferred): re-release a new language data file per > > version. Then it makes more sense to the newbie that you have to > > download them concurrently. Currently it feels like an odd > > requirement to cherry pick a download from a previous release. > > Ugh, no. It's absolutely senseless to release the exact same files > multiple times just to cater for people who can't grasp that the > software and its data have independent versions. I'm pretty sure you didn't mean that as inflammatory. Maybe rename them to not be tied with a specific version, then? It seems that those files are meant for more than just v2.01 releases... Here's another suggestion: make the english (or all) training data as featured. Then it will be apparent that the downloads are still applicable. Thoughts? -rp -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected]. To unsubscribe from this group, send email to [email protected]. For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en.

