The main problem was with the OCR, not the encoding. I’ve OCR’ed it with ABBYY FineReader 10 (which handles Hebrew consonants), and uploaded it to https://www.box.com/s/xgfz4ek5uteybwl2dzee
Ken M. Penner, Ph.D.
Associate Professor, Religious Studies
2329 Notre Dame Avenue, 409 Nicholson Tower
St. Francis Xavier University
Antigonish, NS B2G 2W5
Canada
(902)867-2265
[email protected]

From: [email protected] [mailto:[email protected]] On Behalf Of John Leake
Sent: Friday, June 07, 2013 7:32 PM
To: K Randolph
Cc: B-Hebrew
Subject: Re: [b-hebrew] Query about Davidson's Analytical Lexicon

I'm not convinced, Karl; I think the Hebrew is gibberish. Consistent gibberish and (due to the sort of digitisation as part of a page layout) well-ordered gibberish, but gibberish.

The people who do the archive.org digitisation don't really seem to pay much attention to language. They don't scan even Latin with any attempt to treat it as Latin, and Greek too becomes nonsense. But the source image files are there if someone has good digitisation software.

Let's be grateful, though, that the stuff is there at all!

John

----------------------------------
ان صاحب حياة هانئة لا يدونها انما يحياها
He who has a comfortable life doesn't write about it - he lives it
----------------------------------

On 7 Jun 2013, at 22:48, K Randolph <[email protected]> wrote:

The gibberish doesn’t appear to be a problem with OCR, but rather with incompatible encoding. I’ve worked with OCR, and have seen when it messes up, but this doesn’t look like an OCR mess-up.
