The main problem was with the OCR, not the encoding. I’ve OCR’ed it with ABBYY 
FineReader 10 (which handles Hebrew consonants), and uploaded it to 
https://www.box.com/s/xgfz4ek5uteybwl2dzee

Ken M. Penner, Ph.D.
Associate Professor, Religious Studies
2329 Notre Dame Avenue, 409 Nicholson Tower
St. Francis Xavier University
Antigonish, NS  B2G 2W5
Canada
(902)867-2265
[email protected]



From: [email protected] 
[mailto:[email protected]] On Behalf Of John Leake
Sent: Friday, June 07, 2013 7:32 PM
To: K Randolph
Cc: B-Hebrew
Subject: Re: [b-hebrew] Query about Davidson's Analytical Lexicon

I'm not convinced, Karl, I think the Hebrew is gibberish. Consistent gibberish 
and (due to the sort of digitisation as part of a page layout, well-ordered 
gibberish), but gibberish. The people who do the 
archive.org<http://archive.org> digitisation don't really seem to pay much 
attention to language. They don't scan even Latin  with any attempt to treat it 
as Latin, and Greek too becomes nonsense. But the source image files are there 
if someone has good digitisation software.

Let's be grateful, though, that the stuff is there at all!

John
----------------------------------
ان صاحب حياة هانئة لا يدونها انما يحياها
He who has a comfortable life doesn't write about it - he lives it
----------------------------------

On 7 Jun 2013, at 22:48, K Randolph 
<[email protected]<mailto:[email protected]>> wrote:
The gibberish doesn’t appear to be problems with OCR, rather incompatible 
encoding. I’ve worked with OCR, and have seen when it messes up, but this 
doesn’t look like OCR mess-up.
_______________________________________________
b-hebrew mailing list
[email protected]
http://lists.ibiblio.org/mailman/listinfo/b-hebrew

Reply via email to