On 28/10/10 14:48, Tzadik Vanderhoof wrote:
I have a binary data file, in a format used by a relatively ancient
program, which I am trying to convert into something sane. With the
help of a Hex editor I have basically worked out the file format
except that it contains Hebrew characters with an odd encoding.
All characters are 8 bits. The "standard" 27 consonants (including
"final" consonants) go from hex *80* to *9A*. Then there are *vowels*
that seem to start around hex *9B* or so (I'm guessing right after the
standard consonants end). Then there are *"dotted" consonants* that
seem to start at hex *E0*.
If I remember correctly, I think this is some sort of DOS encoding
(perhaps connected to the old "Hebrew chip"). Does anyone have a table
of this character mapping or a tool that will translate this mapping
into a more normal Hebrew encoding like Unicode?
Thanks so much!
--
And I should add:
iconv -f cp862 < file > output
--
Shachar Shemesh
Lingnu Open Source Consulting Ltd.
http://www.lingnu.com
_______________________________________________
Linux-il mailing list
[email protected]
http://mailman.cs.huji.ac.il/mailman/listinfo/linux-il