Re: Moving The Hebrew Extended Block Into The SMP

Mark Shoulson Tue, 10 May 2016 19:48:07 -0700

On 05/10/2016 09:08 PM, Robert Wheelock wrote:

·U+30000–U+30014 (21 codepoints): Additional characters for typesetting Biblical/Classical Hebrew

Do you have this list available yet? I'm curious about these points, and others.

·U+30015–U+3001F (11 codepoints): Palestinian vowel and pronunciation points for Hebrew and Galilean Aramaic
·U+30020–U+30021 (2 codepoints): Small superscript top-left signs for the letter /shin/: superscript śin and superscript shin

I thought SIN was sometimes indicated by a SAMEKH written above the letter. How would putting a SIN (which is just a SHIN with a dot on the left instead of the right) on top of the letter be any improvement over (or any different from) just putting the dot on the left of the base letter in the first place?
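For reference, here is a minimal Python sketch of how the dotted forms are already spelled with existing code points: the base letter SHIN followed by a combining dot. It uses only the standard `unicodedata` module; the variable names are illustrative and are not part of the proposal under discussion.

```python
import unicodedata

SHIN = "\u05E9"      # HEBREW LETTER SHIN (undotted base letter)
SHIN_DOT = "\u05C1"  # HEBREW POINT SHIN DOT (dot on the right)
SIN_DOT = "\u05C2"   # HEBREW POINT SIN DOT (dot on the left)

# Each dotted letter is a base letter plus one combining mark.
shin = SHIN + SHIN_DOT
sin = SHIN + SIN_DOT

for label, text in (("shin", shin), ("sin", sin)):
    for ch in text:
        print(label, f"U+{ord(ch):04X}", unicodedata.name(ch))

# The presentation form U+FB2B decomposes canonically to the same sequence,
# so normalized text ends up as base letter + dot either way.
PRECOMPOSED_SIN = "\uFB2B"  # HEBREW LETTER SHIN WITH SIN DOT
sin_from_precomposed = unicodedata.normalize("NFD", PRECOMPOSED_SIN)
```

Both dots are nonspacing marks, so the SIN/SHIN distinction is already carried in plain text by the combining dot alone.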

·U+30022–U+30041 (32 codepoints): Palestinian cantillation signs for Hebrew and Galilean Aramaic
·U+30042 is reserved
·U+30043–U+3005C (26 codepoints): Babylonian vowel and pronunciation points for Hebrew
·U+3005D–U+3005F are reserved
·U+30060–U+30071 (18 codepoints): Babylonian cantillation signs for Hebrew
·U+30072–U+3007D are reserved
·U+3007E–U+3008F (18 codepoints): Samaritan vowel points, pronunciation points, and cantillation signs for Hebrew (copies of those also being used for Samaritan script in the BMP)

OK, here I'm confused. Why do we need copies? Unicode doesn't like to encode redundant things, and duplicates only make for messes (when do you use which ZIQAA?). If we have the characters in the BMP, we don't need them in the SMP.

·U+30090–U+3010F (128 codepoints): Additional characters in Hebrew script for other Jewish languages (these are pointed like the corresponding Arabic characters in the BMP)

So additional Hebrew "letters" that take Arabic vowel-points? Makes sense; I saw some of that with Samaritan (particularly with DAMMA). We should probably just use the Arabic vowel code-points though.

·U+30110–U+3012F (32 codepoints): Basic Hebrew superscript characters (regular letters + 5 final forms + top-left pointed /śin/ + top-right pointed /shin/ + /maqqef/)
·U+30130–U+3014F (32 codepoints): Basic Hebrew subscript characters (regular letters + 5 final forms + top-left pointed /śin/ + top-right pointed /shin/ + /maqqef/)

When you say "superscript" (or "subscript"), do you mean "spacing character that's written small and raised/lowered"? Or do you mean "combining character that's written above/below another character"? (Cf. the difference between U+2071 SUPERSCRIPT LATIN SMALL LETTER I and U+0365 COMBINING LATIN SMALL LETTER I.) If the former, is there a reason this has to be done as plain text and can't be handled by higher-level markup? Probably every major script has been written small and high in some places, but we don't have superscript versions of every letter in Unicode.
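The spacing-versus-combining distinction can be checked directly against the character database. The following is a small Python sketch using the standard `unicodedata` module; the `describe` helper is a hypothetical name introduced only for this illustration.

```python
import unicodedata

SPACING = "\u2071"    # SUPERSCRIPT LATIN SMALL LETTER I
COMBINING = "\u0365"  # COMBINING LATIN SMALL LETTER I


def describe(ch):
    """Collect the properties that separate a spacing superscript from a combining mark."""
    return {
        "name": unicodedata.name(ch),
        "category": unicodedata.category(ch),
        # 0 means the character does not combine with a preceding base.
        "combining_class": unicodedata.combining(ch),
        # A "<super>" tag marks a compatibility variant of an ordinary letter.
        "decomposition": unicodedata.decomposition(ch),
    }


for ch in (SPACING, COMBINING):
    print(f"U+{ord(ch):04X}", describe(ch))

# Compatibility normalization folds the spacing superscript back to a plain "i",
# which is the sense in which it is a styled variant rather than a distinct letter.
folded = unicodedata.normalize("NFKC", SPACING)
```

U+2071 has combining class 0 and a `<super>` compatibility decomposition to U+0069, while U+0365 is a nonspacing mark that attaches to the preceding base character. A proposal for "superscript" Hebrew letters would need to say which of these two models it intends.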



~mark
