Re: Fw: Latin Script Danda

2019-04-19 Thread Asmus Freytag via Unicode

  
  
On 4/19/2019 6:57 PM, Shriramana Sharma
  via Unicode wrote:


  
  
I don't know many modern fonts that display 007C
  as a broken glyph. In fact I haven't seen a broken line pipe
  glyph since the MS-DOS days. Nowadays we have 00A6 for that.


  

Same here. In fact, couldn't find any
example among installed fonts on a Windows 7 (not  even Windows
10) system before running out of patience. That seems to
indicate that the disunification of vertical bar and broken bar
was complete 10 years ago.
  
A./
  
  



Re: Fw: Latin Script Danda

2019-04-19 Thread Shriramana Sharma via Unicode
I don't know many modern fonts that display 007C as a broken glyph. In fact
I haven't seen a broken line pipe glyph since the MS-DOS days. Nowadays we
have 00A6 for that.


Fw: Latin Script Danda

2019-04-19 Thread Richard Wordingham via Unicode



Begin forwarded message:

Date: Fri, 19 Apr 2019 11:30:32 +0100
From: Richard Wordingham 
To: Shriramana Sharma 
Subject: Re: Latin Script Danda


On Fri, 19 Apr 2019 11:33:35 +0530
Shriramana Sharma via Unicode  wrote:

> We are using the pipe character as it is readily available in our
> favourite Latin script fonts. See for example:
> https://twitter.com/ShriramanaS/status/793480884116529152  

The broken bar glyph of the pipe character does not feel appropriate.  

> It would be ideal for Sanskrit/Indic text in IAST/ISO to be
> displayable/printable using any common Latin font which is found
> typographically pleasant. For instance the font I have used in that
> Twitter post is Gentium Basic. I use this font for most of my Latin
> script publication purposes (including Unicode documents) and it
> contains the pipe character but it does not contain Devanagari
> characters.  

> It would be difficult to canvas Latin font vendors to include the
> Devanagari characters 0964/0965 on a small technicality of character
> property.  

Font designers for many Indic scripts have had to learn that
U+0964 and U+0965 have the script property of 'Common', not
Devanagari.  I don't trust automated font pickers in that respect,
though.

> Is there a particular reason it's *really* necessary to include Latn
> in the script extension property of 0964/0965?  

No more so than including Indian scripts in the list.  There has been a
threat to use the script extension property in breaking text into
script runs, and U+0964 and U+0965 are often better with
script-sensitive forms.

Richard.


Re: Emoji Haggadah

2019-04-19 Thread Philippe Verdy via Unicode
I cannot; definitely it requires first good knowldge of English (to find
possible synonyms, plus phonetic approximations, including using
abbreviatable words), and Hebrew culture (to guess names and the context).
All this text looks completely random and makes no sense otherwise.

Le mar. 16 avr. 2019 à 04:22, Tex via Unicode  a
écrit :

> Oy veh!
>
>
>
> *From:* Unicode [mailto:unicode-boun...@unicode.org] *On Behalf Of *Mark
> E. Shoulson via Unicode
> *Sent:* Monday, April 15, 2019 5:27 PM
> *To:* unicode@unicode.org
> *Subject:* Emoji Haggadah
>
>
>
> The only thing more disturbing than the existence of The Emoji Haggadah (
> https://www.amazon.com/Emoji-Haggadah-Martin-Bodek/dp/1602803463/) is the
> fact that I'm starting to find that I can read it...
>
>
>
> ~mark
>


Re: Script_extension Property of U+0310 Combining Candrabindu

2019-04-19 Thread Richard Wordingham via Unicode
On Fri, 19 Apr 2019 19:54:47 +0530
Shriramana Sharma  wrote:

> Or maybe the Grantha candrabindu can be used, since there is already
> evidence for mixed usage of the scripts and nukta characters have been
> encoded for Tamil usage in the Grantha block for this same reason
> despite Grantha users objecting to it as unattested!

That seems to be the approved solution - the script_extension property
of U+11301 GRANTHA SIGN CANDRABINDU is {Gran, Taml}.

Richard.


Re: Script_extension Property of U+0310 Combining Candrabindu

2019-04-19 Thread Richard Wordingham via Unicode
On Fri, 19 Apr 2019 11:36:16 +0530
Shriramana Sharma via Unicode  wrote:

> On 4/19/19, Richard Wordingham via Unicode 
> wrote:
> > That's a fair point.  My problem is that someone is claiming of
> > U+0310 that "Somewhere in the Unicode specifications is a footnote
> > saying it is to be used with Devanagari".  
> 
> Why would anyone want to use 0310 with any Indic script that already
> has a candrabindu?

I know any such footnote would be wrong.  Disproving it ever existed is
trickier.  I can imagine a statement that it "represents the Devanagari
candrabindu", which could after the passage of years change into the
claim in someone's human memory.

> > However, some people get rather upset with the idea of using the
> > general combining diacritics in Indic scripts.  

> Many Vedic svara characters have lookalikes among the Combining
> Diacritics but they were encoded anyway since IIUC the UTC felt that
> separate characters would help preserving sanity in implementing text
> shaping engines or such.

That reminds me - what if anything is happening about Tamil script
candrabindu? You reported that U+0310 was being used in that rôle.

Richard.



Re: Script_extension Property of U+0310 Combining Candrabindu

2019-04-19 Thread Shriramana Sharma via Unicode
On 4/19/19, Richard Wordingham via Unicode  wrote:
> That's a fair point.  My problem is that someone is claiming of
> U+0310 that "Somewhere in the Unicode specifications is a footnote
> saying it is to be used with Devanagari".

Why would anyone want to use 0310 with any Indic script that already
has a candrabindu?

> However, some people get rather upset with the idea of using the
> general combining diacritics in Indic scripts.

Many Vedic svara characters have lookalikes among the Combining
Diacritics but they were encoded anyway since IIUC the UTC felt that
separate characters would help preserving sanity in implementing text
shaping engines or such.

-- 
Shriramana Sharma ஶ்ரீரமணஶர்மா श्रीरमणशर्मा ူ၆ိျိါအူိ၆ါး



Re: Latin Script Danda

2019-04-19 Thread Shriramana Sharma via Unicode
We are using the pipe character as it is readily available in our
favourite Latin script fonts. See for example:
https://twitter.com/ShriramanaS/status/793480884116529152

It would be ideal for Sanskrit/Indic text in IAST/ISO to be
displayable/printable using any common Latin font which is found
typographically pleasant. For instance the font I have used in that
Twitter post is Gentium Basic. I use this font for most of my Latin
script publication purposes (including Unicode documents) and it
contains the pipe character but it does not contain Devanagari
characters.

It would be difficult to canvas Latin font vendors to include the
Devanagari characters 0964/0965 on a small technicality of character
property.

Is there a particular reason it's *really* necessary to include Latn
in the script extension property of 0964/0965?


-- 
Shriramana Sharma ஶ்ரீரமணஶர்மா श्रीरमणशर्मा ူ၆ိျိါအူိ၆ါး