Re: Counting Devanagari Aksharas

2017-04-20 Thread Shriramana Sharma via Unicode
Hello Richard. Yes my earlier reply wasn't intended to be offlist. I have near-zero knowledge about non-Indic languages. All I can say is that Tamil script has eschewed most consonant cluster ligatures/conjoining forms. As for Devanagari, writing श्रीमान्‌को (I used ZWNJ) i.o. श्रीमान्को is quite

Version linking?

2017-08-17 Thread Shriramana Sharma via Unicode
A propos http://blog.unicode.org/2017/08/unicode-emoji-60-initial-drafts-draft.html I would like to know whether it is intended that Emoji version N will be always targeted at Unicode version N + 5 and published in year N + 2012. I did not find the question or answer at

Re: Version linking?

2017-08-17 Thread Shriramana Sharma via Unicode
Thanks for your reply, but how can characters be used portably if they are not part of the published standard yet? Or is it that hereafter both Unicode Standard + Unicode Emoji Standard will be parallelly portable or something like that? -- Shriramana Sharma ஶ்ரீரமணஶர்மா श्रीरमणशर्मा

Wagging finger emoji?

2017-07-10 Thread Shriramana Sharma via Unicode
Hello. Searching UnicodeData.txt for emoji-s with the word "finger" I am getting: 1F590;RAISED HAND WITH FINGERS SPLAYED;So;0;ON;N; 1F591;REVERSED RAISED HAND WITH FINGERS SPLAYED;So;0;ON;N; 1F595;REVERSED HAND WITH MIDDLE FINGER EXTENDED;So;0;ON;N; 1F596;RAISED HAND WITH

Re: Turtle Graphics Emoji

2017-07-28 Thread Shriramana Sharma via Unicode
for animal in animalKingdom: createEmojiProposal(animal) ☺ Emoji are a veritable Pandora box.

Ah the power of emoji! To encompass even science and mythology!

2017-08-23 Thread Shriramana Sharma via Unicode
 <-- lunar eclipse  <-- solar eclipse  <-- apocalypse https://twitter.com/AstroKatie/status/518697246305439745 ☺ Not new (2014) but I hadn't seen this till today and felt it a propos re the recent pair of eclipses. Shriramana Sharma.

Re: Unicode education in Schools

2017-08-24 Thread Shriramana Sharma via Unicode
So how do you think it matters if the characters are in the BMP or SMP?

RE: Unicode education in Schools

2017-08-24 Thread Shriramana Sharma via Unicode
IIUC the limitation seems to be only that functions such as "charAt" do not recognize that surrogates aren't valid characters: https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/String/charAt via https://stackoverflow.com/a/8716157/1503120. This is a problem of many

Re: Assamese and Unicode.

2017-09-05 Thread Shriramana Sharma via Unicode
On 9/5/17, Martin J. Dürst via Unicode wrote: > The best thing to do is to have lot's of content in Assamese in Unicode. > This will show that things just work. IIUC the problem is with Assamese not accepting the label "Bengali" to "their" script. AFAICS they do not deny

\b and Indic word boundaries?

2017-12-02 Thread Shriramana Sharma via Unicode
Hello. Yesterday I reported https://bugs.python.org/issue32198 but then was pointed to already existing https://bugs.python.org/issue1693050 and friends. >From reading these I came to find \b under https://unicode.org/reports/tr18/#Compatibility_Properties. I confess I don't entirely grok all

Popular wordprocessors treating U+00A0 as fixed-width

2017-12-31 Thread Shriramana Sharma via Unicode
While http://unicode.org/reports/tr14/ clearly states that: When expanding or compressing interword space according to common typographical practice, only the spaces marked by U+0020 SPACE and U+00A0 NO-BREAK SPACE are subject to compression, and only spaces marked by U+0020 SPACE, U+00A0

Emoji for major planets at least?

2018-01-18 Thread Shriramana Sharma via Unicode
Hello people. We have sun, earth and moon emoji (3 for the earth and more for the moon's phases). But we don't have emoji for the rest of the planets. We have astrological symbols for all the planets and a few non-existent imaginary "planets" as well. Given this, would it be impractical to

Re: 0027, 02BC, 2019, or a new character?

2018-01-16 Thread Shriramana Sharma via Unicode
Rejecting the digraph method (which is probably the simplest) doesn't have much meaning because they have different sounds in different languages all the time like ch in English and German. Anyhow, it certainly can be difficult convincing non technical political people. Modifier letters are more

Re: 0027, 02BC, 2019, or a new character?

2018-01-19 Thread Shriramana Sharma via Unicode
t via Unicode <unicode@unicode.org> > wrote: > > > > On 19 January 2018 at 09:16, Shriramana Sharma via Unicode > > <unicode@unicode.org> wrote: > >> Wow. Somebody really needs to convey this to the Kazhaks. Else a > >> short-sighted decision would rui

Re: 0027, 02BC, 2019, or a new character?

2018-01-19 Thread Shriramana Sharma via Unicode
Wow. Somebody really needs to convey this to the Kazhaks. Else a short-sighted decision would ruin their chances at native IDNs. Any Kazhaks on this list? On 19-Jan-2018 00:23, "Asmus Freytag via Unicode" wrote: > Top level IDN domain names can not contain 02BC, nor 0027 or

Re: 0027, 02BC, 2019, or a new character?

2018-01-26 Thread Shriramana Sharma via Unicode
But your outgoing "From" address doesn't seem to have an accent!? On 26-Jan-2018 13:58, "Andre Schappo via Unicode" wrote: > > Talking of typing names correctly. Few people bother to type the acute > accent in André. > > This academic year, for the first time ever, I gave

Re: 0027, 02BC, 2019, or a new character?

2018-01-24 Thread Shriramana Sharma via Unicode
On 23-Jan-2018 10:03, "James Kass via Unicode" wrote: (bottle, east, skier, crucial, cherry) s'i's'a, s'yg'ys, s'an'g'ys'y, s'es'u's'i, s'i'i'e sxixsxa, sxygxys, sxanxgxysxy, sxesxuxsxi, sxixixe s̈ïs̈a, s̈yg̈ys, s̈an̈g̈ys̈y, s̈es̈üs̈i, s̈ïïe śíśa, śyǵys,

Re: 0027, 02BC, 2019, or a new character?

2018-01-24 Thread Shriramana Sharma via Unicode
On 24-Jan-2018 00:25, "Doug Ewell via Unicode" wrote: I think it's so cute that some of us think we can advise Nazarbayev on whether to use straight or curly apostrophes or accents or x's or whatever. Like he would listen to a bunch of Western technocrats. Sir why this

Re: Emoji blooper

2018-02-13 Thread Shriramana Sharma via Unicode
To illustrate… -- Shriramana Sharma ஶ்ரீரமணஶர்மா श्रीरमणशर्मा ူ၆ိျိါအူိ၆ါး

Emoji blooper

2018-02-13 Thread Shriramana Sharma via Unicode
Recently sent this message to a friends list:  Apparently one font has the trumpet facing left and one has it facing right! So before hitting Send in GMail's web interface, the text appeared fine but after doing so, in my browser it is showing as if the music is emanating from the back of

Re: Why so much emoji nonsense?

2018-02-14 Thread Shriramana Sharma via Unicode
>From a mail which I had sent to two other Unicode contributors just a few days ago: Frankly I agree that this whole emoji thing is a Pandora box. It should have been restricted to emoticons to express facial or physical gestures which are insufficiently representable by words. When it starts

UNICODE vehicle vanity registration?

2018-02-14 Thread Shriramana Sharma via Unicode
Given that in the US vanity vehicle registrations with arbitrary alphanumeric sequences upto 7 characters are permitted (I am correct I hope?), I wonder who (here?) owns the UNICODE registration? -- Shriramana Sharma ஶ்ரீரமணஶர்மா श्रीरमणशर्मा ူ၆ိျိါအူိ၆ါး

Re: UNICODE vehicle vanity registration?

2018-02-14 Thread Shriramana Sharma via Unicode
Sorry but "UNICODE" does fit within those rules doesn't it? On 14-Feb-2018 21:54, "Stephane Bortzmeyer" <bortzme...@nic.fr> wrote: On Wed, Feb 14, 2018 at 09:44:06PM +0530, Shriramana Sharma via Unicode <unicode@unicode.org> wrote a message of 6 lines which said

Re: UNICODE vehicle vanity registration?

2018-02-14 Thread Shriramana Sharma via Unicode
On 14-Feb-2018 22:45, "Alastair Houghton" wrote: I’d hope that Mark Davis has “UNICODE” on his car. However, I’m not sure how relevant it really is to this mailing list. You're right. My apologies. It *is* somewhat OT to the actual purpose of this list. But I

Usage of emoji in coding contexts!

2018-08-08 Thread Shriramana Sharma via Unicode
First time I'm seeing this (maybe others have seen this already): https://github.com/wei/pull Emoji being used in commit messages for classifying the nature of the commit – bug fixes, feature additions etc Now *that*'s a nice creative usage of emoji IMO… I see they haven't used them always as

Re: Tamil Brahmi Short Mid Vowels

2018-07-20 Thread Shriramana Sharma via Unicode
This is a unique problem because this is probably the only case where the same script produces conjuncts for one language and not for another. I had asked for a separate Tamil Brahmi virama to be encoded which would obviate this problem but that was shot down. Maybe that case should be reopened?

Shortcuts question

2018-09-06 Thread Shriramana Sharma via Unicode
Hello. This may be slightly OT for this list but I'm asking it here as it concerns computer usage with multiple scripts and i18n: 1) Are shortcuts like Ctrl+C changed as per locale? I mean Ctrl+T for "tout" io Ctrl+A for "all"? 2) How about when the shortcuts are the Alt+ combinations referring

Re: 0027, 02BC, 2019, or a new character?

2018-01-19 Thread Shriramana Sharma via Unicode
Announcing: Much ado about apostrophes A Play By William Codesphere Coming soon to a theatre near you... 

Excessive emoji usage and TTS (was Re: A last missing link)

2019-01-10 Thread Shriramana Sharma via Unicode
On Thu 10 Jan, 2019, 20:49 Arthur Reutenauer via Unicode < unicode@unicode.org wrote: > > On this topic, I was just pointed to > > https://twitter.com/kentcdodds/status/1083073242330361856 > > “You 혵혩혪혯혬 it's 풸퓊퓉ℯ to 현헿헶혁헲 your tweets and usernames > 햙햍햎햘 햜햆햞. But > have you 홡홞홨황홚홣홚홙

Emoji boom?

2019-05-01 Thread Shriramana Sharma via Unicode
http://www.unicode.org/L2/L-curdoc.htm The number of emoji-related proposals seems to be increasing compared to the number of script-related ones. Have we reached a plateau re scripts encoding? Somehow this seems sad to me considering the great role Unicode played in bringing Indic scripts

Re: Script_extension Property of U+0310 Combining Candrabindu

2019-04-19 Thread Shriramana Sharma via Unicode
On 4/19/19, Richard Wordingham via Unicode wrote: > That's a fair point. My problem is that someone is claiming of > U+0310 that "Somewhere in the Unicode specifications is a footnote > saying it is to be used with Devanagari". Why would anyone want to use 0310 with any Indic script that

Re: Latin Script Danda

2019-04-19 Thread Shriramana Sharma via Unicode
We are using the pipe character as it is readily available in our favourite Latin script fonts. See for example: https://twitter.com/ShriramanaS/status/793480884116529152 It would be ideal for Sanskrit/Indic text in IAST/ISO to be displayable/printable using any common Latin font which is found

Re: Fw: Latin Script Danda

2019-04-19 Thread Shriramana Sharma via Unicode
I don't know many modern fonts that display 007C as a broken glyph. In fact I haven't seen a broken line pipe glyph since the MS-DOS days. Nowadays we have 00A6 for that.

NBSP supposed to stretch, right?

2019-12-16 Thread Shriramana Sharma via Unicode
Hello. I've just tested LibreOffice, Google Docs and MS Office on Linux, Android and Windows, and it seems that NBSP doesn't get stretched like the normal space character when justified alignment requires it. Let me explain. I'm creating a document with the following text typeset in 12 pt Lohit

Re: NBSP supposed to stretch, right?

2019-12-17 Thread Shriramana Sharma via Unicode
On Tue 17 Dec, 2019, 16:09 QSJN 4 UKR via Unicode, wrote: > Agree. > By the way, it is common practice to use multiple nbsp in a row to > create a larger span. In my opinion, it is wrong to replace fixed > width spaces with non-breaking spaces. > Quote from Microsoft Typography Character design

Not accepted by UTC but in ISO ballot?

2019-12-20 Thread Shriramana Sharma via Unicode
I was looking at the pipeline for something else, and for the first time I see a character category: “not accepted by the UTC but in ISO ballot” and two characters in it. So IIUC while technically people are free to submit a document to the ISO separately without submitting to UTC, it has always

Re: NBSP supposed to stretch, right?

2019-12-20 Thread Shriramana Sharma via Unicode
On 12/21/19, Richard Wordingham via Unicode wrote: > On Fri, 20 Dec 2019 17:25:17 +0530 > Shriramana Sharma via Unicode wrote: > >> I don't expect NBSP to ever disappear, because spaces disappear only >> at linebreaks, and NBSP simply doesn't stand at linebreaks. > &

Long standing problem with Vedic tone markers and post-base visarga/anusvara

2019-12-20 Thread Shriramana Sharma via Unicode
https://github.com/harfbuzz/harfbuzz/issues/2017 should provide the context for this. Ever since the early days of Devanagari Unicode, scholars like me dealing with Vedic Sanskrit orthography have been experiencing this problem, but chalked it upto early days and consequent insufficient support

Re: [EXTERNAL] Re: NBSP supposed to stretch, right?

2019-12-20 Thread Shriramana Sharma via Unicode
On 12/21/19, Murray Sargent wrote: > I checked with the Word team and they actually tried out stretching NBSP > back in 2015 in the "good client" mode. But customer feedback was negative. > The problem is that NBSP is used sometimes when stretching isn't wanted such > as between the end of a

Re: [EXTERNAL] Re: NBSP supposed to stretch, right?

2019-12-20 Thread Shriramana Sharma via Unicode
On 12/21/19, Shriramana Sharma wrote: > 1) > > With the existing single NBSP character, provide a software option to > either make it flexible or inflexible, but this preference should be > stored as part of the document and not the application settings, else > shared documents would not preserve

Re: NBSP supposed to stretch, right?

2019-12-21 Thread Shriramana Sharma via Unicode
On 12/19/19, James Kass via Unicode wrote: > > There's a bug report for the LibreOffice application here... > https://bugs.documentfoundation.org/show_bug.cgi?id=41652 > ...which shows an interesting history of the situation. LOL two years ago almost to the date Shriramana Sharma seems to have

Re: Re: NBSP supposed to stretch, right?

2019-12-22 Thread Shriramana Sharma via Unicode
So I was wondering whether TeX only does this to the ~ input character or the actual NBSP Unicode character too?

Re: emojis for mouse buttons?

2019-12-31 Thread Shriramana Sharma via Unicode
Why are these called "emojis" for mouse buttons rather than just "characters" for them? On Tue, 31 Dec, 2019, 18:45 Philippe Verdy via Unicode, wrote: > A lot of application need to document their keymap and want to display > keys. > > For now there are emojis for mouses (several variants: 1, 2

Re: Not accepted by UTC but in ISO ballot?

2019-12-27 Thread Shriramana Sharma via Unicode
Dec, 2019, 07:19 Ken Whistler, wrote: > Shriramana, > > On 12/20/2019 6:29 PM, Shriramana Sharma via Unicode wrote: > > I was looking at the pipeline for something else, and for the first > > time I see a character category: “not accepted by the UTC but in ISO > > ballot”