Hello Richard. Yes my earlier reply wasn't intended to be offlist. I
have near-zero knowledge about non-Indic languages.
All I can say is that Tamil script has eschewed most consonant cluster
ligatures/conjoining forms. As for Devanagari, writing श्रीमान्को (I
used ZWNJ) i.o. श्रीमान्को is quite
A propos
http://blog.unicode.org/2017/08/unicode-emoji-60-initial-drafts-draft.html
I would like to know whether it is intended that Emoji version N will
be always targeted at Unicode version N + 5 and published in year N +
2012.
I did not find the question or answer at
Thanks for your reply, but how can characters be used portably if they
are not part of the published standard yet? Or is it that hereafter
both Unicode Standard + Unicode Emoji Standard will be parallelly
portable or something like that?
--
Shriramana Sharma ஶ்ரீரமணஶர்மா श्रीरमणशर्मा
Hello. Searching UnicodeData.txt for emoji-s with the word "finger" I
am getting:
1F590;RAISED HAND WITH FINGERS SPLAYED;So;0;ON;N;
1F591;REVERSED RAISED HAND WITH FINGERS SPLAYED;So;0;ON;N;
1F595;REVERSED HAND WITH MIDDLE FINGER EXTENDED;So;0;ON;N;
1F596;RAISED HAND WITH
for animal in animalKingdom:
createEmojiProposal(animal)
☺
Emoji are a veritable Pandora box.
<-- lunar eclipse
<-- solar eclipse
<-- apocalypse
https://twitter.com/AstroKatie/status/518697246305439745
☺
Not new (2014) but I hadn't seen this till today and felt it a propos re
the recent pair of eclipses.
Shriramana Sharma.
So how do you think it matters if the characters are in the BMP or SMP?
IIUC the limitation seems to be only that functions such as "charAt" do not
recognize that surrogates aren't valid characters:
https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/String/charAt
via https://stackoverflow.com/a/8716157/1503120.
This is a problem of many
On 9/5/17, Martin J. Dürst via Unicode wrote:
> The best thing to do is to have lot's of content in Assamese in Unicode.
> This will show that things just work.
IIUC the problem is with Assamese not accepting the label "Bengali" to
"their" script. AFAICS they do not deny
Hello. Yesterday I reported https://bugs.python.org/issue32198 but
then was pointed to already existing
https://bugs.python.org/issue1693050 and friends.
>From reading these I came to find \b under
https://unicode.org/reports/tr18/#Compatibility_Properties.
I confess I don't entirely grok all
While http://unicode.org/reports/tr14/ clearly states that:
When expanding or compressing interword space according to common
typographical practice, only the spaces marked by U+0020 SPACE and
U+00A0 NO-BREAK SPACE are subject to compression, and only spaces
marked by U+0020 SPACE, U+00A0
Hello people.
We have sun, earth and moon emoji (3 for the earth and more for the
moon's phases). But we don't have emoji for the rest of the planets.
We have astrological symbols for all the planets and a few
non-existent imaginary "planets" as well.
Given this, would it be impractical to
Rejecting the digraph method (which is probably the simplest) doesn't have
much meaning because they have different sounds in different languages all
the time like ch in English and German.
Anyhow, it certainly can be difficult convincing non technical political
people.
Modifier letters are more
t via Unicode <unicode@unicode.org>
> wrote:
> >
> > On 19 January 2018 at 09:16, Shriramana Sharma via Unicode
> > <unicode@unicode.org> wrote:
> >> Wow. Somebody really needs to convey this to the Kazhaks. Else a
> >> short-sighted decision would rui
Wow. Somebody really needs to convey this to the Kazhaks. Else a
short-sighted decision would ruin their chances at native IDNs. Any Kazhaks
on this list?
On 19-Jan-2018 00:23, "Asmus Freytag via Unicode"
wrote:
> Top level IDN domain names can not contain 02BC, nor 0027 or
But your outgoing "From" address doesn't seem to have an accent!?
On 26-Jan-2018 13:58, "Andre Schappo via Unicode"
wrote:
>
> Talking of typing names correctly. Few people bother to type the acute
> accent in André.
>
> This academic year, for the first time ever, I gave
On 23-Jan-2018 10:03, "James Kass via Unicode" wrote:
(bottle, east, skier, crucial, cherry)
s'i's'a, s'yg'ys, s'an'g'ys'y, s'es'u's'i, s'i'i'e
sxixsxa, sxygxys, sxanxgxysxy, sxesxuxsxi, sxixixe
s̈ïs̈a, s̈yg̈ys, s̈an̈g̈ys̈y, s̈es̈üs̈i, s̈ïïe
śíśa, śyǵys,
On 24-Jan-2018 00:25, "Doug Ewell via Unicode" wrote:
I think it's so cute that some of us think we can advise Nazarbayev on
whether to use straight or curly apostrophes or accents or x's or
whatever. Like he would listen to a bunch of Western technocrats.
Sir why this
To illustrate…
--
Shriramana Sharma ஶ்ரீரமணஶர்மா श्रीरमणशर्मा ူ၆ိျိါအူိ၆ါး
Recently sent this message to a friends list:
Apparently one font has the trumpet facing left and one has it facing
right! So before hitting Send in GMail's web interface, the text
appeared fine but after doing so, in my browser it is showing as if
the music is emanating from the back of
>From a mail which I had sent to two other Unicode contributors just a
few days ago:
Frankly I agree that this whole emoji thing is a Pandora box. It
should have been restricted to emoticons to express facial or physical
gestures which are insufficiently representable by words. When it
starts
Given that in the US vanity vehicle registrations with arbitrary
alphanumeric sequences upto 7 characters are permitted (I am correct I
hope?), I wonder who (here?) owns the UNICODE registration?
--
Shriramana Sharma ஶ்ரீரமணஶர்மா श्रीरमणशर्मा ူ၆ိျိါအူိ၆ါး
Sorry but "UNICODE" does fit within those rules doesn't it?
On 14-Feb-2018 21:54, "Stephane Bortzmeyer" <bortzme...@nic.fr> wrote:
On Wed, Feb 14, 2018 at 09:44:06PM +0530,
Shriramana Sharma via Unicode <unicode@unicode.org> wrote
a message of 6 lines which said
On 14-Feb-2018 22:45, "Alastair Houghton"
wrote:
I’d hope that Mark Davis has “UNICODE” on his car. However, I’m not sure
how relevant it really is to this mailing list.
You're right. My apologies. It *is* somewhat OT to the actual purpose of
this list. But I
First time I'm seeing this (maybe others have seen this already):
https://github.com/wei/pull
Emoji being used in commit messages for classifying the nature of the
commit – bug fixes, feature additions etc
Now *that*'s a nice creative usage of emoji IMO…
I see they haven't used them always as
This is a unique problem because this is probably the only case where the
same script produces conjuncts for one language and not for another. I had
asked for a separate Tamil Brahmi virama to be encoded which would obviate
this problem but that was shot down. Maybe that case should be reopened?
Hello. This may be slightly OT for this list but I'm asking it here as it
concerns computer usage with multiple scripts and i18n:
1) Are shortcuts like Ctrl+C changed as per locale? I mean Ctrl+T for
"tout" io Ctrl+A for "all"?
2) How about when the shortcuts are the Alt+ combinations referring
Announcing:
Much ado about apostrophes
A Play
By
William Codesphere
Coming soon to a theatre near you...
On Thu 10 Jan, 2019, 20:49 Arthur Reutenauer via Unicode <
unicode@unicode.org wrote:
>
> On this topic, I was just pointed to
>
> https://twitter.com/kentcdodds/status/1083073242330361856
>
> “You 혵혩혪혯혬 it's 풸퓊퓉ℯ to 현헿헶혁헲 your tweets and usernames
> 햙햍햎햘 햜햆햞. But
> have you 홡홞홨황홚홣홚홙
http://www.unicode.org/L2/L-curdoc.htm
The number of emoji-related proposals seems to be increasing compared
to the number of script-related ones.
Have we reached a plateau re scripts encoding?
Somehow this seems sad to me considering the great role Unicode played
in bringing Indic scripts
On 4/19/19, Richard Wordingham via Unicode wrote:
> That's a fair point. My problem is that someone is claiming of
> U+0310 that "Somewhere in the Unicode specifications is a footnote
> saying it is to be used with Devanagari".
Why would anyone want to use 0310 with any Indic script that
We are using the pipe character as it is readily available in our
favourite Latin script fonts. See for example:
https://twitter.com/ShriramanaS/status/793480884116529152
It would be ideal for Sanskrit/Indic text in IAST/ISO to be
displayable/printable using any common Latin font which is found
I don't know many modern fonts that display 007C as a broken glyph. In fact
I haven't seen a broken line pipe glyph since the MS-DOS days. Nowadays we
have 00A6 for that.
Hello. I've just tested LibreOffice, Google Docs and MS Office on
Linux, Android and Windows, and it seems that NBSP doesn't get
stretched like the normal space character when justified alignment
requires it.
Let me explain. I'm creating a document with the following text
typeset in 12 pt Lohit
On Tue 17 Dec, 2019, 16:09 QSJN 4 UKR via Unicode,
wrote:
> Agree.
> By the way, it is common practice to use multiple nbsp in a row to
> create a larger span. In my opinion, it is wrong to replace fixed
> width spaces with non-breaking spaces.
> Quote from Microsoft Typography Character design
I was looking at the pipeline for something else, and for the first
time I see a character category: “not accepted by the UTC but in ISO
ballot” and two characters in it.
So IIUC while technically people are free to submit a document to the
ISO separately without submitting to UTC, it has always
On 12/21/19, Richard Wordingham via Unicode wrote:
> On Fri, 20 Dec 2019 17:25:17 +0530
> Shriramana Sharma via Unicode wrote:
>
>> I don't expect NBSP to ever disappear, because spaces disappear only
>> at linebreaks, and NBSP simply doesn't stand at linebreaks.
>
&
https://github.com/harfbuzz/harfbuzz/issues/2017 should provide the
context for this.
Ever since the early days of Devanagari Unicode, scholars like me
dealing with Vedic Sanskrit orthography have been experiencing this
problem, but chalked it upto early days and consequent insufficient
support
On 12/21/19, Murray Sargent wrote:
> I checked with the Word team and they actually tried out stretching NBSP
> back in 2015 in the "good client" mode. But customer feedback was negative.
> The problem is that NBSP is used sometimes when stretching isn't wanted such
> as between the end of a
On 12/21/19, Shriramana Sharma wrote:
> 1)
>
> With the existing single NBSP character, provide a software option to
> either make it flexible or inflexible, but this preference should be
> stored as part of the document and not the application settings, else
> shared documents would not preserve
On 12/19/19, James Kass via Unicode wrote:
>
> There's a bug report for the LibreOffice application here...
> https://bugs.documentfoundation.org/show_bug.cgi?id=41652
> ...which shows an interesting history of the situation.
LOL two years ago almost to the date Shriramana Sharma seems to have
So I was wondering whether TeX only does this to the ~ input character or
the actual NBSP Unicode character too?
Why are these called "emojis" for mouse buttons rather than just
"characters" for them?
On Tue, 31 Dec, 2019, 18:45 Philippe Verdy via Unicode,
wrote:
> A lot of application need to document their keymap and want to display
> keys.
>
> For now there are emojis for mouses (several variants: 1, 2
Dec, 2019, 07:19 Ken Whistler, wrote:
> Shriramana,
>
> On 12/20/2019 6:29 PM, Shriramana Sharma via Unicode wrote:
> > I was looking at the pipeline for something else, and for the first
> > time I see a character category: “not accepted by the UTC but in ISO
> > ballot”
44 matches
Mail list logo