On 6/10/2016 5:34 PM, Andrew Cunningham
wrote:
There is the logic of how kikakui numbers are encoded
in Unicode and there is the internal logic of the numeral system
itself. They are not necessarily the same.
This statement should be framed!
A./
On 6/6/2016 9:09 AM, Markus Scherer
wrote:
Interesting discussion!
ICU does not support "is" nor "in"
prefixes. I wasn't even aware that UAX #44 loose matching
prescribes "is". ICU just implements what
On 4/12/2016 11:47 AM, Philippe Verdy
wrote:
Wow! They name that technology "Advanced Weather
Interactive Processing System"...
The announcement is straight from "Dilbert" : ... Service Level
Management Process defines a life cycle of
On 4/8/2016 9:48 AM, Johannes
Bergerhausen wrote:
Dear list,
after 10 years it was time to redesign www.decodeunicode.org
the address with www is not found.
A./
I am pleased to report that a beta is online now. There are still
On 4/1/2016 4:20 AM, Janusz S. "Bień"
wrote:
Both an index and the descriptions can coexist.
Personally I think the index should be the primary source. The problem
is to find or to desing an appropriate software which would allow to use
the available manpower
On 3/29/2016 12:16 AM, Janusz S. "Bień"
wrote:
The document I refer to is a ISO/IEC document. As far as I know, ISO is
quite crazy about copyright. Does the Unicode Consortium policy apply to
this document? If so, then on which principle? An explicit agreement
with
On 3/28/2016 9:40 PM, Janusz S. "Bień"
wrote:
If you seriously wanted to present "all that is known about a
> character" you would need to excerpt all mentions of it in the core
> specification, as well as (potentially) any additional details
> presented
On 3/28/2016 4:59 AM, Mark Davis ☕️ wrote:
The listing has both the block name and the Nameslist subhead label in
listing characters. One can also use the subhead labels in filtering, eg
http://unicode.org/cldr/utility/list-unicodeset.jsp?a=\p{subhead=Archaic%20letters}
On 3/26/2016 2:10 AM, Janusz S. "Bień"
wrote:
On Thu, Mar 10 2016 at 22:40 CET, kenwhist...@att.net writes:
[...]
The *reason* that NamesList.txt exists at all is to drive the tool, unibook,
that formats the full Unicode code charts for posting.
On 3/20/2016 12:11 AM, Janusz S. Bien
wrote:
Quote/Cytat - Andrew Cunningham
(Sun 20 Mar 2016 12:06:29 AM CET):
Hi Don,
Latin is fine if you keep to simple well made fonts and avoid
using
On 3/18/2016 11:48 AM, Philippe Verdy
wrote:
East Asian vertical presentation does not just stack
the elements on top of each other, very frequently they rotate
them (including Latin/Greek/Cyrillic letters) So this is not
really a new complication.
On 3/16/2016 11:11 PM, Philippe Verdy
wrote:
"Disunification may be an answer?" We should avoid
it as well.
Disunification is only acceptable when
- there's a complete disunification of concepts
I
On 3/15/2016 8:14 PM, David Faulks
wrote:
As part of my investigations into astrological symbols, I'm beginning to wonder if glyph variations are justifications for separate encoding of symbols I would have previously considered the same or unifiable with symbols
On 3/18/2016 12:33 PM, Marcel Schneider
wrote:
As about decomposing digraphs and ypogegrammeni to apply swapcase: That probably would be doing no good, as itʼs unnecessary and users wonʼt expect it.
That was my intuition as well, but based on a
On 3/18/2016 11:11 AM, Garth Wallace
wrote:
The enclosure could also be something else than a circle (or arcs of
> circle): it could be a rectangle, hintable with joiners (like with circles)
> to create an enclosing square, or a rounded rectangle (hintable
On 3/15/2016 7:42 AM, Doug Ewell wrote:
Asmus Freytag wrote:
• don't ask for additions or changes
Additions and changes to annotations are considered all the time.
Well, yes. I meant additions and changes
On 3/14/2016 11:22 AM, Doug Ewell wrote:
Ken Whistler wrote:
The trick is this: the status of annotational data in NamesList.txt
is different than that of normative data like the code points, names,
formal name aliases, decomposition mappings, and standardized
variation sequences.
I get that.
On 3/14/2016 11:33 AM, Janusz S. Bien wrote:
Quote/Cytat - Doug Ewell (pon, 14 mar 2016, 19:22:14):
Ken Whistler wrote:
The trick is this: the status of annotational data in NamesList.txt
is different than that of normative data like the code points, names,
formal name
On 3/12/2016 10:55 PM, Janusz S. "Bień"
wrote:
In fact, the possibility of reuse in this context probably among the
> unstated rationales for making the information and syntax available in
> the first place.
I understand there is no intention
On 3/10/2016 5:49 PM, "J. S. Choi" wrote:
One thing about NamesList.txt is that, as far as I have been able to tell, it’s
the only machine-readable, parseable source of those annotations and
cross-references.
There are explanations about character use that are only maintained in
the PDF of
On 3/10/2016 2:14 PM, Doug Ewell wrote:
Ken Whistler wrote:
NamesList.txt should *not* be data mined.
And yet it was the only Unicode data file utilized by MSKLC.
There are many possible reasons for this approach, which we will
probably
On 3/9/2016 7:08 PM, Oren Watson wrote:
I was
surprised to find out that there are gaps in the
Mathematical alphanumeric symbols block (U+1d400 to
u+1d7ff). The gaps are associated with the inclusion of
On 3/7/2016 12:58 PM, Peter Constable
wrote:
I
know you’re not proposing anything and just providing info
for discussion. I want to make sure it’s clear to others
that there is no requirement for
On 3/4/2016 8:51 AM, Doug Ewell wrote:
Pierpaolo Bernardi wrote:
And the NamesList.txt of Unicode Character Database gives the
description: Japanese Hinamatsuri or girls' doll festival. Aren't
they the authorities to let the emoji look like
usage remains for emotions ; starting in the 1970's these were
ASCII art symbols such as the famous :-)
2016-02-29 23:24 GMT+01:00 Asmus Freytag (t) <asmus-...@ix.netcom.com>:
On 2/29/2016 1:55 PM, Philippe Verdy wrote:
. Well emojis were initially designed to track
On 2/29/2016 1:55 PM, Philippe Verdy wrote:
. Well emojis were initially designed to track amotions and form a
sort of new language,
E-moji means "picture-character" in Japanese, has nothing to do (at
first) with emotions.
A./
On 2/28/2016 11:14 PM, Tex Texin wrote:
However, how any of this belongs on the Unicode list is beyond me.
Surely we do not need to comment on every use of emoji that occurs in
the media.
But there you are mistaken, my dear sir!
We are constantly told that the discussions on this list
On 2/28/2016 9:04 PM, Karl Williamson
wrote:
http://abc27.com/2016/02/27/girl-12-charged-for-threatening-emojis/
"The mother says the girl shouldn’t have been charged."
In civilized countries 12-year-olds would be considered too young to
On 2/21/2016 8:22 AM, Eli Zaretskii
wrote:
From: "Asmus Freytag (t)" <asmus-...@ix.netcom.com>
Date: Sat, 20 Feb 2016 14:10:04 -0800
What about language-independent character-folding: where in the
Unicode data
On 2/20/2016 9:56 AM, Eli Zaretskii
wrote:
From: Philippe Verdy
Date: Sat, 20 Feb 2016 18:27:41 +0100
Cc: unicode Unicode Discussion
Unless we have case folding tailored by language, you cannot do that based on
Mats,
it is really useful to document (let me amend that to "immensely
useful" to document) the required combinations (perhaps in a
Unicode Technical Note on African Writing Systems). It may even be
useful to ask for these sequences to become "named
in the discussions here.
A./
Thank You!
On Mon, Feb 15, 2016 at 2:29 PM, Asmus
Freytag (t) <asmus-...@ix.netcom.com>
wrote:
On 2/15/2016 9:32 AM, Doug
On 2/15/2016 9:32 AM, Doug Ewell wrote:
Asmus Freytag wrote:
with the non-standard symbols like the copyleft, there's the desire to
not encode stuff based on "passing activism".
David Faulks wrote:
The samples I
On 2/14/2016 7:42 PM, António
Martins-Tuválkin wrote:
On
2016.02.15 00:53, Asmus Freytag (t) wrote:
The key issue is whether this usage is
"established".
You can always make the case that what
On 2/14/2016 5:18 PM, Michael Everson
wrote:
On 15 Feb 2016, at 00:53, Asmus Freytag (t) <asmus-...@ix.netcom.com> wrote:
On 2/14/2016 3:36 PM, David Faulks wrote:
Hello,
This subject has been discussed before, bu
On 2/14/2016 3:36 PM, David Faulks
wrote:
Hello,
This subject has been discussed before, but I am somehwat uncertain about something:
If the copyleft (reversed ©) symbol was proposed for encoding, with examples (from PDF files) showing it being used in a similar
On 2/11/2016 6:05 AM, QSJN 4 UKR wrote:
I can show an example of use both, prime (as soft sign) and apostroph
(hemisoft) in Cyrilic-based phonetic transcription (Orthoepic
Dictionary of Ukrainian, http://padaread.com/?book=84816=6
http://padaread.com/?book=84816=7)
On 2/9/2016 1:36 PM, Ken Whistler
wrote:
On 2/9/2016 1:23 PM, David Faulks wrote:
Perhaps Unicode could create a ‘default
position’ property for combining characters, and encourage
OpenType and other font engines to adopt
On 2/9/2016 3:01 PM, Ken Whistler wrote:
Just adapt IndicPositionalCategory.txt for Unibook, and you've got
what you need.
I see. Not quite as simple; Unibook needs overrides that are
specifically able to correct bad fonts, not just "dumb" ones. We may
want to honor some part of the
On 2/8/2016 5:47 PM, Michael Everson
wrote:
It’s what I was taught as the scientific romanization for Russian and Slavic in general.
Michael Everson * http://www.evertype.com/
Source?
A./
On 2/8/2016 6:39 PM, Charlie Ruland
wrote:
Am 09.02.2016 schrieb Asmus Freytag (t):
On 2/8/2016 5:47 PM, Michael
Everson wrote:
It’s what I was taught as the scientific romanization for Russian
On 2/7/2016 1:02 PM, Chris Jacobs wrote:
David Faulks schreef op 2016-02-07 21:20:
If the members of this mailing list think a proposal including a
separate Eris symbol is acceptable, I will include it in my proposal.
Along with, perhaps, some additional symbols...
A./
David
Seems
On 2/7/2016 3:09 PM, Frédéric Grosshans
wrote:
In a
few words (more details below), I think this character is actually
used beyond optics should be encoded as an emoji with properties
(and aspect) similar to U+1F441 EYE, with a name like EYE SIDE
On 2/6/2016 6:11 AM, David Faulks
wrote:
Hello,
I'm investigating the possibility of adding more astrology symbols to Unicode. There is a branch of Western Astrology known as ‘Uranium Astrology’, or the ‘Hamburg School’, which among other things uses a set of 8
character sets is beyond a standard
curriculum, except perhaps History of Computing or Digital Archaeology :)
Teach them right the first time. They’ll never use a code page.
+1
A./
-Shawn
*From:*Unicode [mailto:unicode-boun...@unicode.org] *On Behalf Of
*Asmus Freytag (t)
*Sent:* January 6
ing on the bunny slope at the UTF-8 ski resort. Slap on your snowboard and practice -- get out there onto the 2-, 3- and 4-byte slopes with the experts!
--Ken
On 1/6/2016 4:09 AM, Andre Schappo wrote:
On 4 Jan 2016, at 16:59, Asmus Freytag (t) wrote:
ASCII sho
question is a variant of the greek letter tau (capital or
lowercase).
The identification to τ is from Asmus Freytag, not me.
Mine is a concurring opinion based on ME's suggestion, but corroborated,
in my view, by the systematic notational conventions and not merely
informed by visual sim
On 1/4/2016 10:41 AM, Michael Everson
wrote:
Certainly it does look more like a very common variant of “tau”
than “pi”
Variant of uppercase tau?
A./
On 1/4/2016 12:15 AM, "Jörg Knappen"
wrote:
Here is a report of a rather strange beast occurring in
historical math printing (work of C. F. Gauß) in thw 19th
century:
my suggestion of pi.
Raymond
From: Asmus Freytag (t)
Se
On 1/4/2016 7:49 AM, Michael Everson
wrote:
Excellent!
Looks like a candidate character for encoding. I’m sure I have some examples of good font designs for the old character in one of my books.
Admitting that a Greek letter inherently makes more
On 1/4/2016 12:06 AM, "Jörg Knappen"
wrote:
Err... in what respect would this symbol be different
from a CAPITAL GREEK LETTER GAMMA?
--Jörg Knappen
A sans-serif L would not have
On 1/4/2016 6:44 AM, Elizabeth J. Pyatt
wrote:
Like some others on the list, I believe Unicode should be mentioned at different points in a programming curriculum, particularly at the time when ASCII would be taught.
ASCII shouldn't be taught, perhaps?
On 1/1/2016 6:00 AM, Andre Schappo
wrote:
Julian,
We have very different POVs on this topic. You raise a number of issues which would take me many many thousands of words to properly discuss. I will attempt a summary discussion of some of the issues.
① IT i18n
On 12/29/2015 4:01 AM, Michael Everson wrote:
On 28 Dec 2015, at 23:48, Asmus Freytag (t) <asmus-...@ix.netcom.com> wrote:
Rather than engage in reflexive ad-hoc unification like this, it would be
useful to find out why U+2142 was disunified from TOP RIGHT CORNER and any
other symbols
On 12/28/2015 1:26 PM, Jonathan Coxhead
wrote:
On 2015-12-25 5:43am, Costello, Roger L. wrote:
Hi Folks,
Here is the upside down capital L, pointing to the left:
⅂ - TURNED SANS-SERIF CAPITAL L (U+2142)
Bing is pathetic. It treats the letter as if it didn't exist
Google maps it to the lowercase, neither allows you to find sites
that use just that character.
A./
On 12/9/2015 11:57 PM, "Jörg Knappen"
wrote:
Since the captial sharp s is easily available to the
public, I see it popping up everywhere in
German publications, mostly in an all caps environment. I
have a small
On 12/9/2015 9:52 AM, Gerrit Ansmann
wrote:
After
the German spelling reform in 1996, "ß" then became a letter of
its own, and words containing the letter "ß" are no longer
equivalent to words containing an "ss" combination instead of
On 12/9/2015 3:49 PM, Hans Meiser
wrote:
Yes, they do it wrong because (1) they don't know better and (2) they let their software convert lower case text into upper case (a feature nearly every typographic software provides).
Yet, if we let the majority of
On 12/9/2015 1:11 PM, Michael Everson
wrote:
On 9 Dec 2015, at 20:57, Gerrit Ansmann wrote:
Proposal: Shouldn't the glyph be amended to match the natural language?
Nothing of this is really
On 11/27/2015 11:55 AM, Plug Gulp
wrote:
Hi,
The Unicode standard 8.0 states in chapter 23, section titled "Cursive
Connection and Ligatures"(printed page #814, PDF page #850) that:
"The zero width joiner and non-joiner characters are designed for use
in plain
On 11/27/2015 5:42 PM, Martin J. Dürst
wrote:
On
2015/11/28 04:55, Plug Gulp wrote:
The Unicode standard 8.0 states in chapter
23, section titled "Cursive
Connection and Ligatures"(printed page #814, PDF page #850)
On 11/26/2015 12:10 AM, "Jörg Knappen"
wrote:
I wonder how this concept relates to mathematical
notation, especially the root sign.
It doesn't, for several reasons.
One, because mathematical
On 11/26/2015 2:41 AM, Philippe Verdy
wrote:
However the proposal for these prepended concatenation marks
does not give any hint about how to compute the extent of the
following clusters above/over/below/around which they will apply
(do they
On 11/26/2015 3:08 AM, Philippe Verdy
wrote:
The related definition for extended grapheme
clusters says:
(
CRLF
| Prepend* (
RI-sequence | Hangul-Syllable | !Control )
On 11/26/2015 4:29 AM, Philippe Verdy wrote:
2015-11-26 12:38 GMT+01:00 Asmus Freytag (t) <asmus-...@ix.netcom.com
<mailto:asmus-...@ix.netcom.com>>:
On 11/26/2015 3:08 AM, Philippe Verdy wrote:
The related definition for extended grapheme clusters says:
( CRLF
On 10/22/2015 9:54 AM, Rick McGowan wrote:
Personally, I think you're getting ahead of yourself. First, you
should demonstrate that you have done research and produced results
that at least some people find so useful and important that they are
eager to implement the findings. Then, once you
When it comes to methods operating on
buffers there's always the tension between viewing the buffer as
text elements vs. as data elements. For some purposes, from error
detection to data cleanup you need to be able to treat the buffer
as data elements. For many
On 10/16/2015 12:18 AM, Marcel
Schneider wrote:
On Thu, 15 Oct 2015 15:46:46 -0700, Leo
Broukhis wrote:
> Along the same lines, should I be able to change
my last name
> officially to Ƃpyxᴎc? (NB all
On 10/6/2015 7:31 AM, Julian Bradfield
wrote:
All browsers I use display spaces in input boxes, and put blobs for
hidden fields. Do you have evidence for broken input fields?
Network keys. That interface seems to consistently give people a
choice
On 10/6/2015 5:24 AM, Sean Leonard
wrote:
And,
why did Unicode deem it necessary to replicate the C1 block at
0x80-0x9F, when all of the control characters (codes) were equally
reachable via ESC 4/0 - 5/15? I understand why it is desirable to
align
On 10/5/2015 7:51 AM, Philippe Verdy wrote:
Not silently ! Even if this removal is required to go on editing, this
must be notified to the user as it may occur in unedited parts of the
file (and it may be the sign that the document is not fully plain
text, so the user should not save the
On 10/4/2015 6:02 AM, Richard
Wordingham wrote:
In the absence of a specific tailoring, is the combination of a lone
surrogate and a combining mark a user-perceived character? Does a lone
surrogate constitute a user-perceived character?
In an editing
On 10/4/2015 12:38 PM, Richard
Wordingham wrote:
On Sun, 4 Oct 2015 10:50:43 -0700
Markus Scherer wrote:
I would not spend any time specifying intricate rules for unpaired
surrogates in 16-bit strings, or out-of range values
On 10/4/2015 5:30 AM, Sean Leonard
wrote:
On
10/3/2015 12:28 PM, Asmus Freytag (t) wrote:
On 10/3/2015 8:15 AM, Sean Leonard wrote:
Thanks.
Well, "DIS 10646" is the Draft Internationa
On 10/4/2015 2:35 PM, Richard
Wordingham wrote:
However my opinion is that ᒏ�ᒺ (using U+FFFD substitution) gives 2
> grapheme clusters, I would prefer a solution that gives 3 grapheme
> clusters, as if the lone surrogate was a line-break control, so that
On 10/4/2015 4:14 PM, Richard
Wordingham wrote:
respect to what to erase or undo.
For sequences that belong to a given language, you can pick the
behavior that makes most sense in them, but for lone surrogates, by
definition you are dealing
On 10/3/2015 8:15 AM, Sean Leonard
wrote:
Thanks.
Well, "DIS 10646" is the Draft International Standard,
particularly Draft 1, from ~1990 or ~1991. (Sometimes it might
have been called 10646.1.) Therefore it would likely only be in
On 9/29/2015 8:40 PM, Sean Leonard
wrote:
I like the definition of "character" in ASCII:
3.3 Character. A member of a set of elements used for the
organization, control, or representation of data.
This, by the way, is the
On 9/15/2015 6:45 PM, Daniel Bünzli
wrote:
Hello,
Is there any guidance on how to combine the information given by grapheme clusters and the east asian width property to do fixed-width layouts in terminal emulators ?
For example if we have:
U+AC01 ( 각 )
On 9/10/2015 11:04 AM, Peter Constable
wrote:
I was having an offline discussion with
someone regarding certain topics that may show up on this list
on occasion, and the question came up of what evidence we
Mark,
it is implied the String Range formulation is a compact form.
Can you prove that it doesn't create any set of strings that can't
be specified in other ways (other than full enumeration of the
strings?).
What about set operations on sets with
On 9/6/2015 10:10 PM, Martin J. Dürst
wrote:
Hello
Ken,
You write "The bitcoin sign and baht symbol are two unrelated
symbols that have some visual similarity.", but don't really give
any supporting information for that claim.
On 9/6/2015 11:23 PM, Richard
Wordingham wrote:
On Thu, 03 Sep 2015 09:32:42 -0700
Rick McGowan wrote:
A proposed update to the LDML specification (UTS #35) will be
available for review as of Monday, September 7 at 06:00 GMT. The
On 9/3/2015 9:15 AM, Shawn Steele
wrote:
If
we have a bunch of ingredients emoji, then do yeast + grain
+ hops emoji combine into beer emoji?
Only if you add heat, time and water
Daniel Bünzli wrote:
Since I implement parts of the Unicode standard I'm interested in
keeping in touch with discussions about the standard and its evolution
from a technical point of view.
I'm however not interested in the encoding point of view and
On 9/3/2015 2:09 AM, Marcel Schneider
wrote:
Asmus already told us that there'll be no
soy beans emoji, by lack of iconicity. However, could there be a
generic BEANS emoji
A coffee bean has a very recognizable shape,
and I personally would
On 9/1/2015 9:37 AM, Doug Ewell wrote:
I have no idea whether my proposal is more or less serious, or more or
less likely to be adopted, than the original.
Well, you didn't consider that each style of
beer may be served in a different style glass. :)
On 8/25/2015 12:07 PM, Richard Wordingham wrote:
On Tue, 25 Aug 2015 09:54:29 +0100 (BST)
William_J_G Overington wjgo_10...@btinternet.com wrote:
Richard Wordingham wrote:
On Mon, 24 Aug 2015 11:00:32 +0100 (BST)
William_J_G Overington wjgo_10...@btinternet.com wrote:
Looking at the
On 8/22/2015 2:47 PM, Richard
Wordingham wrote:
But codepoints are normally orderly until they enter the ISO approval
process. Thereafter, disorder creeps in, and becomes ever more likely
as blocks fill up
Haha, good one.
. The
On 8/22/2015 9:35 AM, Julian Bradfield
wrote:
There is no inherent meaning to the
order of codepoints, it's just convenience.
And for that reason, we have property files to
explicitly give the properties rather than asking the user to "glean"
On 8/18/2015 6:18 AM,
alexwei...@alexweiner.com wrote:
"PUA"?
Private use area.
A./
PS: "underbar" is Swedish for "wonderful". Go figure.
Original Message
Subject: RE: APL Under-bar Characters
From:
On 8/16/2015 6:57 PM,
alexwei...@alexweiner.com wrote:
Bug APL,
After much discussion with The Unicode
Consortium Mailing List,
Can we use this to give the characters
unique names?
On 8/14/2015 11:50 AM, Ken Whistler
wrote:
Garth,
The glyphs for the chess symbols in the 26XX block date from
Unicode 3.0. Most of the symbols redesigned for the Unicode 3.0
charts were done by John M. Fiscella. (See the font
On 8/14/2015 1:26 PM, Garth Wallace
wrote:
Would it be acceptable if I extracted the font from the code chart PDF
Extraction of chart font is never acceptable and violates
the Terms of Use. No exceptions.
and used it as the
On 8/11/2015 11:51 AM, Marcel Schneider
wrote:
All that, and some more, leads me to the
conclusion that when Windows was built, there was often not
enough time to write up the documentation;
Marcel,
please consider that this musing
On 8/8/2015 6:26 AM, Marcel Schneider
wrote:
a useless worsening of the usability and of the usefulness of a product.
Quote of the day.
A./
On 8/8/2015 1:57 PM, Marcel Schneider
wrote:
Now that I know Andrew is the PM for
MSKLC ¹,
Probably Mr Glass wasn't Mr Kaplan's boss, so he is to
overtake a legacy without having been involved in its
generating. I didn't well notice
been required
to learn, and that few people actually do learn to any great extent.
Peter
*From:*Unicode [mailto:unicode-boun...@unicode.org] *On Behalf Of
*Asmus Freytag (t)
*Sent:* Monday, August 3, 2015 12:01 PM
*To:* unicode@unicode.org
*Subject:* Re: Emoji characters for food allergens
301 - 400 of 1250 matches
Mail list logo