Re: Proposing mostly invisible characters

2019-09-13 Thread Christoph Päper via Unicode
uld need to show then, that there are instances where this is not desired. Am 13. Sep. 2019, 07:59, um 07:59, Henri Sivonen via Unicode schrieb: >On Thu, Sep 12, 2019, 15:53 Christoph Päper via Unicode > >wrote: > >> ISHY/SIHY is especially useful for encoding (German) noun compounds >in >&g

Proposing mostly invisible characters

2019-09-12 Thread Christoph Päper via Unicode
Dear Unicoders There are some characters that have no precedent in existing encodings and are also hard to attest directly from printed sources. Can one still make a solid case for encoding those in Unicode? I am thinking of characters that are either invisible (most of the time) or can

Re: Shortcuts question

2018-09-07 Thread Christoph Päper via Unicode
Shriramana Sharma: > > 1) Are shortcuts like Ctrl+C changed as per locale? I mean Ctrl+T for > "tout" io Ctrl+A for "all"? Some are, many are not. For instance, some text editors use a modifier key with F and K instead of B and I for bold ("fett") and italic ("kursiv"). > 2) How about when the

Re: Diacritic marks in parentheses

2018-08-04 Thread Christoph Päper via Unicode
Arthur Reutenauer: > On Thu, Jul 26, 2018 at 03:41:47PM -0700, Mark Davis ☕️ via Unicode wrote: > >> Ein⁽ᵉ⁾ A⁽¨⁾rzt⁽ⁱⁿ⁾ hat eine⁽ⁿ⁾ Studenti⁽ᵉ⁾n gesehen. > > “eine⁽ⁿ⁾ Student⁽ⁱ´ᵉ⁾ⁿ gesehen”. I certainly would not advocate to go to such extremes. My issue was with putting parentheses at

Re: Major vendors changing U+1F52B PISTOL  depiction from firearm to squirt gun

2018-05-24 Thread Christoph Päper via Unicode
Abe Voelker: > > I'm curious if there has been any discussion on all the major vendors > changing this emoji's depiction? ( > https://blog.emojipedia.org/all-major-vendors-commit-to-gun-redesign/) Curiously, this happened right before UTC 155 in a possibly concerted (but at least not

Re: Submissions open for 2020 Emoji

2018-04-23 Thread Christoph Päper via Unicode
Mark Davis: > > In some cases we've have contacted people to see if they want to engage > with other proposers. But to handle larger numbers we'd need a simple, > light-weight way to let people know, while maintaining people's privacy > when they want it. Collaborative editing of (proposal)

RE: Full Emoji List Chart No Longer Displaying Emoji with Skin-tones

2018-03-19 Thread Christoph Päper via Unicode
Ed Borgquist: > > Thanks for the information. Does Unicode make public the source images > received from vendors? Or, is there somewhere else you would recommend for me > to look? Besides Emojipedia.org, you will find most current sets at and many old

RE: Unicode Emoji 11.0 characters now ready for adoption!

2018-03-02 Thread Christoph Päper via Unicode
F'up2: cldr-us...@unicode.org Doug Ewell via unicode@unicode.org: > > I think that is a measurement of locale coverage -- whether the > collation tables and translations of "a.m." and "p.m." and "a week ago > Thursday" are correct and verified -- not character coverage. By the way, the binary

Re: Unicode Emoji 11.0 characters now ready for adoption!

2018-02-28 Thread Christoph Päper via Unicode
announceme...@unicode.org: > > The 157 new Emoji are now available for adoption > , But Unicode 11.0 (which all new emojis but Pirate Flag and Infinity rely upon) is not even in beta yet.

Re: 0027, 02BC, 2019, or a new character?

2018-02-21 Thread Christoph Päper via Unicode
Philippe Verdy: > > I even hope that there will be a setting in all browsers, OS'es, mobiles, > and apps to refuse any colorful rendering, and just render them as > monochromatic symbols. In summary, COMPLETETY DISABLE the colorful > extensions of OpenType made for them. See

Re: 0027, 02BC, 2019, or a new character?

2018-02-20 Thread Christoph Päper via Unicode
Michael Everson: > Why on earth would they use Ch and Sh when 1) C isn’t used by itself > and 2) if you’re using Ǵǵ you may as well use Çç Şş. I would have argued in favor of digraphs for G' and N' as well if there already was a decision for Ch and Sh. Many European orthographies use the

Re: 0027, 02BC, 2019, or a new character?

2018-02-20 Thread Christoph Päper via Unicode
Apparently the presidential decree prescribing the new Kazakh Latin orthography and alphabet has been amended recently. The change completely dumps the previous approach of digraphs with an apostrophe in second position in favor of an acute diacritic mark above the base letters for vowels Á/á,

Re: Why so much emoji nonsense?

2018-02-17 Thread Christoph Päper via Unicode
James Kass: > Asmus Freytag wrote: > >>> Words suffice. We go by what people actually say rather than whatever >>> they might have meant. When we read text, we go by what's written. > >> That is a worthy opinion, but not one that is shared, either in principle >> or in lived practice (...) by

Re: Why so much emoji nonsense?

2018-02-15 Thread Christoph Päper via Unicode
James Kass via Unicode : > Martin J. Dürst > >> The original Japanese cell phone carrier emoji where defined in the >> unassigned area of Shift_JIS, not Unicode. > > Thank you (and another list member) for reminding that it was > originally hacked SJIS rather than proper PUA

Re: 0027, 02BC, 2019, or a new character?

2018-01-23 Thread Christoph Päper via Unicode
James Kass: > > (bottle, east,skier, crucial,cherry) > s'i's'a, s'yg'ys, s'an'g'ys'y, s'es'u's'i, s'i'i'e > sxixsxa, sxygxys, sxanxgxysxy, sxesxuxsxi, sxixixe > s̈ïs̈a,s̈yg̈ys, s̈an̈g̈ys̈y, s̈es̈üs̈i, s̈ïïe > śíśa,śyǵys, śańǵyśy, śeśúśi,

Re: Emoji for major planets at least?

2018-01-19 Thread Christoph Päper via Unicode
Asmus Freytag: > > Saturn, with its rings (even though it's no longer the only one known > with rings) also is iconic and highly recognizable. I lack imagination > as to when someone would want to use it in communication, but I have the > same issue with quite a few recent emoji, some of which

Re: 0027, 02BC, 2019, or a new character?

2018-01-17 Thread Christoph Päper via Unicode
James Kass via Unicode : > > It will probably be the ASCII apostrophe. The stated intent favors > the apostrophe over diacritics or special characters to ensure that > the language can be input to computers with standard keyboards. Yes, this can only mean U+0027, but

Re: Emoji’s

2018-01-11 Thread Christoph Päper via Unicode
jillian mestel: > > I was very disappointed to learn that there are no emojis of portraying a > dominant left hand. See for the general emoji proposal process. This would actually not need a new character being assigned a code point, because existing

Include emoticons in CLDR character annotation?

2017-12-14 Thread Christoph Päper via Unicode
The CLDR Survey Tool is currently open to, among other things, collect improvements to (emoji) character names and keywords. I don't see it being done for any language yet, but wouldn't it make sense to add classic emoticons (like :-) for various smiling emojis), kaomoji (like o/ for Person

Re: Version linking?

2017-08-17 Thread Christoph Päper via Unicode
Mark Davis ☕️: > > E5.0 did have the emoji properties of some 10.0 characters a bit ahead of > time, but only after they were completely locked down. This should be absolutely avoided in the future, because it was effectively the other way around: No changes to the beta characters were possible

Re: Turtle Graphics Emoji

2017-07-29 Thread Christoph Päper via Unicode
Shriramana Sharma via Unicode: > > for animal in animalKingdom: createEmojiProposal(animal) > > Emoji are a veritable Pandora box. It makes sense to look at where (animal) emojis came from and then decide how this might need to be expanded to avoid cultural bias etc. A large bunch of the

Re: First bonafide use (≠ mention) of emoji by an academic publisher?

2017-07-24 Thread Christoph Päper via Unicode
Leonardo Boiko: > > It would just be more > satisfying for me if the blue books were encoded in the font as U+1F4D8s, > rather than U+F02Ds. Or, if the colors are done at a CSS level, as  > U+1F4D5 CLOSED BOOKs or the like. Same goes for the other icons in FA > which *do *have an emoji

Emoji Space

2017-07-17 Thread Christoph Päper via Unicode
As you may know, the combined original Japanese emoji set included three whitespace characters: one was the full width of a (square) emoji, one was half that and the last one was a quarter blank. Their KDDI Shift-JIS codes were F7A9, F7AA and F7AB, respectively, and their internal numeric IDs

Re: LATIN CAPITAL LETTER SHARP S officially recognized

2017-06-30 Thread Christoph Päper via Unicode
Letters in some scripts are a class of two or more characters. Usually, all letters have the same number of such case variants. Rarely, characters may be constituents of different letters within the same script. A closed set of letters, usually with a canonical sort order, makes an alphabet.

Re: The management of the encoding process of emoji

2017-06-19 Thread Christoph Päper via Unicode
William_J_G Overington: > > http://www.unicode.org/L2/L2017/17192-response-cmts.pdf > http://www.unicode.org/L2/L2017/17147-emoji-subcommittee.pdf > >> Two key issues are whether the characters are likely to be popular >> and whether they would be supported by major vendors. > > I am rather

Extended_Pictographic in Unicode Utilities

2017-04-18 Thread Christoph Päper via Unicode
Maybe I just need to be a patient a bit longer, but maybe I'm missing something: Since CLDR and ICU have been updated recently and the Extended_Pictographic codepoint property has been added, shouldn't it be accessible within Unicode Utilities: UnicodeSet?

Re: Proposal to add standardized variation sequences for chess notation

2017-04-10 Thread Christoph Päper
Michael Everson <ever...@evertype.com>: > On 7 Apr 2017, at 23:17, Christoph Päper <christoph.pae...@crissov.de> wrote: > > >> The only connection this has with emoji is that it uses the variation > >> selector system. > > > > As I've shown, that'

Re: Proposal to add standardized variation sequences for chess notation

2017-04-07 Thread Christoph Päper
Garth Wallace <gwa...@gmail.com>: > On Thu, Apr 6, 2017 at 5:19 AM, Christoph Päper <christoph.pae...@crissov.de> > wrote: > > > Although Michael Everson readily dismisses any connection to emojis, (...) > > normal emoji design actually matches "diagram&

Re: Proposal to add standardized variation sequences for chess notation

2017-04-06 Thread Christoph Päper
> Michael Everson hat am 6. April 2017 um 14:57 > geschrieben: > >> That's what this proposal is all about. It's a good and sound proposal, >> except for the empty square. > > Do you mean “except for the light and dark squares without a piece on them” or > “except for the

Eszett variation sequence

2017-04-06 Thread Christoph Päper
U+00DF Latin Letter Sharp S ⟨ß⟩ has at least two rather different visual styles resulting from a ligature of either long and round lowercase S, ⟨ſs⟩, or of long S and normal or tailed lowercase Z, ⟨ſz⟩ or ⟨ſʒ⟩. Most modern typeface designs follow the first style and sometimes the right-hand side

Re: Proposal to add standardized variation sequences for chess notation

2017-04-06 Thread Christoph Päper
Mark Davis ☕️ : > > I'm looking forward to similar postings on checkers and go pieces. (...) > And I'm looking also forward to the ♖+ZWJ+⬛️ (etc) proposal. Well, actually ... Garth Wallace made an important observation in

Re: Proposal to add standardized variation sequences for chess notation

2017-04-06 Thread Christoph Päper
Michael Everson : > > Standardized variation sequences are the best way to achieve this simply and > without needless duplication. :-) I still agree with this assertion. > > The distinction between white/black background might be of a different > > nature. If you have

Re: Emoji Compatibility Symbols

2017-04-05 Thread Christoph Päper
Charlotte Buff : > > That document was very helpful, but unfortunately many of the images are > missing. would fix that.

Re: Proposal to add standardized variation sequences for chess notation

2017-04-02 Thread Christoph Päper
Michael Everson <ever...@evertype.com>: > On 1 Apr 2017, at 21:57, Christoph Päper <christoph.pae...@crissov.de> wrote: > > > □ U+25A1 and, especially, ▨ U+25A8 for empty fields on a board make no > > sense. > > Not so. Think about the data. I do, but I

Re: Unicode Emoji 5.0 characters now final

2017-03-30 Thread Christoph Päper
Charlotte Buff > > > Heck, if your device has a default font that includes CANCEL TAG (...) and > therefore doesn’t render it, > then you won’t even be able to see the difference between a regular, generic > black flag and an emoji that was meant to represent

Re: Unicode Emoji 5.0 characters now final

2017-03-30 Thread Christoph Päper
Philippe Verdy hat am 30. März 2017 um 00:40 geschrieben: > There's no ISO 3166-1 code for Europe at the whole (does it exist legally if > we can't clearly define its borders?) `150` in UN M.49 which ISO 3166-1 was derived from and is compatible with. CLDR could safely adopt

Re: Unicode Emoji 5.0 characters now final

2017-03-29 Thread Christoph Päper
Richard Wordingham : > "Doug Ewell" wrote: > > > "Not recommended," "not standard," "not interoperable," or any other > > term ESC settles on for the 5000+ valid flag sequences that are not > > England, Scotland, and Wales is just a short, easy

Re: Unicode Emoji 5.0 characters now final

2017-03-29 Thread Christoph Päper
Mark Davis ☕️ : > On Tue, Mar 28, 2017 at 11:56 AM, Joan Montané wrote: > > > 1st one: point 4 (Unicode subdivision codes listed in emoji Unicode site) > > arises something like chicken-egg problem. Vendors don't easily add new > > subdivision-flags (because

Re: WAP Pictogram Specification as Emoji Source

2017-02-13 Thread Christoph Päper
Mark Davis ☕️ : > > Given the status of WAP, I don't think there is any particular need to seek > compatibility for it. Much of WAP/WML is deprecated by recent OMA specs, but Pictogram is still an (optional) part of Browsing V2.4 from 2011 – or more specifically: Mobile

Re: WAP Pictogram Specification as Emoji Source

2017-02-13 Thread Christoph Päper
Ken Whistler <kenwhist...@att.net>: > On 2/13/2017 1:39 AM, Christoph Päper wrote: >> - music/rest – is that what 〽️ or 〰️ means? > > The first of those is presumably U+303D PART ALTERNATION MARK, and the second > is probably the notorious U+3030 WAVY DASH. So not emo

Re: WAP Pictogram Specification as Emoji Source

2017-02-13 Thread Christoph Päper
Christoph Päper <christoph.pae...@crissov.de> 2016-01-07: > > I just discovered the WAP Pictogram specification (WAP-213-WAPInterPic), last > published in April 2001 and updated in November 2001. There was a subsequent update by OMA in 2006, but it didn’t change any pictog

Re: Curly Lips Code Point Proposal

2017-01-27 Thread Christoph Päper
Leonardo Boiko : > > That's why U+1F481 INFORMATION DESK PERSON  is listed with the keyword > "sassy" in the Unicode emoji table (besides "tipping hand"). Which helps a > lot, because the keywords are used by input methods to search characters; if > no one bothered

Re: Superscript and Subscript Characters in General Use

2017-01-16 Thread Christoph Päper
Marcel Schneider : > > What typically happens with the correct use of fraction slash on a > collaborative > website like Wikipedia, is that the superscripts and subscripts are restored, JFTR, has been using the fraction slash

WAP Pictogram Specification as Emoji Source

2017-01-06 Thread Christoph Päper
I just discovered the WAP Pictogram specification (WAP-213-WAPInterPic), last published in April 2001 and updated in November 2001. - http://www.wapforum.org/what/technical.htm (requires OMA credentials) - http://www.openmobilealliance.org/tech/affiliates/wap/wap-213-wapinterpic-20010406-a.pdf

Re: a character for an unknown character

2017-01-03 Thread Christoph Päper
Marcel Schneider <charupd...@orange.fr>: > On Sat, 31 Dec 2016 11:01:16 +0100, Christoph Päper wrote: >> >> It has indeed. That’s why two different technologies have to be used to get >> typographically harmonic hexadecimal numbers, e.g. in CSS: … > > Than

Re: a character for an unknown character

2016-12-31 Thread Christoph Päper
Richard Wordingham : > >> Perhaps the letters for hexadecimal digits should have been encoded >> separately? > > The idea has been rejected several times. It has indeed. That’s why two different technologies have to be used to get typographically harmonic

Re: About standardized variants of characters in Dingbat block

2016-12-28 Thread Christoph Päper
Michael Everson <ever...@evertype.com>: > On 27 Dec 2016, at 20:05, Mark Davis ☕️ <m...@macchiato.com> wrote: >> On Tue, Dec 27, 2016 at 7:15 PM, Christoph Päper >> <christoph.pae...@crissov.de> wrote: >>> ✊✋✌️年? >> ​I'd use:​ I guess I’d prefe

Re: About standardized variants of characters in Dingbat block

2016-12-27 Thread Christoph Päper
Mark Davis ☕️ : > > > people are still missing the lizard. ;) > > http://unicode.org/emoji/charts-beta/emoji-list.html#1f98e Um, ✊✋✌️年? The Lizard is a hand sign where the tips of thumb and index finger touch and point sidewards. The other 3 fingers can either also touch

Re: Emoji Packs

2016-12-20 Thread Christoph Päper
Chris Monteleone : > > I would like to download emoji from every available vendor into a well > organized folder with the code points as file names. I assume you’re looking for .

Re: Emoji mappings in Shift JIS / CP932/943

2016-12-03 Thread Christoph Päper
Markus Scherer <markus@gmail.com>: > > > On Fri, Dec 2, 2016 at 4:35 AM, Christoph Päper <christoph.pae...@crissov.de> > wrote: >> Could and should custom vendor extensions like the ones documented in >> [EmojiSources.txt] be included in these map

Emoji mappings in Shift JIS / CP932/943

2016-12-02 Thread Christoph Päper
I understand from - http://www.unicode.org/Public/MAPPINGS/VENDORS/MICSFT/WindowsBestFit/readme.txt that Windows codepage 932 (IBM CP943) is basically (a superset of) Shift-JIS (JIS X 0208 A1). There are at least 3 related mapping files: -

Re: Manatee emoji?

2016-11-23 Thread Christoph Päper
James Kass : > > And, how about [other emoji]? Are any petitions underway for them? For what it’s worth, several weeks ago (before UTC149), I collected all emoji petitions I could find online (and that were in languages I can at least somewhat decipher). I’m excluding

Re: "Oh that's what you meant!: reducing emoji misunderstanding"

2016-11-17 Thread Christoph Päper
Doug Ewell : > > Or, people could just say what they mean, using language. That’s not how language (or communication in general) works. At all.

Re: Why incomplete subscript/superscript alphabet ?

2016-10-06 Thread Christoph Päper
Philippe Verdy : > > But if semantic is your issue, we could insert an invisible Unicode mark of > abbreviation (notably the invisible abbreviation dot, which may be rendered > as a dot in some contexts where distinctions by styles cannot be used, or > could be rendered by

Re: Why incomplete subscript/superscript alphabet ?

2016-10-06 Thread Christoph Päper
Jukka K. Korpela : > > … the solution is to use just “3ème”, perhaps with some method (“above” the > character level) used to format the letters as superscript, when not limited > to plain text … For ordinal numbers, it’s relatively simple to code language-dependent glyph

Re: IJ with accent

2016-09-29 Thread Christoph Päper
a.lukyanov : > > Dutch language writing uses the ligature ij (U+0132, U+0133). When accented, > it should take an accent on each component, > > However, if monolithic ligature ij is used, how one can accent it correctly? JFTR: - ij U+0133 - ij́ U+0133+0301 - ij̋ U+0133+030B -

Re: graphemes

2016-09-28 Thread Christoph Päper
Janusz S. Bień : > On Tue, Sep 27 2016 at 16:28 CEST, christoph.pae...@crissov.de writes: >>> And what is "grapheme" in "technical (i.e. Unicode) jargon"? >> >> It depends on the script (hence Unicode block), but not the writing >> system or language. The line is not always

Re: graphemes

2016-09-27 Thread Christoph Päper
Janusz S. Bień <jsb...@mimuw.edu.pl>:On Sun, Sep 18 2016 at 12:26 CEST, jsb...@mimuw.edu.pl writes:Quote/Cytat - Christoph Päper <christoph.pae...@crissov.de> (pią, 16wrz 2016, 23:51:38):Janusz S. Bień <jsb...@mimuw.edu.pl>:1. Graphemes, if I understand correctly, are language

Re: graphemes (was: "textels")

2016-09-20 Thread Christoph Päper
Julian Bradfield <jcb+unic...@inf.ed.ac.uk>: > On 2016-09-19, Christoph Päper <christoph.pae...@crissov.de> wrote: >> If _encyclopedia, encyclopædia, encyclopaedia_ are all legal spellings of >> the same word in a writing system, a useful linguistic definition of

Re: "textels"

2016-09-18 Thread Christoph Päper
Janusz S. Bien : > > From the Unicode glossary: > >> Grapheme. (1) A minimally distinctive unit of writing in the context of a >> particular writing system.[...] (2) What a user thinks of as a character. > >> User-Perceived Character. What everyone thinks of as a character

Re: "textels"

2016-09-16 Thread Christoph Päper
Janusz S. Bień : > > 1. Graphemes, if I understand correctly, are language dependent, … That’s true in linguistic terminology – well, at least within the more popular schools of thought –, but not in technical (i.e. Unicode) jargon.

Re: Additional Emoji selection factor: Support by "Major Vendors"

2016-09-11 Thread Christoph Päper
Philippe Verdy <verd...@wanadoo.fr>: > 2016-09-11 14:40 GMT+02:00 Christoph Päper <christoph.pae...@crissov.de>: > >> I haven’t been able to find out what constitutes a “major vendor”. Apple, >> Microsoft and Google are certainly ones (…), but what about, for i

Additional Emoji selection factor: Support by "Major Vendors"

2016-09-11 Thread Christoph Päper
Mark Davis ☕️ : > > In order to understand the status of any document in the registry, you need > to also look at the minutes of the meeting where they are discussed, in this > case: http://www.unicode.org/L2/L2016/16121.htm > >> B.14.3 Provisional value for Emoji property

Re: Comment in a leading German newspaper regarding the way UTC and Apple handle Emoji as an attack on Free Speech

2016-09-04 Thread Christoph Päper
Asmus Freytag (c) : > > The central concern of the FAZ article appears to be the role that private > entities play as gate-keepers of modern communication. That's actually a > valid concern (…). The fact that fine distinctions of a technical nature may > have been handled

Re: [UTR#51-8] 1.4.3 Emoji Variation Sequences: Female/Venus and Male/Mars Signs

2016-09-02 Thread Christoph Päper
Philippe Verdy <verd...@wanadoo.fr>:2016-08-25 16:52 GMT+02:00 Christoph Päper <christoph.pae...@crissov.de>:I’ll assure you that a feminism/LGBT shitstorm will be heading for UTC and vendors if binary gender became mandatory for profession emojis. You should not approve Google’s and

Re: [UTR#51-8] 1.4.3 Emoji Variation Sequences: Female/Venus and Male/Mars Signs

2016-09-02 Thread Christoph Päper
Mark Davis ☕️ : > > In order to understand the status of any document in the registry, you need > to also look at the minutes of the meeting where they are discussed, I’ve said it before: Unicode is lacking a proper public issue tracker.

Re: Comment in a leading German newspaper regarding the way UTC and Apple handle Emoji as an attack on Free Speech

2016-09-02 Thread Christoph Päper
Doug Ewell : > [FAZ:] >> ("The Unicode Consortium appears like a reissue of Orwell's Ministry >> of Truth, which replaced the English language by a new one, sweeped >> clean from harmful terms, and which removed "unorthodox" connotations >> from the rest of the words.") > > I

Re: [UTR#51-8] 1.4.3 Emoji Variation Sequences: Female/Venus and Male/Mars Signs

2016-09-02 Thread Christoph Päper
Christoph Päper <christoph.pae...@crissov.de>: > > I just learned that recent Samsung phones already contain emoji > representations for many of these symbols. And I’ve finally seen <http://www.unicode.org/L2/L2016/16087-provisional-value-for-emoji.pdf>. A pointer would have been nice.

Re: I'm excited about the proposal to add a brontosaurus emoji codepoint

2016-08-31 Thread Christoph Päper
Leonardo Boiko : > > All of the current face expression emoji are to be decomposed as FACE plus > abstract combining characters; for example, U+1F642 SLIGHTLY SMILING FACE > will be considered a compatibility variant of FACE + COMBINING SMILE + > COMBINING SLIGHT

Re: [UTR#51-8] 1.4.3 Emoji Variation Sequences: Female/Venus and Male/Mars Signs

2016-08-27 Thread Christoph Päper
Garth Wallace <gwa...@gmail.com>: > On Fri, Aug 26, 2016 at 4:07 AM, Christoph Päper > <christoph.pae...@crissov.de> wrote: >> I just learned that recent Samsung phones already contain emoji >> representations for many of these symbols. >> >&

Re: [UTR#51-8] 1.4.3 Emoji Variation Sequences: Female/Venus and Male/Mars Signs

2016-08-26 Thread Christoph Päper
Christoph Päper <christoph.pae...@crissov.de>: > > No, but many, perhaps most of ‘General Category = Other_Symbol (So), Script = > Common, Bidirectional Category = Other_Neutral (ON)’ probably and few others > (e.g. with ‘Bidirectional Category = L’). That’s little more than

Re: [UTR#51-8] 1.4.3 Emoji Variation Sequences: Female/Venus and Male/Mars Signs

2016-08-25 Thread Christoph Päper
TL;DR: Unicode properties should reflect user expectations, not vendor choices. Mark Davis ☕️ <m...@macchiato.com>: > On Mon, Aug 22, 2016 at 11:26 PM, Christoph Päper > <christoph.pae...@crissov.de> wrote: >> 1. it’s incomplete without an explicit neutral/ambiguous al

Re: [UTR#51-8] 1.4.3 Emoji Variation Sequences: Female/Venus and Male/Mars Signs

2016-08-22 Thread Christoph Päper
JFTR, on the meantime, I had already resubmitted my original post through the proper feedback channel as a comment on the current draft of TR51. Mark Davis ☕️ : > > Similarity or containment in the same block as current emoji characters is > not sufficient grounds for

"Emojis" in Reading Texts for Beginners

2016-08-21 Thread Christoph Päper
Are in-line pictures in reading instruction books, standing in mostly for nouns, considered supporting proof of existing use of proposed symbols or emojis? I recently realized, reading a children’s book to/with my sons, that a lot of the pictograms – I estimated 80% in my sample – could

Fwd: Text vs. Emoji: Default Text Style; no VS

2016-08-18 Thread Christoph Päper
The first column of the table on that page is completely empty. I assume it was needed in a previous version, but now it should be removed or am I missing something? Total counts of characters for cells, rows and columns would be nice.

Emoji Variation Sequences: relaxing VS15/16

2016-08-18 Thread Christoph Päper
While VS1–VS14 (and VS17–256) are used for more or less arbitrary variant selection (VS4–VS14 are actually still unused), VS15 and VS16 have conventional de-facto semantics: select text style or emoji style. StandardizedVariants.txt explicitly claims, referring to Section 23.4 of Unicode 9,

Registry for Emoji Ligature and Metaphor Sequences

2016-08-18 Thread Christoph Päper
Situation - Ever since the advent of emojis in Unicode they have been combined with each other to convey new meaning in a number of ways, similar to what has been described for other (ancient) pictographic codes and had been the base for some proper scripts. Many combinations can be

Completing Emoji Sets: Animals and Zodiac Signs

2016-08-18 Thread Christoph Päper
Not least the TERIS proposal has taught me that, at least for emojis, Unicode’s encoding policy has changed from “show use, get character” to “show system with gap, get character”. Well, then, I’ve noticed several incomplete sets or systems in the emoji ranges. This is the first of a planned

Re: The Relationship of 1-Line Character Art and Emojis

2016-08-17 Thread Christoph Päper
Steven R. Loomis <s...@icu-project.org>: > El 8/16/16 4:22 PM, "Unicode en nombre de Christoph Päper": > >> is there any document issued by the Unicode Consortium which acknowledges a >> standard set of “short names” as used in :colon_codes:? There are

The Relationship of 1-Line Character Art and Emojis

2016-08-16 Thread Christoph Päper
Is a combination of characters solely for the combined glyphic appearance, which is frequently used over a long time, commonly considered a proof of demand for encoding additional symbols or emojis? Most Western emoticons have one or more conventional ASCII-only strings representing a

[UTR#51-8] 2.4 Emoji Implementation Notes

2016-08-12 Thread Christoph Päper
> … including the user of … Should be just “use”. > * emoji zwj sequence > - may have an emoji variation selector. > - should be displayed with an emoji presentation by default, even when an > emoji zwj element is

[UTR#51-8] 1.4.3 Emoji Variation Sequences: Female/Venus and Male/Mars Signs

2016-08-12 Thread Christoph Päper
> 2640 FE0E; text style; # FEMALE SIGN > 2640 FE0F; emoji style; # FEMALE SIGN > 2642 FE0E; text style; # MALE SIGN > 2642 FE0F; emoji style; # MALE SIGN Since U+240 and U+2642 double as symbols for the planets (and

Re: Ambiguous hyphenation cases with

2014-07-22 Thread Christoph Päper
fantasai fantasai.li...@inkedblade.net: The problem is that the hyphenation system in itself can't decide how to change the spelling, without any dictionary functionality. It can't know if I meant mat-tjuv (food thief in Swedish) or matt-tjuv (carpet thief) when I wrote matshy;tjuv. So

Re: Missing geometric shapes

2012-11-12 Thread Christoph Päper
Kent Karlsson: Den 2012-11-11 23:08, skrev Doug Ewell d...@ewellic.org: or turning Unicode into a standard for rating systems in general, Using digits, it already covers that. Here I agree. (Not sure why that branch of this tread is still ongoing...) Perhaps because today people usually

Re: Missing geometric shapes

2012-11-11 Thread Christoph Päper
Stephan Stiller (2012-11-07 21:35): HIGHEST RATING * HIGHER RATING+ HIGH RATING MID-HIGH RATING ***+ MEDIUM RATING*** MID-LOW RATING **+ LOW RATING ** LOWER RATING *+ LOWEST RATING* NO RATING This way font designers could choose

Re: Missing geometric shapes

2012-11-11 Thread Christoph Päper
Jean-François Colson (2012-11-09 10:26): Le 09/11/12 00:40, Philippe Verdy a écrit : For this one, would it be a greyed star (meaning no info, N/A) or the existing WHITE STAR for the minimum rating (the maximum rating being the BLACK STAR) ? Simply a black and white star. I intended NO

Re: Missing geometric shapes

2012-11-07 Thread Christoph Päper
Jörg Knappen: The reason is that I just was trying to show the rating on a webpage using the popular of 1 to 5 starts including half-coloured starts just using UNicode characters. BLACK AND WHITE STAR WHITE AND BLACK STAR In Dingbats, characters are mostly coded for their appearance,

Re: Too narrowly defined: DIVISION SIGN COLON

2012-07-10 Thread Christoph Päper
Leif Halvard Silli: * that the DIVISION SIGN in the (human) mathematical notation of at least one language (Norwegian) functions as a stylistically distinct MINUS sign. Ain’t that a stylistic, glyphic (i.e. font-dependent) variant of ‘⁒’ U+2052 Commercial Minus Sign, not a special use

Re: Code2000 on SourceForge

2012-02-03 Thread Christoph Päper
James Kass: License already included in SourceForge download, namely GPLv3. You probably want to use GPL+FE, i.e. GPL with font exception. http://en.wikipedia.org/wiki/GPL_font_exception

Re: Purpose of plain text

2011-11-15 Thread Christoph Päper
Doug Ewell: How can I search a group of documents, one written in Devanagari and another in Sinhala and another in Tamil and another in Oriya, for a given string if they all use the same encoding, and the only way to tell which is which is to see them rendered in a particular font? That

Re: tips on writing character proposal

2011-11-10 Thread Christoph Päper
Mark E. Shoulson: But Unicode isn't about encoding what would be neat to encode. It's about encoding _text_, (including things that have been encoded before). It’s sometimes non-obvious, though, what one should consider as text and what one should not, e.g. mathematic formulae. I would

Re: Continue: Glaring Mistake in the Code List of South Asian Script, Reply to Daug Ewell and Others

2011-09-12 Thread Christoph Päper
Delex, you are obviously confusing character sets, scripts, writing systems, orthographies, languages, peoples and names thereof (which may vary across languages and applications). NB: Some might argue that Unicode already distinguishes Indic scripts on a finer level than necessary, since

Re: ligature usage - WAS: How do we find out what assigned code points aren't normally used in text?

2011-09-12 Thread Christoph Päper
Szelp A. Szabolcs: Even if Dorfladen is not ambigous, it could be disturbing ‹Dorfladen› and ‹Auflage› certainly are disturbing. For the current German orthography, smart fonts should rather sport ligatures for double consonants, especially when followed by a third one of their kind in

Re: ligature usage - WAS: How do we find out what assigned code points aren't normally used in text?

2011-09-12 Thread Christoph Päper
Philippe Verdy: And it would be desirable to have a standardized CSS property for controling this default behavior in browsers. http://www.w3.org/TR/css3-fonts/#font-variant-ligatures-prop (In my opinion there would be better ways to spec this, though.)

Re: Code pages and Unicode

2011-08-19 Thread Christoph Päper
John H. Jenkins: there would have to be a *lot* of writing systems out there we don't know about to fill up planes 4 through 14 That’s quite possible, though, the universe is huge. The question rather is whether we will ever know about them. It’s quite possible we won’t.

Re: Unifon

2011-07-03 Thread Christoph Päper
Karl Pentzlin: Attached is a Unifon chart as used for Hupa, according to http://eric.ed.gov/PDFS/ED286691.pdf , p. 12. That’s it? Looks like diacritics to me, combined with some typographic preferences and a changed collation sequence perhaps. a/A with preferred typographic uppercase

Re: Draft Proposal to add Variation Sequences for Latin and Cyrillic letters

2010-08-03 Thread Christoph Päper
Karl Pentzlin: The proposed variation sequences simply provide a more general access to typographic details, which now can be accomplished by more complicated means like implementing locale-specific glyph selection within a font, and relying on a higher-level protocol supplying the

Re: Pashto yeh characters

2010-07-27 Thread Christoph Päper
David Starner: On Tue, Jul 27, 2010 at 12:43 PM, Andreas Prilop [U+0649] is no Arabic letter, but an Uighur letter. That's wrong, though. […] U+0649 must be an Arabic character; Andreas probably meant that U+0649 is not part of the Arabic writing system, i.e. the Arabic script as used in

Re: [indic] Indian Rupee symbol

2010-07-16 Thread Christoph Päper
Michael Everson: A proposal to add the character to the Unicode Standard and ISO/IEC 10646 was published yesterday. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n3862.pdf I’m not sure I’m getting this right. This is how I understand the issue: There is a rupee sign encoded at U+20A8. Its example

Re: Using Javascript to Detect Script Support in a Browser

2010-06-17 Thread Christoph Päper
Ed Trager: On Tue, Jun 15, 2010 at 5:52 PM, Marc Durdin P { font-family: Code2000, MyCode2000; } @font-face { font-family: MyCode2000; src: url('code2000.ttf'); } P { font-family: MyCode2000; } @font-face { font-family: MyCode2000; src: local(Code2000), url('code2000.ttf'); } (…)

  1   2   >