Re: Emoji end goal

2016-10-12 Thread Rebecca T
Agreed. I think a good response to “that’d _double_ the codepoints, so we should just add a ligature” is “if it would be such a burden to implement that you don’t want to use space in the charts for what are, fundamentally, hundreds of *semantically different* ideographs, why are we dumping that

Re: Emoji end goal

2016-10-12 Thread Rebecca T
Well, I think it’s definitely important to have representation and expression for people of all skin tones and genders even in things like emoji. I think we’re rapidly reaching a limit for variation sequences, and I’m personally not begging for hair color modifiers (although I would welcome

Re: Emoji end goal

2016-10-12 Thread Rebecca T
Sure, and kanji have romanisations but that doesn’t make the latin alphabet language neutral. And yes, emoji were supposed to be language neutral but all the implementers made them default to male. I think you have an *argument* with skin-tone neutrality but I think you’d be hard-pressed to find

Re: a character for an unknown character

2016-12-21 Thread Rebecca T
U+FFFD REPLACEMENT CHARACTER � On Wed, Dec 21, 2016 at 3:05 AM Philippe Verdy wrote: > there's a "replacement" control, whose rendering is undefined. It may > represent any missing part covering more than one character, such as parts > that have been burned, or overstrikken.

Re: Emoji Packs

2016-12-21 Thread Rebecca T
t; > On Tue, Dec 20, 2016 at 11:01 PM, Rebecca T <637...@gmail.com> wrote: > >> The charts include the images as inline base64, right?Parsing them out >> with Python might not be a bad idea, depending on how well-organized the >> HTML is. If you’re unfamiliar

Re: Emoji Packs

2016-12-22 Thread Rebecca T
do I just run process.py? > > I'm so sorry for being dumb, but thanks again! > > Chris > > On Wed, Dec 21, 2016 at 12:07 PM, Rebecca T <637...@gmail.com> wrote: > >> Okay, I threw something together. >> >> github.com/years/emoji has all 18,615 images from

Re: New tool unidump

2017-03-19 Thread Rebecca T
I maintain a list of various Unicode tools and resources at unicode.yea.rs and always welcome new additions! On Sat, Mar 18, 2017 at 1:42 AM, Janusz S. Bien wrote: > Quote/Cytat - Manuel Strehl (Fri 17 Mar 2017 > 09:44:15 PM CET): > > Hi, >> >> for

Combining solidus above for transcription of poetic meter

2017-03-17 Thread Rebecca T
When transcribing poetic meter (scansion ), it is common to use two symbols above the line (usually a breve [U+306 ̆] for stressed syllables and a solidus / slash [U+2F /] for unstressed syllables) to indicate stress patterns. Ex: ̆/ ̆ / ̆

Re: Proposal to add standardized variation sequences for chess notation

2017-04-01 Thread Rebecca T
> No chess symbols, encoded or proposed, are emoji, nor should they be. Except on Samsung .

Re: PETSCII mapping?

2017-04-06 Thread Rebecca T
Count me in! I’m partial for one large unified proposal, FWIW. On Thu, Apr 6, 2017 at 2:24 PM, Rebecca Bettencourt wrote: > On Thu, Apr 6, 2017 at 10:43 AM, Doug Ewell wrote: > >> Michael Everson wrote: >> >> > Everybody interested, raise your hand… >>

Re: Coloured Punctuation and Annotation

2017-04-05 Thread Rebecca T
> The hieroglyphs don't have the emoji property What does the emoji property mean, semantically? That the codepoint represents a pictograph or that vendors have “permission” to give it a colored, stylized representation? If we go with the first, then hieroglyphs should certainly be emoji.

Re: PETSCII mapping?

2017-04-05 Thread Rebecca T
The Wikipedia page for PETSCII [1] only marks 20 characters as not having Unicode equivalents; 2px (light) and 3px (heavy) horizontal and vertical bars at various non-center positions, diagonal shading characters, and corner characters. I’ve done some processing to the table on [1] to filter out

Re: Proposal to add standardized variation sequences for chess notation

2017-04-07 Thread Rebecca T
> while evidently there are users who need to send BROCCOLI to one another, > nobody but nobody needs to send an 8 x 8 chessboard matrix in a tweet. Get > it? I simply must disagree; sending a textual chessboard sounds awesome! A twitter bot that plays chess with you and shows you a graphical

Re: PETSCII mapping?

2017-04-05 Thread Rebecca T
> If there's a credible need to convert files between Unicode-based systems and > those using PETSCII There is! It’s called “sharing textual information” and it’s how our society functions. Can we afford to blithely abandon data from the best selling computer in history [1] because nobody cared

Re: PETSCII mapping?

2017-04-06 Thread Rebecca T
Here’s a copy of the Teletext character set; it includes box-drawing characters for all combinations of a 2×3 grid of cells. 2⁶ = 64 characters, so we might need a new block. [1]: http://www.galax.xyz/TELETEXT/CHARSET.HTM

Re: Unicode education in UK Schools

2017-07-08 Thread Rebecca T via Unicode
> That might be a good thing. Yeah. Very seriously, it’s very important to introduce Unicode early on in CS education, even in a “hey, it’s not OK to exclude people who don’t speak English or people whose names have diacritics from using the programs you create” sort of way. Ignorance and apathy

Re: unihan-etl: create exports of UNIHAN db to csv, json and yaml

2017-05-30 Thread Rebecca T via Unicode
Oh, thank god. I’ve wanted something like this for ages, but I’ve been too lazy to invest the time to create a serious tool — I’ve used a lot of messy one-time regular expressions. Will definitely be starring your repo!

Re: Can NFKC turn valid UAX 31 identifiers into non-identifiers?

2018-06-04 Thread Rebecca T via Unicode
I think that the benefits of inclusion from allowing non-ASCII identifiers far outweigh any corner cases this might cause. (Although ironing out and analyzing those is of course important, I don’t think they should be obstacles for implementing this kind of thing.) Something I’d love to see is

Re: Reminder Ribbon U+1F397

2018-06-23 Thread Rebecca T via Unicode
> > Perhaps a larger set of ribbon characters, with defined colors for > each, is called for? [image: crying pointing gun.jpg] But uh, seriously, the ribbon was encoded in the Great Wingdings and Webdings Migration of 2011 (see L2/12-368 p.21)

Re: Correct way to express in English that a string is encoded ... using UTF-8 ... with UTF-8 ... in UTF-8?

2019-05-15 Thread Rebecca T via Unicode
I think that colloquially “the file contains a UTF-8 string” is best, but perhaps not in more formal communications. On Wed, May 15, 2019, 7:24 AM Costello, Roger L. via Unicode < unicode@unicode.org> wrote: > Hello Unicode experts! > > Which is correct: > > (a) The input file contains a string.

Re: Unicode "no-op" Character?

2019-06-22 Thread Rebecca T via Unicode
Perhaps a codepoint from a private use area and another processing step to add/ remove them would work for you? On Sat, Jun 22, 2019, 1:39 AM Mark Davis ☕️ via Unicode wrote: > There nothing like what you are describing. Examples: > >1. Display — There are a few of the Default Ignorables