Re: Why do binary files contain text but text files don't contain binary?

2020-02-21 Thread Hans Åberg via Unicode
> On 21 Feb 2020, at 13:21, Costello, Roger L. via Unicode > wrote: > > There are binary files and there are text files. In C, when opening a file as binary with the function fopen, the newlines are untranslated [1]. If not using this option, the file is informally text, which means that

Re: What should or should not be encoded in Unicode? (from Re: Egyptian Hieroglyph Man with a Laptop)

2020-02-14 Thread Hans Åberg via Unicode
> On 13 Feb 2020, at 16:41, wjgo_10...@btinternet.com via Unicode > wrote: > > Yet a Private Use Area encoding at a particular code point is not unique. > Thus, except with care amongst people who are aware of the particular > encoding, there is no interoperability, such as with regular

Re: Egyptian Hieroglyph Man with a Laptop

2020-02-13 Thread Hans Åberg via Unicode
> On 13 Feb 2020, at 00:26, Shawn Steele wrote: > >> From the point of view of Unicode, it is simpler: If the character is in use >> or have had use, it should be included somehow. > > That bar, to me, seems too low. Many things are only used briefly or in a > private context that doesn't

Re: Egyptian Hieroglyph Man with a Laptop

2020-02-12 Thread Hans Åberg via Unicode
> On 12 Feb 2020, at 23:30, Michel Suignard via Unicode > wrote: > > These abstract collections have started to appear in the first part of the > nineteen century (Champollion starting in 1822). Interestingly these > collections have started to be useful on their own even if in some case

Re: Geological symbols

2020-01-14 Thread Hans Åberg via Unicode
For rendering, you might have a look at ConTeXt, because I recall it has an option whereby Unicode super- and sub-scripts can be displayed over each other without extra processing. > On 14 Jan 2020, at 06:44, via Unicode wrote: > > Thanks for your reply. I think actually LaTeX is not a good

Re: Pure Regular Expression Engines and Literal Clusters

2019-10-14 Thread Hans Åberg via Unicode
> On 14 Oct 2019, at 02:10, Richard Wordingham via Unicode > wrote: > > On Mon, 14 Oct 2019 00:22:36 +0200 > Hans Åberg via Unicode wrote: > >>> On 13 Oct 2019, at 23:54, Richard Wordingham via Unicode >>> wrote: > >>> Besides invalidat

Re: Pure Regular Expression Engines and Literal Clusters

2019-10-13 Thread Hans Åberg via Unicode
> On 13 Oct 2019, at 23:54, Richard Wordingham via Unicode > wrote: > > The point about these examples is that the estimate of one state per > character becomes a severe underestimate. For example, after > processing 20 a's, the NFA for /[ab]{0,20}[ac]{10,20}[ad]{0,20}e/ can > be in any of

Re: Pure Regular Expression Engines and Literal Clusters

2019-10-13 Thread Hans Åberg via Unicode
> On 13 Oct 2019, at 21:17, Richard Wordingham via Unicode > wrote: > > On Sun, 13 Oct 2019 15:29:04 +0200 > Hans Åberg via Unicode wrote: > >>> On 13 Oct 2019, at 15:00, Richard Wordingham via Unicode >>> I'm now beginning to wonder what you are

Re: Pure Regular Expression Engines and Literal Clusters

2019-10-13 Thread Hans Åberg via Unicode
> On 13 Oct 2019, at 15:00, Richard Wordingham via Unicode > wrote: > >>> On Sat, 12 Oct 2019 21:36:45 +0200 >>> Hans Åberg via Unicode wrote: >>> >>>>> On 12 Oct 2019, at 14:17, Richard Wordingham via Unicode >>>>> wrote: &

Re: Pure Regular Expression Engines and Literal Clusters

2019-10-13 Thread Hans Åberg via Unicode
> On 13 Oct 2019, at 00:37, Richard Wordingham via Unicode > wrote: > > On Sat, 12 Oct 2019 21:36:45 +0200 > Hans Åberg via Unicode wrote: > >>> On 12 Oct 2019, at 14:17, Richard Wordingham via Unicode >>> wrote: >>> >>> But remember t

Re: Pure Regular Expression Engines and Literal Clusters

2019-10-12 Thread Hans Åberg via Unicode
> On 12 Oct 2019, at 14:17, Richard Wordingham via Unicode > wrote: > > But remember that 'having longer first' is meaningless for a > non-deterministic finite automaton that does a single pass through the > string to be searched. It is possible to identify all submatches deterministically

Re: Symbols of colors used in Portugal for transport

2019-04-30 Thread Hans Åberg via Unicode
> On 30 Apr 2019, at 04:32, Mark E. Shoulson via Unicode > wrote: > > On 4/29/19 3:34 PM, Doug Ewell via Unicode wrote: >> Hans Åberg wrote: >> >>> The guy who made the artwork for Heroes is completely color-blind, >>> seeing only in a gra

Re: Symbols of colors used in Portugal for transport

2019-04-29 Thread Hans Åberg via Unicode
> On 29 Apr 2019, at 21:34, Doug Ewell wrote: > > Hans Åberg wrote: > >> The guy who made the artwork for Heroes is completely color-blind, >> seeing only in a grayscale, so they agreed he coded the colors in >> black and white, and then that was replace

Re: Symbols of colors used in Portugal for transport

2019-04-29 Thread Hans Åberg via Unicode
> On 29 Apr 2019, at 20:02, Doug Ewell via Unicode wrote: > > Philippe Verdy wrote: > >> A very useful think to add to Unicode (for colorblind people) ! >> >> http://bestinportugal.com/color-add-project-brings-color-identification-to-the-color-blind >> >> Is it proposed to add as new

Re: MODIFIER LETTER SMALL GREEK PHI in Calibri is wrong.

2019-04-17 Thread Hans Åberg via Unicode
You are possibly both right, because it is OK in the web font but wrong in the desktop font. > On 17 Apr 2019, at 23:53, Oren Watson via Unicode wrote: > > You can easily reproduce this by going here: > https://www.fonts.com/font/microsoft-corporation/calibri/regular > and putting in the

Re: A last missing link for interoperable representation

2019-01-15 Thread Hans Åberg via Unicode
> On 15 Jan 2019, at 02:18, Richard Wordingham via Unicode > wrote: > > On Mon, 14 Jan 2019 16:02:05 -0800 > Asmus Freytag via Unicode wrote: > >> On 1/14/2019 3:37 PM, Richard Wordingham via Unicode wrote: >> On Tue, 15 Jan 2019 00:02:49 +0100 >> Hans Åb

Re: A last missing link for interoperable representation

2019-01-14 Thread Hans Åberg via Unicode
> On 14 Jan 2019, at 23:43, James Kass via Unicode wrote: > > Hans Åberg wrote, > > > How about using U+0301 COMBINING ACUTE ACCENT: 푝푎푠푠푒́ > > Thought about using a combining accent. Figured it would just display with a > dotted circle but neglected to try

Re: A last missing link for interoperable representation

2019-01-14 Thread Hans Åberg via Unicode
> On 13 Jan 2019, at 22:43, Khaled Hosny via Unicode > wrote: > > LaTeX with the > “unicode-math” package will translate ASCII + font switches to the > respective Unicode math alphanumeric characters. Word will do the same. > Even browsers rendering MathML will do the same (though most likely

Re: A last missing link for interoperable representation

2019-01-14 Thread Hans Åberg via Unicode
> On 14 Jan 2019, at 06:08, James Kass via Unicode wrote: > > 퐴푟푡 푛표푢푣푒푎푢 seems a bit 푝푎푠푠é nowadays, as well. > > (Had to use mark-up for that “span” of a single letter in order to indicate > the proper letter form. But the plain-text display looks crazy with that > HTML jive in it.) How

Re: Update to the second question summary (was: A sign/abbreviation for "magister")

2018-12-02 Thread Hans Åberg via Unicode
> On 2 Dec 2018, at 20:29, Janusz S. Bień via Unicode > wrote: > > On Sun, Dec 02 2018 at 10:33 +0100, Hans Åberg via Unicode wrote: >> >> It was common in the 1800s to singly and doubly underline superscript >> abbreviations in handwriting according t

Re: A sign/abbreviation for "magister"

2018-12-02 Thread Hans Åberg via Unicode
> On 30 Oct 2018, at 22:50, Ken Whistler via Unicode > wrote: > > On 10/30/2018 2:32 PM, James Kass via Unicode wrote: >> but we can't seem to agree on how to encode its abbreviation. > > For what it's worth, "mgr" seems to be the usual abbreviation in Polish for > it. It was common in

Re: Aleph-umlaut

2018-11-11 Thread Hans Åberg via Unicode
> On 12 Nov 2018, at 00:00, Asmus Freytag (c) wrote: > > On 11/11/2018 1:37 PM, Hans Åberg wrote: >>> On 11 Nov 2018, at 22:16, Asmus Freytag via Unicode >>> wrote: >>> >>> On 11/11/2018 12:32 PM, Hans Åberg via Unicode wrote: >>>

Re: Aleph-umlaut

2018-11-11 Thread Hans Åberg via Unicode
> On 11 Nov 2018, at 22:16, Asmus Freytag via Unicode > wrote: > > On 11/11/2018 12:32 PM, Hans Åberg via Unicode wrote: >> >>> On 11 Nov 2018, at 07:03, Beth Myre via Unicode >>> wrote: >>> >>> Hi Mark, >>> >>&

Re: Aleph-umlaut

2018-11-11 Thread Hans Åberg via Unicode
> On 11 Nov 2018, at 07:03, Beth Myre via Unicode wrote: > > Hi Mark, > > This is a really cool find, and it's interesting that you might have a > relative mentioned in it. After looking at it more, I'm more convinced that > it's German written in Hebrew letters, not Yiddish. I think

Re: A sign/abbreviation for "magister"

2018-10-30 Thread Hans Åberg via Unicode
> On 30 Oct 2018, at 22:50, Ken Whistler via Unicode > wrote: > > On 10/30/2018 2:32 PM, James Kass via Unicode wrote: >> but we can't seem to agree on how to encode its abbreviation. > > For what it's worth, "mgr" seems to be the usual abbreviation in Polish for > it. That seems to be

Re: Unicode String Models

2018-09-12 Thread Hans Åberg via Unicode
> On 12 Sep 2018, at 04:34, Eli Zaretskii via Unicode > wrote: > >> Date: Wed, 12 Sep 2018 00:13:52 +0200 >> Cc: unicode@unicode.org >> From: Hans Åberg via Unicode >> >> It might be useful to represent non-UTF-8 bytes as Unicode code points.

Re: Unicode String Models

2018-09-11 Thread Hans Åberg via Unicode
> On 11 Sep 2018, at 23:48, Richard Wordingham via Unicode > wrote: > > On Tue, 11 Sep 2018 21:10:03 +0200 > Hans Åberg via Unicode wrote: > >> Indeed, before UTF-8, in the 1990s, I recall some Russians using >> LaTeX files with sections in differen

Re: Unicode String Models

2018-09-11 Thread Hans Åberg via Unicode
> On 11 Sep 2018, at 20:40, Eli Zaretskii wrote: > >> From: Hans Åberg >> Date: Tue, 11 Sep 2018 20:14:30 +0200 >> Cc: hsivo...@hsivonen.fi, >> unicode@unicode.org >> >> If one encounters a file with mixed encodings, it is good to be able to view

Re: Unicode String Models

2018-09-11 Thread Hans Åberg via Unicode
> On 11 Sep 2018, at 19:21, Eli Zaretskii wrote: > >> From: Hans Åberg >> Date: Tue, 11 Sep 2018 19:13:28 +0200 >> Cc: Henri Sivonen , >> unicode@unicode.org >> >>> In Emacs, each raw byte belonging >>> to a byte sequence which is invalid

Re: Unicode String Models

2018-09-11 Thread Hans Åberg via Unicode
> On 11 Sep 2018, at 13:13, Eli Zaretskii via Unicode > wrote: > > In Emacs, each raw byte belonging > to a byte sequence which is invalid under UTF-8 is represented as a > special multibyte sequence. IOW, Emacs's internal representation > extends UTF-8 with multibyte sequences it uses to

Re: Unicode String Models

2018-09-10 Thread Hans Åberg via Unicode
> On 9 Sep 2018, at 21:20, Eli Zaretskii via Unicode > wrote: > > In Emacs, the gap is always where the text is inserted or deleted, be > it in the middle of text or at its end. > >> All editors I have seen treat the text as ordered collections of small >> buffers (these small buffers may

Re: Can NFKC turn valid UAX 31 identifiers into non-identifiers?

2018-06-08 Thread Hans Åberg via Unicode
> On 8 Jun 2018, at 11:07, Henri Sivonen via Unicode > wrote: > > My question is: > > When designing a syntax where tokens with the user-chosen characters > can't occur next to each other without some syntax-reserved characters > between them, what advantages are there from limiting the

Re: Can NFKC turn valid UAX 31 identifiers into non-identifiers?

2018-06-07 Thread Hans Åberg via Unicode
Now that the distinction is possible, it is recommended to do that. My original question was directed to the OP, whether it is deliberate. And they are confusables only to those not accustomed to it. > On 7 Jun 2018, at 12:05, Philippe Verdy wrote: > > In my opinion the usual constant is

Re: Can NFKC turn valid UAX 31 identifiers into non-identifiers?

2018-06-07 Thread Hans Åberg via Unicode
> On 7 Jun 2018, at 03:56, Asmus Freytag via Unicode > wrote: > > On 6/6/2018 2:25 PM, Hans Åberg via Unicode wrote: >>> On 4 Jun 2018, at 21:49, Manish Goregaokar via Unicode >>> wrote: >>> >>> The Rust community is considering adding

Re: Can NFKC turn valid UAX 31 identifiers into non-identifiers?

2018-06-06 Thread Hans Åberg via Unicode
> On 4 Jun 2018, at 21:49, Manish Goregaokar via Unicode > wrote: > > The Rust community is considering adding non-ascii identifiers, which follow > UAX #31 (XID_Start XID_Continue*, with tweaks). The proposal also asks for > identifiers to be treated as equivalent under NFKC. So, in this

Re: Uppercase ß

2018-05-29 Thread Hans Åberg via Unicode
> On 29 May 2018, at 14:47, Arthur Reutenauer wrote: > >> The main point is what users of ẞ and ß would think, and Unicode to adjust >> accordingly. > > Since users of ß would think that in the vast majority of cases, it > ought to be uppercased to SS, I think you’re missing the main point.

Re: Uppercase ß

2018-05-29 Thread Hans Åberg via Unicode
> On 29 May 2018, at 12:55, Arthur Reutenauer wrote: > >> If uppercasing is not common, one would think that setting it too ẞ would >> pose no problems, no that it is available. > > It would, for reasons of stability. The main point is what users of ẞ and ß would think, and Unicode to

Re: Uppercase ß

2018-05-29 Thread Hans Åberg via Unicode
> On 29 May 2018, at 11:17, Werner LEMBERG wrote: > >> When looking for the lowercase ß LATIN SMALL LETTER SHARP S U+00DF >> in a MacOS Character Viewer, it does not give the uppercase version, >> for some reason. > > Yes, and it will stay so, AFAIK. The uppercase variant of `ß' is > `SS'.

Re: Uppercase ß

2018-05-29 Thread Hans Åberg via Unicode
> On 29 May 2018, at 10:54, Martin J. Dürst wrote: > > On 2018/05/29 17:15, Hans Åberg via Unicode wrote: >>> On 29 May 2018, at 07:30, Asmus Freytag via Unicode >>> wrote: > >>> An uppercase exists and it has formally been ruled as acceptable way to

Re: Uppercase ß

2018-05-29 Thread Hans Åberg via Unicode
> On 29 May 2018, at 07:30, Asmus Freytag via Unicode > wrote: > > On 5/28/2018 6:30 AM, Hans Åberg via Unicode wrote: >>> Unifying these would make a real mess of lower casing! >>> >> German has a special sign ß for "ss", without upper capi

Re: Unicode characters unification

2018-05-28 Thread Hans Åberg via Unicode
> On 28 May 2018, at 21:38, Richard Wordingham > wrote: > > On Mon, 28 May 2018 21:14:58 +0200 > Hans Åberg via Unicode wrote: > >>> On 28 May 2018, at 21:01, Richard Wordingham via Unicode >>> wrote: >>> >>> On Mon, 28 May

Re: Unicode characters unification

2018-05-28 Thread Hans Åberg via Unicode
> On 28 May 2018, at 21:01, Richard Wordingham via Unicode > wrote: > > On Mon, 28 May 2018 20:19:09 +0200 > Hans Åberg via Unicode wrote: > >> Indistinguishable math styles Latin and Greek uppercase letters have >> been added, even though that was not s

Re: Unicode characters unification

2018-05-28 Thread Hans Åberg via Unicode
> On 28 May 2018, at 19:18, Richard Wordingham via Unicode > wrote: > > On Mon, 28 May 2018 17:54:47 +0200 > Hans Åberg via Unicode wrote: > >>> On 28 May 2018, at 17:00, Richard Wordingham via Unicode >>> wrote: >>> >>> On Mon, 28 May

Re: Unicode characters unification

2018-05-28 Thread Hans Åberg via Unicode
> On 28 May 2018, at 17:00, Richard Wordingham via Unicode > <unicode@unicode.org> wrote: > > On Mon, 28 May 2018 15:30:55 +0200 > Hans Åberg via Unicode <unicode@unicode.org> wrote: > >>> On 28 May 2018, at 15:10, Richard Wordingham via Unicode >>&

Re: preliminary proposal: New Unicode characters for Arabic music half-flat and half-sharp symbols

2018-05-28 Thread Hans Åberg via Unicode
> On 28 May 2018, at 11:05, Julian Bradfield via Unicode <unicode@unicode.org> > wrote: > > On 2018-05-28, Hans Åberg via Unicode <unicode@unicode.org> wrote: >>> On 28 May 2018, at 03:39, Garth Wallace <gwa...@gmail.com> wrote: >>>>

Re: preliminary proposal: New Unicode characters for Arabic music half-flat and half-sharp symbols

2018-05-28 Thread Hans Åberg via Unicode
> On 28 May 2018, at 03:39, Garth Wallace <gwa...@gmail.com> wrote: > >> On Sun, May 27, 2018 at 3:36 PM, Hans Åberg <haber...@telia.com> wrote: >> The flats and sharps of Arabic music are semantically the same as in Western >> music, departing from Pythagor

Re: preliminary proposal: New Unicode characters for Arabic music half-flat and half-sharp symbols

2018-05-27 Thread Hans Åberg via Unicode
work could be used to denote someone whistling, without actually encoding > any specific tone, or rythmic). > > > 2018-05-17 17:48 GMT+02:00 Hans Åberg via Unicode <unicode@unicode.org>: > > > > On 17 May 2018, at 16:47, Garth Wallace via Unicode <unicode@

Re: preliminary proposal: New Unicode characters for Arabic music half-flat and half-sharp symbols

2018-05-17 Thread Hans Åberg via Unicode
> On 17 May 2018, at 16:47, Garth Wallace via Unicode <unicode@unicode.org> > wrote: > > On Thu, May 17, 2018 at 12:41 AM Hans Åberg <haber...@telia.com> wrote: > > > On 17 May 2018, at 08:47, Garth Wallace via Unicode <unicode@unicode.org> > > w

Re: preliminary proposal: New Unicode characters for Arabic music half-flat and half-sharp symbols

2018-05-17 Thread Hans Åberg via Unicode
> On 17 May 2018, at 08:47, Garth Wallace via Unicode <unicode@unicode.org> > wrote: > >> On Wed, May 16, 2018 at 12:42 AM, Hans Åberg via Unicode >> <unicode@unicode.org> wrote: >> >> It would be best to encode the SMuFL symbols, which is rather c

Re: preliminary proposal: New Unicode characters for Arabic music half-flat and half-sharp symbols

2018-05-16 Thread Hans Åberg via Unicode
> On 16 May 2018, at 09:42, Hans Åberg via Unicode <unicode@unicode.org> wrote: > >> On 16 May 2018, at 00:48, Ken Whistler via Unicode <unicode@unicode.org> >> wrote: >> >>> A proposal should also show evidence of usage and glyph

Re: preliminary proposal: New Unicode characters for Arabic music half-flat and half-sharp symbols

2018-05-16 Thread Hans Åberg via Unicode
> On 16 May 2018, at 00:48, Ken Whistler via Unicode > wrote: > > On 5/15/2018 2:46 PM, Markus Scherer via Unicode wrote: >> I am proposing the addition of 2 new characters to the Musical Symbols table: >> >> - the half-flat sign (lowers a note by a quarter tone) >> -

Re: Feedback on the proposal to change U+FFFD generation when decoding ill-formed UTF-8

2017-05-18 Thread Hans Åberg via Unicode
> On 16 May 2017, at 15:21, Richard Wordingham via Unicode > <unicode@unicode.org> wrote: > > On Tue, 16 May 2017 14:44:44 +0200 > Hans Åberg via Unicode <unicode@unicode.org> wrote: > >>> On 15 May 2017, at 12:21, Henri Sivonen via Unicode >>> &

Re: Feedback on the proposal to change U+FFFD generation when decoding ill-formed UTF-8

2017-05-17 Thread Hans Åberg via Unicode
> On 17 May 2017, at 23:18, Doug Ewell <d...@ewellic.org> wrote: > > Hans Åberg wrote: > >>> Far from solving the stated problem, it would introduce a new one: >>> conversion from the "bad data" Unicode code points, currently >>> well-def

Re: Feedback on the proposal to change U+FFFD generation when decoding ill-formed UTF-8

2017-05-17 Thread Hans Åberg via Unicode
> On 17 May 2017, at 22:36, Doug Ewell via Unicode <unicode@unicode.org> wrote: > > Hans Åberg wrote: > >> It would be useful, for use with filesystems, to have Unicode >> codepoint markers that indicate how UTF-8, including non-valid >> sequences

Re: Feedback on the proposal to change U+FFFD generation when decoding ill-formed UTF-8

2017-05-16 Thread Hans Åberg via Unicode
> On 16 May 2017, at 20:01, Philippe Verdy wrote: > > On Windows NTFS (and LFN extension of FAT32 and exFAT) at least, random > sequences of 16-bit code units are not permitted. There's visibly a > validation step that returns an error if you attempt to create files with

Re: Feedback on the proposal to change U+FFFD generation when decoding ill-formed UTF-8

2017-05-16 Thread Hans Åberg via Unicode
> On 16 May 2017, at 18:38, Alastair Houghton <alast...@alastairs-place.net> > wrote: > > On 16 May 2017, at 17:23, Hans Åberg <haber...@telia.com> wrote: >> >> HFS implements case insensitivity in a layer above the filesystem raw >> functions.

Re: Feedback on the proposal to change U+FFFD generation when decoding ill-formed UTF-8

2017-05-16 Thread Hans Åberg via Unicode
> On 16 May 2017, at 18:13, Alastair Houghton <alast...@alastairs-place.net> > wrote: > > On 16 May 2017, at 17:07, Hans Åberg <haber...@telia.com> wrote: >> >>>>> HFS(+), NTFS and VFAT long filenames are all encoded in some variation on >

Re: Feedback on the proposal to change U+FFFD generation when decoding ill-formed UTF-8

2017-05-16 Thread Hans Åberg via Unicode
> On 16 May 2017, at 17:52, Alastair Houghton <alast...@alastairs-place.net> > wrote: > > On 16 May 2017, at 16:44, Hans Åberg <haber...@telia.com> wrote: >> >> On 16 May 2017, at 17:30, Alastair Houghton via Unicode >> <unicode@unicode.or

Re: Feedback on the proposal to change U+FFFD generation when decoding ill-formed UTF-8

2017-05-16 Thread Hans Åberg via Unicode
> On 16 May 2017, at 17:30, Alastair Houghton via Unicode <unicode@unicode.org> > wrote: > > On 16 May 2017, at 14:23, Hans Åberg via Unicode <unicode@unicode.org> wrote: >> >> You don't. You have a filename, which is a octet sequence of un

Re: Feedback on the proposal to change U+FFFD generation when decoding ill-formed UTF-8

2017-05-16 Thread Hans Åberg via Unicode
> On 16 May 2017, at 15:00, Philippe Verdy <verd...@wanadoo.fr> wrote: > > 2017-05-16 14:44 GMT+02:00 Hans Åberg via Unicode <unicode@unicode.org>: > > > On 15 May 2017, at 12:21, Henri Sivonen via Unicode <unicode@unicode.org> > > wrote: > ... >

Re: Feedback on the proposal to change U+FFFD generation when decoding ill-formed UTF-8

2017-05-16 Thread Hans Åberg via Unicode
> On 15 May 2017, at 12:21, Henri Sivonen via Unicode > wrote: ... > I think Unicode should not adopt the proposed change. It would be useful, for use with filesystems, to have Unicode codepoint markers that indicate how UTF-8, including non-valid sequences, is translated

Re: How to Add Beams to Notes

2017-05-05 Thread Hans Åberg via Unicode
> On 1 May 2017, at 21:12, Michael Bear via Unicode wrote: > > I am trying to make a music notation font. It will use the Musical Symbols > block in Unicode (1D100-1D1FF), but, since that block has a bad rep for not > being very complete, I added some extra characters...

Re: Diaeresis vs. umlaut (was: Re: Standaridized variation sequences for the Desert alphabet?)

2017-03-24 Thread Hans Åberg
> On 24 Mar 2017, at 19:33, Doug Ewell wrote: > > Philippe Verdy wrote: > >> But Unicode just prefered to keep the roundtrip compatiblity with >> earlier 8-bit encodings (including existing ISO 8859 and DIN >> standards) so that "ü" in German and French also have the same >>

Re: Northern Khmer on iPhone

2017-02-28 Thread Hans Åberg
> On 28 Feb 2017, at 22:00, Richard Wordingham > wrote: > > On Tue, 28 Feb 2017 07:37:10 + > Richard Wordingham wrote: > >> Does iPhone support the use of Northern Khmer in Thai script? I would >> count an interface in

Re: Encoding West African Adinkra sysmbols

2017-01-22 Thread Hans Åberg
> On 22 Jan 2017, at 18:21, Michael Everson wrote: > > Are they used in plain text? How? On textiles and walls in a similar fashion as emoji, it seems [1]. Known since the beginning of the 19th century. 1. https://en.wikipedia.org/wiki/Adinkra_symbols

Re: Why incomplete subscript/superscript alphabet ?

2016-10-10 Thread Hans Åberg
> On 10 Oct 2016, at 23:39, Doug Ewell <d...@ewellic.org> wrote: > > Hans Åberg wrote: > >>>>> What do you mean? The IPA in narrow transcription is intended to >>>>> provide as detailed a description as a human mind can manage of >

Re: Why incomplete subscript/superscript alphabet ?

2016-10-10 Thread Hans Åberg
> On 10 Oct 2016, at 23:01, Julian Bradfield <jcb+unic...@inf.ed.ac.uk> wrote: > > On 2016-10-10, Hans Åberg <haber...@telia.com> wrote: >>> On 10 Oct 2016, at 22:15, Julian Bradfield <jcb+unic...@inf.ed.ac.uk> wrote: >>> What do you mean

Re: Why incomplete subscript/superscript alphabet ?

2016-10-10 Thread Hans Åberg
> On 10 Oct 2016, at 22:31, Julian Bradfield <jcb+unic...@inf.ed.ac.uk> wrote: > > On 2016-10-10, Hans Åberg <haber...@telia.com> wrote: >> It is possible to write math just using ASCII and TeX, which was the >> original idea of TeX. Is that want you want

Re: Why incomplete subscript/superscript alphabet ?

2016-10-10 Thread Hans Åberg
> On 10 Oct 2016, at 22:15, Julian Bradfield <jcb+unic...@inf.ed.ac.uk> wrote: > > On 2016-10-10, Hans Åberg <haber...@telia.com> wrote: >>> On 10 Oct 2016, at 21:42, Doug Ewell <d...@ewellic.org> wrote: >>> Hans Åberg wrote: >>>&

Re: Why incomplete subscript/superscript alphabet ?

2016-10-10 Thread Hans Åberg
> On 10 Oct 2016, at 21:43, Julian Bradfield wrote: > Linguists aren't stupid, and they have no need for plain text > representations of all their symbology. Linguists write in Word or > LaTeX (or sometimes HTML), all of which can produce a wide range of > symbols

Re: Why incomplete subscript/superscript alphabet ?

2016-10-10 Thread Hans Åberg
> On 10 Oct 2016, at 21:42, Doug Ewell <d...@ewellic.org> wrote: > > Hans Åberg wrote: > >> I think that IPA might be designed for broad phonetic transcriptions >> [1], with a requirement to distinguish phonemes within each given >> language. >

Re: Why incomplete subscript/superscript alphabet ?

2016-10-10 Thread Hans Åberg
> On 10 Oct 2016, at 15:24, Julian Bradfield <jcb+unic...@inf.ed.ac.uk> wrote: > > On 2016-10-10, Hans Åberg <haber...@telia.com> wrote: >> I think that IPA might be designed for broad phonetic transcriptions >> [1], with a requirement to distinguish phone

Re: Why incomplete subscript/superscript alphabet ?

2016-10-10 Thread Hans Åberg
> On 10 Oct 2016, at 03:13, Doug Ewell wrote: > > Denis Jacquerye wrote: > >> Regarding the superscript q, in some rare cases, it is used to >> indicate pharyngealization or a pharyngeal consonant instead of the >> Latin letter pharyngeal voiced fricative U+0295 ʕ, the

Re: Bit arithmetic on Unicode characters?

2016-10-09 Thread Hans Åberg
> On 9 Oct 2016, at 13:00, Mark Davis ☕️ wrote: > > Essentially all of the game pieces that are in Unicode were added for > compatibility with existing character sets. ​I'm guessing that ​there are > hundreds to thousands of possible other symbols associated with games in

Re: Why incomplete subscript/superscript alphabet ?

2016-10-08 Thread Hans Åberg
> On 8 Oct 2016, at 12:03, Julian Bradfield wrote: > > I happen to think the whole math alphabet thing was a dumb > mistake. They are useful in mathematics, but other sciences may not use them. > But even if it isn't - and incidentally in some communities > there is

Re: Bit arithmetic on Unicode characters?

2016-10-07 Thread Hans Åberg
> On 7 Oct 2016, at 18:06, Doug Ewell wrote: > I can't find anything in the UCD that distinguishes one "font variant" > from another (UnicodeData.txt shown as an example): > > 1D400;MATHEMATICAL BOLD CAPITAL A;Lu;0;L; 0041N; > 1D434;MATHEMATICAL ITALIC CAPITAL

Re: Bit arithmetic on Unicode characters?

2016-10-07 Thread Hans Åberg
> On 7 Oct 2016, at 09:27, Garth Wallace wrote: > > Unicode doesn't really address chess piece properties like white/black beyond > naming conventions. >From the formal point of view, Unicode only assigns character numbers (code >points), which gets a binary representation

Re: Why incomplete subscript/superscript alphabet ?

2016-10-01 Thread Hans Åberg
> On 1 Oct 2016, at 15:48, lorieul wrote: > Indeed Latex formulas are often not easy to > decypher… One can improve readability by using more Unicode characters [1] and the unicode-math package [2], or switching to ConTeXt ,

Re: Math upright Latin and Greek styles

2016-05-16 Thread Hans Åberg
> On 16 May 2016, at 18:56, Philippe Verdy wrote: > > I do not advocate changing that, but these legacy *TeX variants have their > own builtin sets of supported fonts with their implicit style and use them > with the normal letters, just like what is done in HTML when you

Re: Math upright Latin and Greek styles

2016-05-16 Thread Hans Åberg
> On 16 May 2016, at 03:30, Philippe Verdy wrote: > > isn't it specified in TeX using a font selection package instead of the > default one? Also the only upright letters I saw was for inserting normal > text (not mathematical symbols) or comments/descriptions, or when

Re: Math upright Latin and Greek styles

2016-05-15 Thread Hans Åberg
> On 16 May 2016, at 00:05, Murray Sargent <murr...@exchange.microsoft.com> > wrote: > > Hans Åberg mentioned "Changing Basic Latin and Greek to upright does not seem > practical, due to legacy and lack of efficient input methods." > > Have to say that it

Re: Math upright Latin and Greek styles

2016-05-15 Thread Hans Åberg
> On 15 May 2016, at 23:19, Murray Sargent <murr...@exchange.microsoft.com> > wrote: > > Hans Åberg asked, ”Are there any plans to add math upright Latin and Greek > styles, in order to distinguish them from regular (non-math) Latin and Greek? > —In programs like TeX,

Re: Unicode in passwords

2015-09-30 Thread Hans Åberg
> On 30 Sep 2015, at 18:33, John O'Conner wrote: > > Can you recommend any documents to help me understand potential issues (if > any) for password policies and validation methods that allow characters from > more "exotic" portions of the Unicode space? On UNIX

Re: Chess symbol glyphs in code charts

2015-08-14 Thread Hans Åberg
On 14 Aug 2015, at 20:31, Garth Wallace gwa...@gmail.com wrote: Can anyone tell me what font is used for the chess symbols in the code chart for the Miscellaneous Symbols block? It looks a lot like Chess Merida but I can't be certain. They are quite close to Apple Symbols, but not exactly