Re: Corner cases (was: Re: UTF-16 Encoding Scheme and U+FFFE)

2014-06-05 Thread Philippe Verdy
2014-06-05 0:48 GMT+02:00 Doug Ewell d...@ewellic.org: If you are processing arbitrary fragments of a stream, without knowledge of preceding fragments, as in this example, then you have no business making *any* changes to that fragment based on interpretation of that fragment as Unicode text.

Re: Math input methods

2014-06-05 Thread Hans Aberg
On 5 Jun 2014, at 04:50, David Starner prosfil...@gmail.com wrote: On Wed, Jun 4, 2014 at 6:00 AM, Jukka K. Korpela jkorp...@cs.tut.fi wrote: The change is logical in the sense that bold face is a more original notation and double-struck letters as characters imitate the imitation of boldface

Re: Swift

2014-06-05 Thread J. Leslie Turriff
On Wednesday 04 June 2014 10:53:59 Shawn Steele wrote: I’m sort of confused why Unicode would be a big deal. C# other languages have allowed unicode letters in identifiers for years, so readable strings should be possible in almost any language. It’s a bit cute to include emoji, but I’m not

Re: Swift

2014-06-05 Thread David Starner
On Thu, Jun 5, 2014 at 3:04 AM, J. Leslie Turriff jlturr...@centurylink.net wrote: What I find interesting is that (with the possible exception of Ada) I don't think that any of the commonly used languages allow for the use of Unicode characters for non- user-defined tokens (i.e.

Re: Swift

2014-06-05 Thread Frédéric Grosshans
Le 05/06/2014 12:52, David Starner a écrit : On Thu, Jun 5, 2014 at 3:04 AM, J. Leslie Turriff jlturr...@centurylink.net wrote: What I find interesting is that (with the possible exception of Ada) I don't think that any of the commonly used languages allow for the use of Unicode

Re: Swift

2014-06-05 Thread Martin v. Löwis
Am 04.06.14 11:28, schrieb Andre Schappo: The restrictions seem a little like IDNA2008. Anyone have links to info giving a detailed explanation/tabulation of allowed and non allowed Unicode chars for Swift Variable and Constant names? The language reference is at

Re: Swift

2014-06-05 Thread Jeff Senn
Has anyone figured out whether character sequences that are non-canonical (de)compositions but could be recomposed to the same result are the same identifier or not? That is: are identifiers merely sequences of characters or intended to be comparable as “Unicode strings” (under some sort of

Re: Swift

2014-06-05 Thread Mark Davis ☕️
I haven't done any analysis, but on first glance it looks like it is based on http://www.unicode.org/reports/tr31/#Alternative_Identifier_Syntax Mark https://google.com/+MarkDavis *— Il meglio è l’inimico del bene —* On Thu, Jun 5, 2014 at 5:46 PM, Jeff Senn s...@maya.com wrote: Has

Re: Swift

2014-06-05 Thread Hans Aberg
On 5 Jun 2014, at 17:46, Jeff Senn s...@maya.com wrote: That is: are identifiers merely sequences of characters or intended to be comparable as “Unicode strings” (under some sort of compatibility rule)? In computer languages, identifiers are normally compared only for equality, as it reduces

Re: Swift

2014-06-05 Thread Jeff Senn
On Jun 5, 2014, at 12:41 PM, Hans Aberg haber...@telia.com wrote: On 5 Jun 2014, at 17:46, Jeff Senn s...@maya.com wrote: That is: are identifiers merely sequences of characters or intended to be comparable as “Unicode strings” (under some sort of compatibility rule)? In computer

Re: Corner cases (was: Re: UTF-16 Encoding Scheme and U+FFFE)

2014-06-05 Thread Richard Wordingham
On Thu, 5 Jun 2014 09:41:07 +0200 Philippe Verdy verd...@wanadoo.fr wrote: You'll probably want to sync on the first newline control and then proceed from that point. But now if you have those devices configured heterogenously and generating their own output encoding you won't necessarily

Re: Swift

2014-06-05 Thread J. Leslie Turriff
On Thursday 05 June 2014 06:10:42 Frédéric Grosshans wrote: Le 05/06/2014 12:52, David Starner a écrit : On Thu, Jun 5, 2014 at 3:04 AM, J. Leslie Turriff jlturr...@centurylink.net wrote: What I find interesting is that (with the possible exception of Ada) I don't think that

Re: Swift

2014-06-05 Thread J. Leslie Turriff
On Thursday 05 June 2014 12:24:12 Jeff Senn wrote: On Jun 5, 2014, at 12:41 PM, Hans Aberg haber...@telia.com wrote: On 5 Jun 2014, at 17:46, Jeff Senn s...@maya.com wrote: That is: are identifiers merely sequences of characters or intended to be comparable as “Unicode strings” (under some

Re: Swift

2014-06-05 Thread Jeff Senn
On Jun 5, 2014, at 2:22 PM, J. Leslie Turriff jlturr...@centurylink.net wrote: On Thursday 05 June 2014 12:24:12 Jeff Senn wrote: On Jun 5, 2014, at 12:41 PM, Hans Aberg haber...@telia.com wrote: On 5 Jun 2014, at 17:46, Jeff Senn s...@maya.com wrote: That is: are identifiers merely

Re: Swift

2014-06-05 Thread Hans Aberg
On 5 Jun 2014, at 19:24, Jeff Senn s...@maya.com wrote: On Jun 5, 2014, at 12:41 PM, Hans Aberg haber...@telia.com wrote: On 5 Jun 2014, at 17:46, Jeff Senn s...@maya.com wrote: That is: are identifiers merely sequences of characters or intended to be comparable as “Unicode strings”

Re: Swift

2014-06-05 Thread Daniel Bünzli
Le jeudi, 5 juin 2014 à 18:24, Jeff Senn a écrit : If your implication is that there should be no canonicalization (the string from the source is used as a sequence of characters only directly mapped to a symbol), then I predict sticky problems in the future. Note that this is actually

Re: Corner cases (was: Re: UTF-16 Encoding Scheme and U+FFFE)

2014-06-05 Thread Doug Ewell
Philippe Verdy verdy underscore p at wanadoo dot fr wrote: Not necessarily true. [602 words] This has nothing to do with the scenario I described, which involved removing a BOM from the start of an arbitrary fragment of data, thereby corrupting the data because the BOM was actually a ZWNBSP.

Re: Swift

2014-06-05 Thread David Starner
On Thu, Jun 5, 2014 at 11:14 AM, J. Leslie Turriff jlturr...@centurylink.net wrote: All true; but do any languages allow for keywords (if, then, else, do, while, until, end, iterate, leave, call return, exit,...) to be expressed in the programmer's locale? Both ALGOL 60 and ALGOL 68

Re: Corner cases (was: Re: UTF-16 Encoding Scheme and U+FFFE)

2014-06-05 Thread Philippe Verdy
2014-06-05 21:46 GMT+02:00 Doug Ewell d...@ewellic.org: Philippe Verdy verdy underscore p at wanadoo dot fr wrote: Not necessarily true. [602 words] This has nothing to do with the scenario I described, which involved removing a BOM from the start of an arbitrary fragment of data,

Re: Swift

2014-06-05 Thread Bill Poser
A few years ago there was a company in Australia that was developing a multilingual language called Protium Blue. The lead was someone named Diarmuid Pigott. As far as I can tell, the project has come to an end, but one can still find bits about the project, e.g. this:

Re: Swift

2014-06-05 Thread Philippe Verdy
IMHO, a programming language that accepts non-ASCII identifiers should always nrmalize the identifiers it accepts, before heeding it in its hashed symbol table. And for this type of usage, we strongly need that normalization is stable, but much more than with existing stability rules: the

RE: Swift

2014-06-05 Thread Whistler, Ken
Hmmm. Any programming language project that derives from someone who describes himself as a “polyhistor”, which claims to be polymorphic and pasigraphic and multi-lingual and orthogonal and polysynthetic, which draws its inspiration from the theory of “Natural Language Metasemantics”, and which

Re: Swift

2014-06-05 Thread Philippe Verdy
Warning ! This definition of allowed identifiers has severe security risks: it does not support any kind of normalization or canonical equivalence, and it's impossible to use normalization in the language lexer/parser while making sure that they will be stable over the set of unassigned

Re: Swift

2014-06-05 Thread David Starner
On Thu, Jun 5, 2014 at 5:00 PM, Whistler, Ken ken.whist...@sap.com wrote: Any programming language project that derives from someone who describes himself as a “polyhistor”, which claims to be polymorphic and pasigraphic and multi-lingual and orthogonal and polysynthetic, which draws its