2014-06-05 0:48 GMT+02:00 Doug Ewell d...@ewellic.org:
If you are processing arbitrary fragments of a stream, without knowledge
of preceding fragments, as in this example, then you have no business
making *any* changes to that fragment based on interpretation of that
fragment as Unicode text.
On 5 Jun 2014, at 04:50, David Starner prosfil...@gmail.com wrote:
On Wed, Jun 4, 2014 at 6:00 AM, Jukka K. Korpela jkorp...@cs.tut.fi wrote:
The change is logical in the sense that bold face is a
more original notation and double-struck letters as characters imitate the
imitation of boldface
On Wednesday 04 June 2014 10:53:59 Shawn Steele wrote:
I’m sort of confused why Unicode would be a big deal. C# other languages
have allowed unicode letters in identifiers for years, so readable strings
should be possible in almost any language.
It’s a bit cute to include emoji, but I’m not
On Thu, Jun 5, 2014 at 3:04 AM, J. Leslie Turriff
jlturr...@centurylink.net wrote:
What I find interesting is that (with the possible exception of Ada)
I don't
think that any of the commonly used languages allow for the use of Unicode
characters for non- user-defined tokens (i.e.
Le 05/06/2014 12:52, David Starner a écrit :
On Thu, Jun 5, 2014 at 3:04 AM, J. Leslie Turriff
jlturr...@centurylink.net wrote:
What I find interesting is that (with the possible exception of Ada) I
don't
think that any of the commonly used languages allow for the use of Unicode
Am 04.06.14 11:28, schrieb Andre Schappo:
The restrictions seem a little like IDNA2008. Anyone have links to
info giving a detailed explanation/tabulation of allowed and non
allowed Unicode chars for Swift Variable and Constant names?
The language reference is at
Has anyone figured out whether character sequences that are non-canonical
(de)compositions but could be recomposed to the same result
are the same identifier or not?
That is: are identifiers merely sequences of characters or intended to be
comparable as “Unicode strings” (under some sort of
I haven't done any analysis, but on first glance it looks like it is based
on
http://www.unicode.org/reports/tr31/#Alternative_Identifier_Syntax
Mark https://google.com/+MarkDavis
*— Il meglio è l’inimico del bene —*
On Thu, Jun 5, 2014 at 5:46 PM, Jeff Senn s...@maya.com wrote:
Has
On 5 Jun 2014, at 17:46, Jeff Senn s...@maya.com wrote:
That is: are identifiers merely sequences of characters or intended to be
comparable as “Unicode strings” (under some sort of compatibility rule)?
In computer languages, identifiers are normally compared only for equality, as
it reduces
On Jun 5, 2014, at 12:41 PM, Hans Aberg haber...@telia.com wrote:
On 5 Jun 2014, at 17:46, Jeff Senn s...@maya.com wrote:
That is: are identifiers merely sequences of characters or intended to be
comparable as “Unicode strings” (under some sort of compatibility rule)?
In computer
On Thu, 5 Jun 2014 09:41:07 +0200
Philippe Verdy verd...@wanadoo.fr wrote:
You'll probably want to sync on the first newline control and then
proceed from that point. But now if you have those devices configured
heterogenously and generating their own output encoding you won't
necessarily
On Thursday 05 June 2014 06:10:42 Frédéric Grosshans wrote:
Le 05/06/2014 12:52, David Starner a écrit :
On Thu, Jun 5, 2014 at 3:04 AM, J. Leslie Turriff
jlturr...@centurylink.net wrote:
What I find interesting is that (with the possible exception of
Ada) I don't think that
On Thursday 05 June 2014 12:24:12 Jeff Senn wrote:
On Jun 5, 2014, at 12:41 PM, Hans Aberg haber...@telia.com wrote:
On 5 Jun 2014, at 17:46, Jeff Senn s...@maya.com wrote:
That is: are identifiers merely sequences of characters or intended to
be comparable as “Unicode strings” (under some
On Jun 5, 2014, at 2:22 PM, J. Leslie Turriff jlturr...@centurylink.net wrote:
On Thursday 05 June 2014 12:24:12 Jeff Senn wrote:
On Jun 5, 2014, at 12:41 PM, Hans Aberg haber...@telia.com wrote:
On 5 Jun 2014, at 17:46, Jeff Senn s...@maya.com wrote:
That is: are identifiers merely
On 5 Jun 2014, at 19:24, Jeff Senn s...@maya.com wrote:
On Jun 5, 2014, at 12:41 PM, Hans Aberg haber...@telia.com wrote:
On 5 Jun 2014, at 17:46, Jeff Senn s...@maya.com wrote:
That is: are identifiers merely sequences of characters or intended to be
comparable as “Unicode strings”
Le jeudi, 5 juin 2014 à 18:24, Jeff Senn a écrit :
If your implication is that there should be no canonicalization (the string
from the source is used as a sequence of characters only directly mapped to a
symbol), then I predict sticky problems in the future.
Note that this is actually
Philippe Verdy verdy underscore p at wanadoo dot fr wrote:
Not necessarily true.
[602 words]
This has nothing to do with the scenario I described, which involved
removing a BOM from the start of an arbitrary fragment of data,
thereby corrupting the data because the BOM was actually a ZWNBSP.
On Thu, Jun 5, 2014 at 11:14 AM, J. Leslie Turriff
jlturr...@centurylink.net wrote:
All true; but do any languages allow for keywords (if, then, else,
do, while,
until, end, iterate, leave, call return, exit,...) to be expressed in the
programmer's locale?
Both ALGOL 60 and ALGOL 68
2014-06-05 21:46 GMT+02:00 Doug Ewell d...@ewellic.org:
Philippe Verdy verdy underscore p at wanadoo dot fr wrote:
Not necessarily true.
[602 words]
This has nothing to do with the scenario I described, which involved
removing a BOM from the start of an arbitrary fragment of data,
A few years ago there was a company in Australia that was developing a
multilingual language called Protium Blue. The lead was someone named
Diarmuid Pigott. As far as I can tell, the project has come to an end, but
one can still find bits about the project, e.g. this:
IMHO, a programming language that accepts non-ASCII identifiers should
always nrmalize the identifiers it accepts, before heeding it in its hashed
symbol table.
And for this type of usage, we strongly need that normalization is stable,
but much more than with existing stability rules: the
Hmmm.
Any programming language project that derives from someone who describes
himself as a “polyhistor”, which claims to be polymorphic and pasigraphic and
multi-lingual and orthogonal and polysynthetic, which draws its inspiration
from the
theory of “Natural Language Metasemantics”, and which
Warning !
This definition of allowed identifiers has severe security risks: it does
not support any kind of normalization or canonical equivalence, and it's
impossible to use normalization in the language lexer/parser while making
sure that they will be stable over the set of unassigned
On Thu, Jun 5, 2014 at 5:00 PM, Whistler, Ken ken.whist...@sap.com wrote:
Any programming language project that derives from someone who describes
himself as a “polyhistor”, which claims to be polymorphic and pasigraphic
and
multi-lingual and orthogonal and polysynthetic, which draws its
24 matches
Mail list logo