Re: Mixed-Script confusables in prog.languages

2016-12-05 Thread Richard Wordingham
On Mon, 5 Dec 2016 09:31:11 +0100 Reini Urban wrote: > > On Dec 4, 2016, at 11:45 PM, Richard Wordingham > > wrote: > > > > On Sun, 4 Dec 2016 12:09:36 +0100 > > Reini Urban wrote: > > > >> * normalize identifiers (NFC)

Re: Mixed-Script confusables in prog.languages

2016-12-05 Thread gfb hjjhjh
How about package names like ロシアМС21(Note the МС are Cyrillic), or πr²の秘密, or エリ_хорошо_μ'sic_4⃣ever? Although they aren't really names that people would usually use in package/var names, they are meaningful names... 2016年12月5日 16:39 於 "Reini Urban" 寫道: > > > On Dec 4, 2016,

Re: Mixed-Script confusables in prog.languages

2016-12-05 Thread Martin J. Dürst
On 2016/12/05 04:07, Philippe Verdy wrote: In more technical programming languages however, you can usually be much more restrictive as the identifiers used are generally abbreviated and simplified: you can kill lettercase differences for example, In some languages maybe. But languages such

Re: Mixed-Script confusables in prog.languages

2016-12-05 Thread Martin J. Dürst
On 2016/12/05 17:31, Reini Urban wrote: ψ_S contains Greek U+03C8, Common and Latin. Since Latin and Common are always allowed, the only new script is Greek. The first non-default script is automatically and silently allowed, only a mix with another non-default script, such as Cyrillic

Re: Mixed-Script confusables in prog.languages

2016-12-05 Thread Reini Urban
> On Dec 4, 2016, at 11:45 PM, Richard Wordingham > wrote: > > On Sun, 4 Dec 2016 12:09:36 +0100 > Reini Urban wrote: > >> * normalize identifiers (NFC) and only store normalized variants. >> this should catch bidi spoofs, combining