This is going to cause some fun. I wonder, in a case like this, which of the two scripts takes precedence?
- C On 25/11/2010, Shriramana Sharma <[email protected]> wrote: > Hello. Here's a Telugu vs Kannada confusables list I cooked up right > now. As this is an important security issue, I post to all the lists > so that people may contribute. Also, some of this is probably already > there but I'm going for completeness: > ANUSVARA > VISARGA > LETTER A ... <SNIP> ... > DIGIT NINE > That makes sixty two characters in all including the ones marked ?. > Even *without* the ones marked ? (which I did because I suspected > others may contest these cases) it comes to fifty two. > Now to count the characters NOT common or confusable (obviously much > lesser): > > LETTER U > LETTER UU ... <SNIP> ... > DIGIT SEVEN > That comes to sixteen. > I left out the LLLA of Kannada and the fractions of Telugu which are > *not* present (as of Unicode 6.0) in the other script because > obviously there can be no comparison on those. > So there are at least *thrice* (or at most *four times*) as many > confusable characters between Kannada and Telugu than there are > NON-confusables. > Now can you beat that! Speaking of scripts with a common origin and > causing potential confusion in IDNs, *I* say Kannada and Telugu takes > the cake! > Shriramana Sharma.

