[issue46572] Unicode identifiers not necessarily unique

2022-01-29 Thread Diego Argueta
Diego Argueta added the comment: I did read PEP-3131 before posting this but I still thought the behavior was counterintuitive.

[issue46572] Unicode identifiers not necessarily unique

2022-01-29 Thread Eryk Sun
Eryk Sun added the comment: Please read "Identifiers and keywords" [1] in the documentation. For example:

>>> import unicodedata as ud
>>> ud.normalize('NFKC', '𝖇𝖆𝖗') == 'bar'
True
>>> c = '\N{CYRILLIC SMALL LETTER A}'
>>> ud.name(ud.normalize('NFKC', c))
'CYRILLIC

[issue46572] Unicode identifiers not necessarily unique

2022-01-29 Thread Pablo Galindo Salgado
Pablo Galindo Salgado added the comment: This seems coherent with https://www.python.org/dev/peps/pep-3131/ to me. The parser ensures all identifiers are converted into the normal form NFKC while parsing; comparison of identifiers is based on NFKC.
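
A minimal sketch of the NFKC behavior described in the comment above, assuming CPython 3. The helper name same_identifier is hypothetical and only illustrates the comparison rule; it is not part of any library. The exec call is used just to show that two visually different spellings end up as a single name.

```python
import unicodedata

# Escapes for MATHEMATICAL BOLD FRAKTUR SMALL B, A, R (inside U+1D400..U+1D59F).
fancy = "\U0001D587\U0001D586\U0001D597"
plain = "bar"


def same_identifier(a: str, b: str) -> bool:
    """Hypothetical helper: compare two names after NFKC normalization."""
    return unicodedata.normalize("NFKC", a) == unicodedata.normalize("NFKC", b)


# The parser normalizes identifiers, so both assignments target one variable.
namespace = {}
exec(f"{fancy} = 1\n{plain} = 2", namespace)

# Collect the names the executed code defined (skipping __builtins__).
user_names = [name for name in namespace if not name.startswith("__")]

print(same_identifier(fancy, plain))
print(same_identifier("\N{CYRILLIC SMALL LETTER A}", "a"))
print(user_names)
```

The Cyrillic letter is included as a contrast: NFKC folds compatibility characters such as the mathematical alphabets, but it does not merge look-alike letters from different scripts, so those remain distinct identifiers.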

[issue46572] Unicode identifiers not necessarily unique

2022-01-29 Thread Diego Argueta
New submission from Diego Argueta: The way Python 3 handles identifiers containing mathematical characters appears to be broken. I didn't test the entire range of U+1D400 through U+1D59F, but I spot-checked it and the bug manifests itself there: Python 3.9.7 (default, Sep 10 2021,