[issue40980] group names of bytes regexes are strings

Ma Lin Tue, 16 Jun 2020 05:12:16 -0700

Ma Lin <[email protected]> added the comment:

In this case you can only use 'latin1', which maps each character 
(\u0000-\u00FF) directly to/from one byte.


If you use 'utf-8', one character may map to multiple bytes, such as 'Δ' -> 
b'\xce\x94'.

'\x94' is not a valid identifier character, so it will raise an error:

    >>> '\xce'.isidentifier()   # '\xce' is 'Î'
    True
    >>> '\x94'.isidentifier()
    False
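
As a rough sketch of the difference between the two codecs (the helper names 
name_to_bytes and bytes_to_name are hypothetical, made up for illustration, 
and are not part of the re module):

```python
def name_to_bytes(name: str) -> bytes:
    # Hypothetical helper: 'latin1' encodes each character in
    # \u0000-\u00FF as exactly one byte.
    return name.encode('latin1')


def bytes_to_name(raw: bytes) -> str:
    # Hypothetical helper: 'latin1' decodes each byte to exactly one character.
    return raw.decode('latin1')


# Every possible byte value survives a latin1 round trip unchanged.
all_bytes = bytes(range(256))
assert name_to_bytes(bytes_to_name(all_bytes)) == all_bytes

# With 'utf-8', one character can become two bytes.
encoded = 'Δ'.encode('utf-8')
assert encoded == b'\xce\x94'

# Read back one character per byte, those two bytes are 'Î' followed by
# '\x94', and the result is no longer a valid identifier.
as_name = bytes_to_name(encoded)
assert len(as_name) == 2
assert not as_name.isidentifier()
```

An ordinary ASCII name such as 'word' comes out the same under either codec, 
so the problem only appears for characters outside the ASCII range.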

You may close this issue (I can't close it); we can still continue the discussion.

----------

_______________________________________
Python tracker <[email protected]>
<https://bugs.python.org/issue40980>
_______________________________________
_______________________________________________
Python-bugs-list mailing list