[Python-Dev] Support of UTF-16 and UTF-32 source encodings

Serhiy Storchaka Sat, 14 Nov 2015 11:21:49 -0800

For now UTF-16 and UTF-32 source encodings are not supported. There is acomment in Parser/tokenizer.c:


    /* Disable support for UTF-16 BOMs until a decision
       is made whether this needs to be supported.  */

Can we make a decision whether this support will be added in foreseeablefuture (say in near 10 years), or no?

Removing commented out and related code will help to refactor thetokenizer, and that can help to fix some existing bugs (e.g. issue14811,issue18961, issue20115 and may be others). Current tokenizing code istoo tangled.

If the support of UTF-16 and UTF-32 is planned, I'll take this toattention during refactoring. But in many places besides the tokenizerthe ASCII compatible encoding of source files is expected.


_______________________________________________
Python-Dev mailing list
[email protected]
https://mail.python.org/mailman/listinfo/python-dev
Unsubscribe: 
https://mail.python.org/mailman/options/python-dev/archive%40mail-archive.com

[Python-Dev] Support of UTF-16 and UTF-32 source encodings

Reply via email to