[issue34763] Python lacks 0x4E17

2019-06-03 Thread 林自均
林自均 added the comment: "丗" means "30" in Japanese. However, it is also a variant of the Chinese character "世", which means "world" in Chinese. I'm not sure whether this information makes any difference. -- nosy: +johnlinp

[issue34763] Python lacks 0x4E17

2019-05-31 Thread Berker Peksag
Berker Peksag added the comment: Tools/unicode/makeunicodedata.py looks at the Unihan database for the fields kAccountingNumeric, kOtherNumeric, and kPrimaryNumeric in Unihan_NumericValues.txt:
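As a rough illustration of the lookup described in this comment, the sketch below parses lines in the tab-separated layout used by Unihan data files (code point, field name, value) and keeps only the three numeric fields named above. The function name parse_unihan_numeric and the sample lines are hypothetical: the samples are not copied from the real Unihan_NumericValues.txt, the field assigned to each code point is a guess, and this is not the code in makeunicodedata.py.

```python
# Fields named in the comment above.
NUMERIC_FIELDS = {"kAccountingNumeric", "kOtherNumeric", "kPrimaryNumeric"}


def parse_unihan_numeric(lines):
    """Return {code point: (field, value)} for the numeric Unihan fields.

    Hypothetical helper for illustration only. The value is kept as the
    raw string found in the third column.
    """
    table = {}
    for line in lines:
        line = line.strip()
        # Skip blank lines and '#' comment lines.
        if not line or line.startswith("#"):
            continue
        parts = line.split("\t")
        if len(parts) != 3:
            continue
        code, field, value = parts
        if field not in NUMERIC_FIELDS:
            continue
        # "U+4E17" -> 0x4E17
        table[int(code[2:], 16)] = (field, value)
    return table


# Illustrative sample lines (assumed, not taken from the real data file).
SAMPLE = [
    "# illustrative sample, not real data",
    "U+5345\tkOtherNumeric\t30",
    "U+4E17\tkOtherNumeric\t30",
    "U+4E16\tkSomeOtherField\tignored",
]

table = parse_unihan_numeric(SAMPLE)
for codepoint, (field, value) in sorted(table.items()):
    print(f"U+{codepoint:04X} {field} {value}")
```

Running the same function over the real file (opened with encoding="utf-8") would show which numeric field, if any, a given Unicode version assigns to U+4E17.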

[issue34763] Python lacks 0x4E17

2018-09-27 Thread Steven D'Aprano
Change by Steven D'Aprano : -- nosy: +steven.daprano

[issue34763] Python lacks 0x4E17

2018-09-25 Thread Xiang Zhang
Change by Xiang Zhang : -- nosy: +xiang.zhang

[issue34763] Python lacks 0x4E17

2018-09-21 Thread Benjamin Peterson
Benjamin Peterson added the comment: As I said on the PR, this is because Unicode gives U+4E17 (and other CJK ideographs) a numeric value only in the Unihan database, not in the normal UCD. makeunicodedata.py only looks at the UCD for numeric values. -- nosy: +benjamin.peterson

[issue34763] Python lacks 0x4E17

2018-09-21 Thread STINNER Victor
STINNER Victor added the comment:
$ ./python
Python 3.8.0a0 (heads/master-dirty:06e7608207, Sep 20 2018, 01:52:01)
>>> import unicodedata
>>> unicodedata.unidata_version
'11.0.0'
>>> unicodedata.numeric('\u5345')
30.0
>>> unicodedata.numeric('\u4E17')
ValueError: not a numeric character

[issue34763] Python lacks 0x4E17

2018-09-21 Thread Matthew Barnett
Change by Matthew Barnett : -- Removed message: https://bugs.python.org/msg326012

[issue34763] Python lacks 0x4E17

2018-09-21 Thread Matthew Barnett
Change by Matthew Barnett : -- Removed message: https://bugs.python.org/msg326014

[issue34763] Python lacks 0x4E17

2018-09-21 Thread Matthew Barnett
Change by Matthew Barnett : -- Removed message: https://bugs.python.org/msg326013

[issue34763] Python lacks 0x4E17

2018-09-21 Thread Matthew Barnett
Change by Matthew Barnett : -- Removed message: https://bugs.python.org/msg326015

[issue34763] Python lacks 0x4E17

2018-09-21 Thread Matthew Barnett
Matthew Barnett added the comment: Unicode 11.0.0 has 卅 (U+5345) as being numeric and having the value 30. What's the difference between that and U+4E17? I notice that they look a lot alike. Are they different variants, perhaps traditional vs simplified? --

[issue34763] Python lacks 0x4E17

2018-09-21 Thread Marc-Andre Lemburg
Marc-Andre Lemburg added the comment: We use the Unicode database for these methods. Could you please check whether the database marks the character as numeric? If yes, we may need to check the database generation. Otherwise, there isn't much we can do, since we use the Unicode database as
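One way to run the check requested here against the database bundled with a given interpreter is sketched below. The helper name describe is hypothetical; it only uses unicodedata.category, unicodedata.numeric (with a default, so no ValueError is raised) and str.isnumeric. What it reports for U+4E17 depends on the Unicode version the interpreter was built with, so no output is shown.

```python
import unicodedata


def describe(ch):
    """Summarize how the bundled Unicode database classifies one character.

    Hypothetical helper for illustration; 'numeric' is None when the
    database assigns no numeric value to the character.
    """
    return {
        "codepoint": f"U+{ord(ch):04X}",
        "category": unicodedata.category(ch),
        "numeric": unicodedata.numeric(ch, None),
        "isnumeric": ch.isnumeric(),
    }


# Report the database version first, then the two characters under discussion.
print("Unicode database version:", unicodedata.unidata_version)
for ch in ("\u5345", "\u4e17"):
    print(describe(ch))
```

Comparing the two dictionaries side by side shows whether a given build treats 卅 and 丗 differently.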

[issue34763] Python lacks 0x4E17

2018-09-21 Thread Serhiy Storchaka
Change by Serhiy Storchaka : -- nosy: +lemburg

[issue34763] Python lacks 0x4E17

2018-09-21 Thread Roundup Robot
Change by Roundup Robot : -- keywords: +patch pull_requests: +8887 stage: -> patch review

[issue34763] Python lacks 0x4E17

2018-09-21 Thread 草木建
New submission from 草木建 : This is a very easy issue. 丗 means 30 (丗 is 0x4E17). "丗".isnumeric() should return True, but it returns False. -- components: Unicode messages: 325992 nosy: ezio.melotti, vstinner, 草木建 priority: normal severity: normal status: open title: