[issue36486] Bugs and inconsistencies in unicodedata

2019-04-05 Thread Terry J. Reedy


Change by Terry J. Reedy :


--
stage:  -> needs patch
versions: +Python 3.8

___
Python tracker 

___
___
Python-bugs-list mailing list
Unsubscribe: 
https://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue36486] Bugs and inconsistencies in unicodedata

2019-03-30 Thread Karthikeyan Singaravelan


Change by Karthikeyan Singaravelan :


--
nosy: +lemburg

___
Python tracker 

___
___
Python-bugs-list mailing list
Unsubscribe: 
https://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue36486] Bugs and inconsistencies in unicodedata

2019-03-30 Thread David Corbett

New submission from David Corbett :

In `unicodedata`, the functions `lookup` and `name` have some bugs and 
inconsistencies.

`lookup` matches case-insensitively, except for the algorithmic names of Hangul 
syllables and CJK unified ideographs, which must be in all caps. The 
documentation does not explain how character names are fuzzily matched.

`lookup` accepts names like “CJK UNIFIED IDEOGRAPH-04E00”, where the code point 
has a leading zero.

`lookup` and `name` don’t implement rule NR2, defined in chapter 4 of Unicode, 
for Tangut ideographs’ names.

--
assignee: docs@python
components: Documentation, Unicode
messages: 339203
nosy: docs@python, dscorbett, ezio.melotti, vstinner
priority: normal
severity: normal
status: open
title: Bugs and inconsistencies in unicodedata
type: behavior
versions: Python 3.7

___
Python tracker 

___
___
Python-bugs-list mailing list
Unsubscribe: 
https://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com