[issue34053] Support localization of unicode descriptions
Pander added the comment:

As discussed on python-ideas, I created scripts at https://github.com/OpenTaal/python-unicodedata_l10n that provide a starting point.

--
Python tracker <https://bugs.python.org/issue34053>

Python-bugs-list mailing list
Unsubscribe: https://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com
[issue34053] Support localization of unicode descriptions
Pander added the comment:

Thanks, posted it at https://groups.google.com/forum/#!topic/python-ideas/g2jj4WRVDFA

--
Python tracker <https://bugs.python.org/issue34053>
[issue34053] Support localization of unicode descriptions
New submission from Pander:

Please support localization of Unicode block descriptions and character descriptions. Translations are available from https://github.com/unicode-table/unicode-table-data/tree/master/loc

If possible, use a gettext approach similar to https://pypi.org/project/pycountry/

Implementing this feature will allow users to read Unicode descriptions in their own language rather than only in English. For example, at present names are available only in English:

    from unicodedata import name
    print(name('ß'))
    LATIN SMALL LETTER SHARP S

So unicodedata could provide a way to translate LATIN SMALL LETTER SHARP S to e.g. German with:

    from unicodedata import name, LOCALED_DIR
    from gettext import translation
    german = translation('UnicodeData', LOCALED_DIR, languages=['de'])
    german.install()
    print(_(name('ß')))
    LATEINISCHER KLEINBUCHSTABE SCHARFES S

and something similar for unicodedata.category.

--
components: Unicode
messages: 321095
nosy: PanderMusubi, ezio.melotti, vstinner
priority: normal
severity: normal
status: open
title: Support localization of unicode descriptions
type: enhancement
versions: Python 3.6, Python 3.7, Python 3.8

Python tracker <https://bugs.python.org/issue34053>
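To make the gettext idea concrete, here is a minimal sketch that runs against today's standard library. It assumes a hypothetical message domain called 'UnicodeData' and a caller-supplied locale directory; unicodedata does not ship any translation catalogs, and `localized_name` is an illustrative helper, not an existing API. When no catalog is found, the lookup falls back to the English name.

```python
import gettext
import unicodedata


def localized_name(char, language, localedir):
    """Return the Unicode name of char, translated if a catalog exists.

    Assumptions: the 'UnicodeData' domain and the localedir layout are
    hypothetical; unicodedata ships no translation catalogs today.
    """
    catalog = gettext.translation(
        'UnicodeData',
        localedir=localedir,
        languages=[language],
        fallback=True,  # no .mo file found: use NullTranslations (identity)
    )
    return catalog.gettext(unicodedata.name(char))


# With no German catalog installed, this prints the English name.
print(localized_name('ß', 'de', '/nonexistent/locale'))
```

Using `fallback=True` keeps callers working on systems where a given language has not been translated yet, which seems a reasonable default for descriptive strings.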
[issue6331] Add unicode script info to the unicode database
Pander added the comment:

Unicode version 11.0 has been out since June 2018. Perhaps that could help move this forward.

--
Python tracker <https://bugs.python.org/issue6331>
[issue16684] Unicode property value abbreviated names and long names
Pander added the comment:

Unicode version 11.0 has been out since June 2018. Perhaps that could help move this forward.

--
Python tracker <https://bugs.python.org/issue16684>
[issue16684] Unicode property value abbreviated names and long names
Pander added the comment:

Any updates or ideas on how to move this forward? Meanwhile, should the issue be retargeted at version 3.6? Thanks.

Please also see https://bugs.python.org/issue6331

--
Python tracker <rep...@bugs.python.org> <http://bugs.python.org/issue16684>
[issue6331] Add unicode script info to the unicode database
Pander added the comment:

Any updates or ideas on how to move this forward? See also https://bugs.python.org/issue16684

Thanks.

--
Python tracker <rep...@bugs.python.org> <http://bugs.python.org/issue6331>
[issue6331] Add unicode script info to the unicode database
Pander added the comment:

I see the patch supports Unicode scripts (https://en.wikipedia.org/wiki/Script_%28Unicode%29), but I am also interested in support for Unicode blocks (https://en.wikipedia.org/wiki/Unicode_block). Code supporting the latter is at https://github.com/nagisa/unicodeblocks

I could not quite make out whether the patch also supports Unicode blocks. If not, should that be requested in a separate issue?

Furthermore, support for Unicode scripts and blocks should be updated each time a new version of the Unicode standard is published. Someone should check whether the latest patch needs updating to the latest version of Unicode, not only for this issue but for each release of Python.

--
Python tracker rep...@bugs.python.org http://bugs.python.org/issue6331
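As an illustration of what block support could look like, here is a minimal sketch that parses data in the format of the Unicode Consortium's Blocks.txt and looks up the block of a character. The embedded sample holds only five blocks, and the names `parse_blocks` and `block_of` are hypothetical; this is not the API of the patch or of the unicodeblocks package.

```python
from bisect import bisect_right

# A few lines in the format of Blocks.txt (sample only, not the full file).
SAMPLE_BLOCKS = """\
# Start Code..End Code; Block Name
0000..007F; Basic Latin
0080..00FF; Latin-1 Supplement
0100..017F; Latin Extended-A
0370..03FF; Greek and Coptic
0400..04FF; Cyrillic
"""


def parse_blocks(text):
    """Return a sorted list of (start, end, name) tuples."""
    blocks = []
    for line in text.splitlines():
        line = line.split('#', 1)[0].strip()  # drop comments and blanks
        if not line:
            continue
        code_range, name = line.split(';')
        start, end = code_range.split('..')
        blocks.append((int(start, 16), int(end, 16), name.strip()))
    blocks.sort()
    return blocks


BLOCKS = parse_blocks(SAMPLE_BLOCKS)
STARTS = [start for start, _end, _name in BLOCKS]


def block_of(char):
    """Return the block name for char, or None if it is not in the data."""
    code = ord(char)
    index = bisect_right(STARTS, code) - 1  # last block starting <= code
    if index >= 0:
        start, end, name = BLOCKS[index]
        if start <= code <= end:
            return name
    return None


print(block_of('ß'))
```

Generating the table from Blocks.txt at build time, rather than maintaining it by hand, would address the concern about keeping pace with each new Unicode version.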
[issue16684] Unicode property value abbreviated names and long names
Pander added the comment:

Attached is the requested proof-of-concept script.

--
Added file: http://bugs.python.org/file28405/create-unicodedata-dicts-prop-value-alias-20121223.py

Python tracker rep...@bugs.python.org http://bugs.python.org/issue16684
[issue6331] Add unicode script info to the unicode database
Pander added the comment:

Please also consider reviewing the functionality offered by http://pypi.python.org/pypi/unicodescript/ and http://pypi.python.org/pypi/unicodeblocks/, which could be used to improve and extend the proposed patch.

--
nosy: +PanderMusubi

Python tracker rep...@bugs.python.org http://bugs.python.org/issue6331
[issue16681] Documentation 'bidirectional category' should be 'bidirectional class' in unicodedata package
New submission from Pander:

The documentation of the unicodedata package at docs.python.org/3/library/unicodedata.html refers to 'bidirectional category'; this should be 'bidirectional class'. Please see www.unicode.org/Public/UNIDATA/PropertyValueAliases.txt, where only 'bidirectional class' is used.

--
components: Unicode
messages: 177473
nosy: PanderMusubi, ezio.melotti
priority: normal
severity: normal
status: open
title: Documentation 'bidirectional category' should be 'bidirectional class' in unicodedata package
type: enhancement
versions: Python 3.5

Python tracker rep...@bugs.python.org http://bugs.python.org/issue16681
[issue16684] Unicode property value abbreviated names and long names
New submission from Pander:

The package unicodedata (http://docs.python.org/3/library/unicodedata.html) offers lookup of property values for Unicode characters in terms of general category, bidirectional class and East Asian width:

    unicodedata.category(chr)
    unicodedata.bidirectional(chr)
    unicodedata.east_asian_width(chr)

The abbreviated name of the specific category is returned. However, for certain applications it is important to be able to get from the abbreviated name to the long name and vice versa.

The data needed to do this can be found at http://www.unicode.org/Public/UNIDATA/PropertyValueAliases.txt under the sections

    # General_Category (gc)
    # Bidi_Class (bc)
    # East_Asian_Width (ea)

Use only the second (abbreviated name) and third (long name) fields, ignoring other fields and any comments. For general category, also support translation back and forth of the one-letter abbreviations, which are groups representing the two-letter general category abbreviations with the same initial letter.

Please extend this package with a way of translating back and forth between abbreviated name and long name for the property values defined in Unicode for general category, bidirectional class and East Asian width. This functionality should be independent of retrieving the abbreviated names for a Unicode character, as is available now, and should be accessible via separate methods or dictionaries in which developers can perform lookups themselves.

Implementing the functionality requested in this issue allows Python developers to get from an abbreviated property value to a meaningful property value name and vice versa, without having to retrieve this information from the Unicode Consortium and/or ship it with their code at the risk of using outdated information.

--
components: Unicode
messages: 177476
nosy: PanderMusubi, ezio.melotti
priority: normal
severity: normal
status: open
title: Unicode property value abbreviated names and long names
type: enhancement
versions: Python 3.5

Python tracker rep...@bugs.python.org http://bugs.python.org/issue16684
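The parsing rule described above (second field is the abbreviated name, third field is the long name, everything else ignored) can be sketched as follows. The embedded sample contains only a handful of lines in the format of PropertyValueAliases.txt, and `parse_aliases` plus the two dictionaries are hypothetical names for illustration; nothing like them exists in unicodedata.

```python
import unicodedata

# A few lines in the format of PropertyValueAliases.txt (sample only).
SAMPLE_ALIASES = """\
# General_Category (gc)
gc ; L  ; Letter            # Ll | Lm | Lo | Lt | Lu
gc ; Ll ; Lowercase_Letter
gc ; Lu ; Uppercase_Letter
gc ; Nd ; Decimal_Number    ; digit
# Bidi_Class (bc)
bc ; L  ; Left_To_Right
bc ; R  ; Right_To_Left
# East_Asian_Width (ea)
ea ; Na ; Narrow
ea ; W  ; Wide
"""


def parse_aliases(text, properties=('gc', 'bc', 'ea')):
    """Map property -> {abbreviated name: long name}."""
    abbr_to_long = {prop: {} for prop in properties}
    for line in text.splitlines():
        line = line.split('#', 1)[0].strip()  # drop comments and blanks
        if not line:
            continue
        fields = [field.strip() for field in line.split(';')]
        if len(fields) < 3 or fields[0] not in abbr_to_long:
            continue
        # Only the second and third fields are used; extras are ignored.
        abbr_to_long[fields[0]][fields[1]] = fields[2]
    return abbr_to_long


ABBR_TO_LONG = parse_aliases(SAMPLE_ALIASES)
LONG_TO_ABBR = {
    prop: {long_name: abbr for abbr, long_name in mapping.items()}
    for prop, mapping in ABBR_TO_LONG.items()
}

# Combine with the existing per-character lookups.
print(ABBR_TO_LONG['gc'][unicodedata.category('ß')])
print(LONG_TO_ABBR['bc']['Right_To_Left'])
```

Keeping the dictionaries separate from the per-character functions matches the request that the mapping be usable on its own, for example with the one-letter group 'L'.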
[issue6331] Add unicode script info to the unicode database
Pander added the comment:

The latest version of the respective sources can be found here: https://github.com/ConradIrwin/unicodescript and here: https://github.com/simukis/unicodeblocks

--
Python tracker rep...@bugs.python.org http://bugs.python.org/issue6331
[issue16684] Unicode property value abbreviated names and long names
Pander added the comment:

I myself have a lot of Python applications that process font files and interact with fonttools and FontForge, which are both written in Python too. As you have your own dict for this purpose, and other people probably do too, it would be justified to add these three small dicts to the standard library, especially since this package in the standard library follows the definitions from the Unicode Consortium. When this is shipped in one package, developers will always have an in-sync translation from abbreviated names to long names and vice versa. Over the last few years I have regularly needed to adjust my dicts for definitions added by the Unicode Consortium that the stdlib already supports.

At the moment, translation from Unicode code points such as U+1234 to human-readable Unicode names and vice versa is offered. Providing human-readable names for the property values is a service of the same level and would cater to approximately the same user group.

If you agree that these dicts can be added, I am willing to provide a script that will parse the aforementioned file.

--
Python tracker rep...@bugs.python.org http://bugs.python.org/issue16684
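For reference, the existing translation between code points and character names mentioned above is available through `unicodedata.name` and `unicodedata.lookup`. A short sketch, using U+00DF as an arbitrary example character:

```python
import unicodedata

char = '\u00df'  # LATIN SMALL LETTER SHARP S

# Code point -> human-readable name.
char_name = unicodedata.name(char)
print(f'U+{ord(char):04X}', char_name)

# Name -> character; the round trip returns the original character.
assert unicodedata.lookup(char_name) == char
```

The requested dictionaries would give property values the same two-way lookup that characters already have.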
[issue5894] Lookup of localised language name by ISO 639 language code and reverse look up
New submission from pander pan...@users.sourceforge.net:

Hi all,

Does a module exist that can look up localised language names for ISO 639 language codes? If not, could one be constructed according to (roughly) the following specifications?

1) Look up a localised language name for an ISO 639-1 language code. A required parameter named code provides the language code; an optional parameter named lang provides the output language code and defaults to 'en'. Example input and output would be:

    iso6391name(code='en') = 'English'
    iso6391name(code='zu', lang='en') = 'Zulu'
    iso6391name(code='zu', lang='nl') = 'Zoeloe'
    iso6391name(code='nl', lang='en') = 'Dutch'

2) Reverse lookup of 1), from localised language name to language code, e.g.:

    iso6391code(name='English') = 'en'
    iso6391code(name='Zulu', lang='en') = 'zu'
    iso6391code(name='Zoeloe', lang='nl') = 'zu'
    iso6391code(name='Dutch', lang='en') = 'nl'

3) Look up the native language name for an ISO 639-1 language code. A required parameter named code provides the language code. Example input and output would be:

    iso6391native(code='en') = 'English'
    iso6391native(code='nl') = 'Nederlands'

4) Retrieve a tuple with all language codes. Example output would be:

    iso6391codes() = ('aa', 'ab', ..., 'zu')

5) Retrieve a tuple with all localised language names. An optional parameter named lang provides the output language code and defaults to 'en'. Example input and output would be:

    iso6391names() = ('Afar', 'Abkhazian', ..., 'Zulu')
    iso6391names(lang='nl') = ('Afar', 'Abchazisch', ..., 'Zoeloe')

6), 7), 8), 9), 10) Something similar for ISO 639-2 language codes

11), 12), 13), 14), 15) Something similar for ISO 639-3 language codes

Sources:
http://en.wikipedia.org/wiki/List_of_ISO_639-1_codes
http://nl.wikipedia.org/wiki/Lijst_van_ISO_639-1-codes

I have a very simple implementation for English and Dutch, but figured that such a module should be available to all developers in all languages. Hence this request to build something from scratch in the Python localisation community. Implementing the functionality described above will make internationalisation and localisation of Python applications easier.

Please reply to this issue with constructive feedback.

Thanks,

Pander

PS The original post in i18n-sig is here: http://mail.python.org/pipermail/i18n-sig/2009-April/002117.html

--
messages: 86867
nosy: pander
severity: normal
status: open
title: Lookup of localised language name by ISO 639 language code and reverse look up
type: feature request

Python tracker rep...@bugs.python.org http://bugs.python.org/issue5894
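Items 1) to 5) above can be sketched with a small in-memory table. The sample covers only three codes and two output languages, the table layout is an assumption, and a real module would load complete data for ISO 639-1; the function names follow the proposal, but this is not an existing library.

```python
# Sample data only: NAMES[output language][language code] -> localised name.
NAMES = {
    'en': {'en': 'English', 'nl': 'Dutch', 'zu': 'Zulu'},
    'nl': {'en': 'Engels', 'nl': 'Nederlands', 'zu': 'Zoeloe'},
}


def iso6391name(code, lang='en'):
    """1) Localised name of a language code."""
    return NAMES[lang][code]


def iso6391code(name, lang='en'):
    """2) Reverse lookup from localised name to language code."""
    for code, localised in NAMES[lang].items():
        if localised == name:
            return code
    raise KeyError(name)


def iso6391native(code):
    """3) Name of a language in that language itself.

    Works only for codes that also appear as an output language
    in the sample table ('en' and 'nl' here).
    """
    return NAMES[code][code]


def iso6391codes():
    """4) All language codes, sorted."""
    return tuple(sorted(NAMES['en']))


def iso6391names(lang='en'):
    """5) All localised names, in the order of iso6391codes()."""
    return tuple(NAMES[lang][code] for code in iso6391codes())


print(iso6391name(code='zu', lang='nl'))
print(iso6391names(lang='nl'))
```

The same five functions could be repeated for the other parts of ISO 639 by swapping the table, which is why the data is kept separate from the lookup logic.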