[issue34053] Support localization of unicode descriptions

2019-02-19 Thread Pander


Pander added the comment:

As discussed on python-ideas, created scripts at 
https://github.com/OpenTaal/python-unicodedata_l10n that provide a starting 
point.

--

___
Python tracker 
<https://bugs.python.org/issue34053>
___
___
Python-bugs-list mailing list
Unsubscribe: 
https://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue34053] Support localization of unicode descriptions

2018-07-05 Thread Pander


Pander added the comment:

Thanks, posted it at 
https://groups.google.com/forum/#!topic/python-ideas/g2jj4WRVDFA

--

___
Python tracker 
<https://bugs.python.org/issue34053>
___



[issue34053] Support localization of unicode descriptions

2018-07-05 Thread Pander

New submission from Pander:

Please support localization of Unicode block descriptions and character 
descriptions.

Translations are available from 
https://github.com/unicode-table/unicode-table-data/tree/master/loc
If possible, use a gettext approach similar to 
https://pypi.org/project/pycountry/

Implementing this feature will allow users to read Unicode descriptions in 
their own language rather than only in English.

For example, this is currently possible only in English:

from unicodedata import name
print(name('ß'))
LATIN SMALL LETTER SHARP S

So unicodedata could provide a way to translate LATIN SMALL LETTER SHARP S to 
e.g. German with:

from unicodedata import name, LOCALED_DIR
from gettext import translation
german = translation('UnicodeData', LOCALED_DIR, languages=['de'])
german.install()
print(_(name('ß')))
LATEINISCHER KLEINBUCHSTABE SCHARFES S

and something similar for unicodedata.category
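
The gettext approach proposed above can be sketched without any change to 
unicodedata. This is only an illustration: DictTranslations and 
localized_name are hypothetical names, and the in-memory dictionary stands in 
for a compiled catalog that a real implementation would load with 
gettext.translation().

```python
import gettext
import unicodedata


class DictTranslations(gettext.NullTranslations):
    """In-memory stand-in for a compiled gettext catalog (hypothetical)."""

    def __init__(self, catalog):
        super().__init__()
        self._dict_catalog = dict(catalog)

    def gettext(self, message):
        # Fall back to the untranslated message, as gettext itself does.
        return self._dict_catalog.get(message, message)


# Single sample entry taken from the example above; a real catalog
# would be a .mo file covering all character names.
german = DictTranslations({
    "LATIN SMALL LETTER SHARP S": "LATEINISCHER KLEINBUCHSTABE SCHARFES S",
})


def localized_name(char, translations):
    """Return the Unicode name of char, translated if the catalog knows it."""
    return translations.gettext(unicodedata.name(char))


print(localized_name("ß", german))
print(localized_name("a", german))  # no translation, stays in English
```

Because the English name is used as the message id, untranslated names 
degrade gracefully to the current behaviour.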

--
components: Unicode
messages: 321095
nosy: PanderMusubi, ezio.melotti, vstinner
priority: normal
severity: normal
status: open
title: Support localization of unicode descriptions
type: enhancement
versions: Python 3.6, Python 3.7, Python 3.8

___
Python tracker 
<https://bugs.python.org/issue34053>
___



[issue6331] Add unicode script info to the unicode database

2018-06-20 Thread Pander


Pander added the comment:

Unicode version 11.0 has been out since June 2018. Perhaps that could help 
move this forward.

--

___
Python tracker 
<https://bugs.python.org/issue6331>
___



[issue16684] Unicode property value abbreviated names and long names

2018-06-20 Thread Pander


Pander added the comment:

Unicode version 11.0 has been out since June 2018. Perhaps that could help 
move this forward.

--

___
Python tracker 
<https://bugs.python.org/issue16684>
___



[issue16684] Unicode property value abbreviated names and long names

2017-01-11 Thread Pander

Pander added the comment:

Any updates or ideas on how to move this forward? Meanwhile, should the issue 
relate to version 3.6? Thanks. Please see also 
https://bugs.python.org/issue6331

--

___
Python tracker <rep...@bugs.python.org>
<http://bugs.python.org/issue16684>
___



[issue6331] Add unicode script info to the unicode database

2017-01-11 Thread Pander

Pander added the comment:

Any updates or ideas on how to move this forward? See also 
https://bugs.python.org/issue16684 Thanks.

--

___
Python tracker <rep...@bugs.python.org>
<http://bugs.python.org/issue6331>
___



[issue6331] Add unicode script info to the unicode database

2014-03-23 Thread Pander

Pander added the comment:

I see that the patch supports Unicode scripts 
https://en.wikipedia.org/wiki/Script_%28Unicode%29 but I am also interested in 
support for Unicode blocks https://en.wikipedia.org/wiki/Unicode_block

Code supporting the latter is at https://github.com/nagisa/unicodeblocks

I could not quite make out whether the patch also supports Unicode blocks. If 
not, should that be requested in a separate issue?

Furthermore, support for Unicode scripts and blocks should be updated each time 
a new version of the Unicode standard is published. Someone should check 
whether the latest patch needs to be updated to the latest version of Unicode, 
not only for this issue but for each release of Python.
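
To illustrate what block support could look like, here is a minimal sketch 
of a lookup over sorted block ranges. The function name block() and the 
BLOCKS table are assumptions for illustration only; the table holds just four 
well-known ranges, whereas a real implementation would be generated from the 
Unicode data files for each release.

```python
from bisect import bisect_right

# Tiny hand-written sample (start, end, name), sorted by start.
# A real table would be generated from the Unicode Blocks.txt file.
BLOCKS = [
    (0x0000, 0x007F, "Basic Latin"),
    (0x0080, 0x00FF, "Latin-1 Supplement"),
    (0x0370, 0x03FF, "Greek and Coptic"),
    (0x0400, 0x04FF, "Cyrillic"),
]
_STARTS = [start for start, _, _ in BLOCKS]


def block(char):
    """Return the block name for char, or None if it is not in the sample."""
    cp = ord(char)
    # Find the last block whose start is <= cp, then check its end.
    i = bisect_right(_STARTS, cp) - 1
    if i >= 0:
        start, end, name = BLOCKS[i]
        if cp <= end:
            return name
    return None


print(block("ß"))
print(block("λ"))
```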

--

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue6331
___



[issue16684] Unicode property value abbreviated names and long names

2012-12-23 Thread Pander

Pander added the comment:

Attached is the requested proof-of-concept script.

--
Added file: 
http://bugs.python.org/file28405/create-unicodedata-dicts-prop-value-alias-20121223.py

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue16684
___



[issue6331] Add unicode script info to the unicode database

2012-12-14 Thread Pander

Pander added the comment:

Please, also consider reviewing functionality offered by:
  http://pypi.python.org/pypi/unicodescript/
and
  http://pypi.python.org/pypi/unicodeblocks/
which could be used to improve and extend the proposed patch.

--
nosy: +PanderMusubi

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue6331
___



[issue16681] Documentation 'bidirectional category' should be 'bidirectional class' in unicodedata package

2012-12-14 Thread Pander

New submission from Pander:

The documentation of the unicodedata package at
  docs.python.org/3/library/unicodedata.html
refers to 'bidirectional category'; this should be 'bidirectional class'. 
Please see
  www.unicode.org/Public/UNIDATA/PropertyValueAliases.txt
where only bidirectional class is used.

--
components: Unicode
messages: 177473
nosy: PanderMusubi, ezio.melotti
priority: normal
severity: normal
status: open
title: Documentation 'bidirectional category' should be 'bidirectional class' 
in unicodedata package
type: enhancement
versions: Python 3.5

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue16681
___



[issue16684] Unicode property value abbreviated names and long names

2012-12-14 Thread Pander

New submission from Pander:

The package unicodedata
  http://docs.python.org/3/library/unicodedata.html
supports looking up property values, namely general category, bidirectional 
class and East Asian width, for Unicode characters:
  unicodedata.category(chr)
  unicodedata.bidirectional(chr)
  unicodedata.east_asian_width(chr)

The abbreviated name of the specific category is returned. However, for certain 
applications it is important to be able to get from the abbreviated name to the 
long name and vice versa.
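
For reference, a short example of what the existing functions return for a 
single character. Only abbreviated values come back, which is the gap this 
issue is about.

```python
import unicodedata

ch = "A"  # LATIN CAPITAL LETTER A

# Each call returns only the abbreviated property value.
print(unicodedata.category(ch))          # abbreviated general category
print(unicodedata.bidirectional(ch))     # abbreviated bidirectional class
print(unicodedata.east_asian_width(ch))  # abbreviated East Asian width
```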

The data needed to do this can be found at
  http://www.unicode.org/Public/UNIDATA/PropertyValueAliases.txt
under sections
  # General_Category (gc)
  # Bidi_Class (bc)
  # East_Asian_Width (ea)
Use only the second (abbreviated name) and third (long name) fields, ignoring 
other fields and possible comments.

For general category, also support translation back and forth of the one-letter 
abbreviations, which are groups of the two-letter general category 
abbreviations sharing the same initial letter.

Please extend this package with a way of translating back and forth between 
abbreviated name and long name for property values defined in Unicode for 
general category, bidirectional class and East Asian width. This functionality 
should be independent of retrieving the abbreviated names for Unicode 
characters, as is available now, and should be accessible via separate methods 
or dictionaries in which developers can perform lookups themselves.

Implementing the functionality requested in this issue allows Python developers 
to get from an abbreviated property value to a meaningful property value name 
and vice versa without having to retrieve this information from the Unicode 
Consortium and/or shipping this information with their code with the risk of 
using outdated information.
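
The requested lookups can be sketched as a small parser that builds the 
dictionaries from text in the PropertyValueAliases.txt layout. This is a 
sketch under stated assumptions: parse_aliases is a hypothetical name, and 
SAMPLE is a short hand-written sample in the file's format rather than the 
full file, which a real implementation would read at build time.

```python
import unicodedata

# Hand-written sample in the "property ; short ; long [; extra]" layout.
SAMPLE = """\
# General_Category (gc)
gc ; L  ; Letter            # Ll | Lm | Lo | Lt | Lu
gc ; Ll ; Lowercase_Letter
gc ; Lu ; Uppercase_Letter
gc ; Nd ; Decimal_Number ; digit
# Bidi_Class (bc)
bc ; AL ; Arabic_Letter
bc ; L  ; Left_To_Right
# East_Asian_Width (ea)
ea ; Na ; Narrow
ea ; W  ; Wide
"""


def parse_aliases(text, properties=("gc", "bc", "ea")):
    """Return (short_to_long, long_to_short) dicts keyed by property."""
    short_to_long = {p: {} for p in properties}
    long_to_short = {p: {} for p in properties}
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments
        if not line:
            continue
        fields = [f.strip() for f in line.split(";")]
        if len(fields) < 3 or fields[0] not in short_to_long:
            continue
        # Use only the second and third fields; ignore any further aliases.
        prop, short, long_name = fields[0], fields[1], fields[2]
        short_to_long[prop][short] = long_name
        long_to_short[prop][long_name] = short
    return short_to_long, long_to_short


short_to_long, long_to_short = parse_aliases(SAMPLE)
print(short_to_long["gc"][unicodedata.category("ß")])
print(long_to_short["bc"]["Left_To_Right"])
```

Keeping the dictionaries per property avoids clashes such as L, which is a 
general category group and also a bidirectional class.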

--
components: Unicode
messages: 177476
nosy: PanderMusubi, ezio.melotti
priority: normal
severity: normal
status: open
title: Unicode property value abbreviated names and long names
type: enhancement
versions: Python 3.5

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue16684
___



[issue6331] Add unicode script info to the unicode database

2012-12-14 Thread Pander

Pander added the comment:

The latest version of the respective sources can be found here:
  https://github.com/ConradIrwin/unicodescript
and here:
  https://github.com/simukis/unicodeblocks

--

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue6331
___



[issue16684] Unicode property value abbreviated names and long names

2012-12-14 Thread Pander

Pander added the comment:

I myself have a lot of Python applications that process font files and interact 
with fonttools and FontForge, which are both written in Python too. As you also 
have your own dict for this purpose, and probably other people do too, it would 
be justified to add these three small dicts to the standard lib, especially 
since this package in the standard lib follows the definitions from the Unicode 
Consortium.

When this is shipped in one package, developers will always have an in-sync 
translation from abbreviated names to long names and vice versa. Over the last 
years I needed to adjust my dicts regularly for the definitions added by the 
Unicode Consortium which are supported by stdlib.

At the moment, translation from Unicode codes U+1234 to human-readable Unicode 
names and vice versa is already offered. Providing human-readable names 
for the property values is a service of the same level and will be catering to 
approximately the same user group.

If you agree that these dicts can be added I am willing to provide a script 
that will parse the aforementioned file.

--

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue16684
___



[issue5894] Lookup of localised language name by ISO 639 language code and reverse look up

2009-05-01 Thread pander

New submission from pander pan...@users.sourceforge.net:

Hi all,

Does a module exist that can look up localised language names for ISO
639 language codes? If not, could one be constructed according to (sort
of) the following specifications?

1) Look up a localised language name for an ISO 639-1 language code for
a required parameter named code, providing the language code, and an
optional parameter named lang, providing the output language code, which
by default is set to 'en'. Example input and output would be:

iso6391name(code='en') = 'English'
iso6391name(code='zu', lang='en') = 'Zulu'
iso6391name(code='zu', lang='nl') = 'Zoeloe'
iso6391name(code='nl', lang='en') = 'Dutch'

2) Reverse lookup of 1) from localised language to language code, e.g.:
iso6391code(name='English') = 'en'
iso6391code(name='Zulu', lang='en') = 'zu'
iso6391code(name='Zoeloe', lang='nl') = 'zu'
iso6391code(name='Dutch', lang='en') = 'nl'

3) Look up native language name for an ISO 639-1 language code for a
required parameter named code, providing the language code. Example
input and output would be:

iso6391native(code='en') = 'English'
iso6391native(code='nl') = 'Nederlands'

4) Retrieving a tuple with all language codes. Example output would be:

iso6391codes() = ('aa', 'ab', ..., 'zu')

5) Retrieving a tuple with all localised language names for an optional
parameter named lang, providing the output language code, which by
default is set to 'en'. Example input and output would be:

iso6391names() = ('Afar', 'Abkhazian', ..., 'Zulu')
iso6391names(lang='nl') = ('Afar', 'Abchazisch', ..., 'Zoeloe')

6), 7), 8), 9), 10) Something similar for ISO 639-2 language codes

11), 12), 13), 14), 15) Something similar for ISO 639-3 language codes
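
Points 1) to 4) can be sketched as follows. This is a minimal illustration 
only: the function names follow the specification above, while the NAMES and 
NATIVE tables are assumptions that contain nothing beyond the example values 
listed there. A real module would load complete, maintained data.

```python
# NAMES[lang][code] is the name of language `code` written in language `lang`.
# Only the example values from the specification are included.
NAMES = {
    "en": {"en": "English", "nl": "Dutch", "zu": "Zulu"},
    "nl": {"zu": "Zoeloe"},
}
NATIVE = {"en": "English", "nl": "Nederlands"}


def iso6391name(code, lang="en"):
    """1) Localised language name for an ISO 639-1 code."""
    return NAMES[lang][code]


def iso6391code(name, lang="en"):
    """2) Reverse lookup from localised language name to code."""
    for code, localised in NAMES[lang].items():
        if localised == name:
            return code
    raise KeyError(name)


def iso6391native(code):
    """3) Native language name for an ISO 639-1 code."""
    return NATIVE[code]


def iso6391codes():
    """4) Tuple with all language codes known to the sample data."""
    return tuple(sorted(NAMES["en"]))


print(iso6391name(code="zu", lang="nl"))
print(iso6391code(name="Dutch", lang="en"))
print(iso6391native(code="nl"))
print(iso6391codes())
```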

sources:
http://en.wikipedia.org/wiki/List_of_ISO_639-1_codes
http://nl.wikipedia.org/wiki/Lijst_van_ISO_639-1-codes

I have a very simple implementation for English and Dutch but figured
that such a module should be available for all developers in all
languages. Hence this request to build something from scratch in the
Python localisation community.

Implementing functionality as described above will boost the ease of
internationalisation and localisation of Python applications.

Please reply to this issue for constructive feedback.

Thanks,

Pander

PS   The original post in i18n-sig is here:
  http://mail.python.org/pipermail/i18n-sig/2009-April/002117.html

--
messages: 86867
nosy: pander
severity: normal
status: open
title: Lookup of localised language name by ISO 639 language code and reverse 
look up
type: feature request

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue5894
___