Re: Python Statements/Keyword Localization

Terry Reedy Wed, 25 Nov 2009 13:28:54 -0800

Emanuele D'Arrigo wrote:

Greetings everybody,


some time ago I saw a paper that used an XSL transformation sheet to
transform (if I remember correctly) a Chinese xml file (inclusive of
Chinese-script XML tags) into an XHTML file.

More recently you might have all heard how the ICANN has opened up the
way for non-latin characters in domain names, so that we'll soon start
seeing URLs using Russian, Asian and Arabic characters.

In this context I was wondering if there has ever been much thought
about a mechanism to allow the localization not only of the strings
handled by python but also of its built-in keywords, such as "if",
"for", "while", "class" and so on.

There have been various debates and discussions on the topic. There hasbeen slow movement away from ascii-only in user code. (But not in thestdlib, nor will there be there.)

1. Unicode data type.
2. Unicode allowed in comment and string literals.

This required input decoding and coding cookie. This lead, I believesomewhat accidentally, to3. Extended ascii (high bit set, for other European chars in variousencodings) for identifiers.

4 (In 3.0) unicode allowed for identifiers

Here is a version of the anti-customized-keyword position. Python isdesigned to be read by people. Currently, any programmer in the worldcan potentially read any Python program. The developers, especiallyGuido, like this. Fixed keywords are not an undue burden because anyeducated person should learn to read Latin characters a-z,0-9. andPython has an intentionally short list that the developers are loath tolengthen.

Change 4 above inhibits universal readability. But once 3 happened andstr became unicode, in 3.0, it was hard to say no to this.

A 'pro' argument: Python was designed for learning and is good for thatand *is* used in schools down to the elementary level. But kids cannotbe expected to know foreign alphabets and words whill still learningtheir own.


> For example, the following English-

based piece of code:

class MyClass(object):
    def myMethod(self, aVariable):
         if aVariable == True:
            print "It's True!"
         else:
            print "It's False!"

would become (in Italian):

classe LaMiaClasse(oggetto):
    def ilMioMetodo(io, unaVariabile)
         se unaVariabile == Vero:
             stampa "E' Vero!"
         altrimenti:
             stampa "E' Falso!"

I can imagine how a translation script going through the source code
could do a 1:1 keyword translation to English fairly quickly but this
would mean that the runtime code still is in English and any error
message would be in English.

This is currently seen as a reason to not have other keywords: it willdo no good anyway. A Python programmer must know minimal English and thekeywords are the least of the problem.

I can imagine that there could be a mechanism for extracting andreplacing error messages with translations, like there is for Pythoncode, but I do not know if it will even happen with haphazard volunteerwork or will require grant sponsorship.

I can also imagine that it should be
possible to "simply" recompile python to use different keywords, but
then all libraries using the English keywords would become
incompatible, wouldn't they?

In this context it seems to be the case that the executable would have
to be able to optionally accept -a list- of dictionaries to internally
translate to English the keywords found in the input code and at most -
one- dictionary to internally translate from English output messages
such as a stack trace.

What do you guys think?

I would like anyone in the world to be able to use Python, and I wouldlike Python programmers to potentially be able to potentially read anyPython code and not have the community severely balkanized. To me, thiswould eventually mean both native keywords and tranliteration from otheralphabets and scripts to latin chars. Not an easy project.


Terry Jan Reedy

--
http://mail.python.org/mailman/listinfo/python-list

Re: Python Statements/Keyword Localization

Reply via email to