Re: [Python-Dev] PEP 399: Pure Python/C Accelerator Module Compatibility Requirements

Terry Reedy Wed, 06 Apr 2011 19:44:31 -0700

On 4/6/2011 2:54 PM, Terry Reedy wrote:

> I believe that at the time of that decision, the Python [heapq] code was only
> intended for humans, like the Python (near) equivalents in the itertools
> docs to C-coded itertools functions. Now that we are aiming to have
> stdlib Python code be a reference implementation for all interpreters,
> that decision should be revisited.


OK so far.

> Either the C code should be generalized to sequences or
> the Python code specialized to lists, making sure the doc matches either way.

After rereading the heapq doc and .py file and thinking some more, I retract this statement for the following reasons.

1. The heapq doc clearly states that a list is required. It leaves the behavior for other types undefined. Let it be so.

2. Both _heapq.c (or its actual name) and heapq.py meet (I presume) the documented requirements and pass (or would pass) a complete test suite based on using lists as heaps. In that regard, both are conformant and should be considered 'equivalent'.

3. _heapq.c is clearly optimized for speed. It allows a list subclass as input and will heapify such, but it ignores a custom __getitem__. My informal test on a list(range(9999999)) shuffled with random.shuffle shows that heapify is over 10x as fast as .sort(). Let it be so.

4. When I suggested changing heapq.py, I had forgotten that heapq.py defines several functions rather than a wrapper class with methods. I was thinking of putting a type check in .__init__, where it would be applied once per heap (and possibly bypassed), and could easily be removed. Instead, every function would require a type check for every call. This would be too obnoxious to me. I love duck typing and held my nose a bit when suggesting a one-time type check.

5. Python already has an "extras allowed" principle. In other words, an implementation does not have to bother to enforce documented restrictions. For one example, the Python 2 manuals restrict identifiers to ASCII letters. CPython 2 (at least in recent versions) actually allows extended ASCII letters, as in latin-1. For another, namespaces (globals and attribute namespaces), by their name, only need to map identifiers to objects. However, CPython uses general dicts rather than specialized string dicts with validity checks. People have exploited both loopholes. But those who have should not complain to us if such code fails on a different implementation that adheres to the doc.
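The observations in point 3 can be probed informally with a sketch like the one below. CountingList, is_heap, and time_heapify_vs_sort are hypothetical names introduced only for illustration, the list sizes are much smaller than the one quoted in point 3, and no particular timing ratio is implied. The expectation that the counter stays at zero assumes that the C accelerator is in use.

```python
import heapq
import random
import timeit


class CountingList(list):
    """Hypothetical list subclass that counts calls to its __getitem__."""

    def __init__(self, *args):
        super().__init__(*args)
        self.getitem_calls = 0

    def __getitem__(self, index):
        self.getitem_calls += 1
        return super().__getitem__(index)


def is_heap(items):
    """Return True if items satisfies the min-heap invariant."""
    plain = list(items)  # check the order on a plain list copy
    return all(plain[(i - 1) // 2] <= plain[i] for i in range(1, len(plain)))


data = list(range(10_000))
random.shuffle(data)  # shuffles in place and returns None

heap = CountingList(data)
heapq.heapify(heap)
# Assumption: with the C accelerator this stays 0; the Python-coded
# functions would go through __getitem__ and give a positive count.
calls_during_heapify = heap.getitem_calls
heap_ok = is_heap(heap)


def time_heapify_vs_sort(n=100_000, repeat=5):
    """Return (heapify_seconds, sort_seconds); each timing includes a list copy."""
    base = list(range(n))
    random.shuffle(base)
    t_heapify = min(timeit.repeat(lambda: heapq.heapify(base.copy()),
                                  number=1, repeat=repeat))
    t_sort = min(timeit.repeat(lambda: base.copy().sort(),
                               number=1, repeat=repeat))
    return t_heapify, t_sort


if __name__ == "__main__":
    print("heap invariant holds:", heap_ok)
    print("__getitem__ calls during heapify:", calls_during_heapify)
    print("heapify vs sort (seconds):", time_heapify_vs_sort())
```

Either way the result is a valid heap; only the route taken to read the items differs, which is the kind of implementation detail the doc leaves undefined.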

I think the Language and Library references should start with something a bit more specific than at present:

"The Python x.y Language and Library References define the Python x.y language, its builtin objects, and standard library. Code written to these docs should run on any implementation that includes the features used. Code that exploits or depends on any implementation-specific feature or behavior may not be portable."

_x.c and x.py are separate implementations of module x. I think they should be subject to the same disclaimer.

Therefore, I currently think that the only change needed for heapq (assuming both versions pass complete tests as per the doc) is an explanation at the top of heapq.py that goes something like this:

"Heapq.py is a reference implementation of the heapq module for both humans and implementations that do not have an accelerated version. For CPython, most of the functions are replaced by much faster C-coded versions.

Heapq is documented to require a Python list as input to the heap functions. The C functions enforce this restriction. The Python versions do not and should work with any mutable random-access sequence. Should you wish to run the Python code with CPython, copy this file, give it a new name, delete the following lines:


try:
    from _heapq import *
except ImportError:
    pass

make any other changes you wish, and do not expect the result to be portable."
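For experimenting without editing a copy of the file, the same effect can be approximated at run time. The following is only a sketch, not part of the proposed heapq.py text: load_pure_python_heapq and the module name "pure_heapq" are hypothetical, and it assumes heapq.__file__ points at a readable heapq.py whose accelerator imports are all guarded by "except ImportError".

```python
import collections
import importlib.util
import sys

import heapq  # the normal import; used here only to locate heapq.py


def load_pure_python_heapq(name="pure_heapq"):
    """Execute heapq.py as a separate module with the _heapq accelerator blocked.

    The module name is hypothetical; the copy is not registered in sys.modules.
    """
    missing = object()
    saved = sys.modules.get("_heapq", missing)
    # A None entry in sys.modules makes 'from _heapq import ...' raise ImportError.
    sys.modules["_heapq"] = None
    try:
        spec = importlib.util.spec_from_file_location(name, heapq.__file__)
        module = importlib.util.module_from_spec(spec)
        spec.loader.exec_module(module)
    finally:
        # Restore the previous state so ordinary imports are unaffected.
        if saved is missing:
            del sys.modules["_heapq"]
        else:
            sys.modules["_heapq"] = saved
    return module


pure_heapq = load_pure_python_heapq()

# A deque is not a list, but it offers append(), pop(), len() and indexing,
# which is what the Python-coded functions rely on.
heap = collections.deque([5, 3, 8, 1, 9, 2])
pure_heapq.heapify(heap)
pure_heapq.heappush(heap, 0)
drained = [pure_heapq.heappop(heap) for _ in range(len(heap))]
print(drained)
```

On CPython with the accelerator present, passing the same deque to the regular heapq.heapify would be expected to fail with a TypeError, in line with the documented list requirement. Code that relies on the deque working should not be expected to be portable.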


--
Terry Jan Reedy

