[Python-Dev] Critique of PEP 657

Mark Shannon Wed, 30 Jun 2021 07:00:30 -0700

Hi,

Apologies for my tardiness in doing this, but no one explicitly said itwas too late to critique PEP 657...



Critique of PEP 657 (Include Fine Grained Error Locations in Tracebacks)
------------------------------------------------------------------------

First of all I want to say that I support the goal of improving errormessages. IMO the PEP should be "accepted in principle". I think all ofthe issues below can be fixed while still supporting the general aims ofthe PEP.



The change from points to ranges as locations
---------------------------------------------

Because Python is a procedural language, there is an expectation thatcode executes in a certain order. PEP 626 (Precise line numbers fordebugging and other tools) seeks to guarantee that that expectation is met.

PEP 657 describes how locations for exceptions are to be handled, but isvague on the treatment of locations for tracing, profiling and debugging.

PEP 657 proposes that locations for exceptions be treated as ranges,whereas tracing, profiling and debugging currently treat locations aspoints.

Either this will end in contradictions and confusion should thoselocations disagree, or the locations for tracing, profiling anddebugging must change.

Using the start of a range as the point location for tracing may bemisleading when the operation that causes an exception is on a differentline within that range.


Consider this example:
https://github.com/python/cpython/blob/main/Lib/test/test_compile.py#L861

This might seem like a contrived case, but it is based on a real bugreport https://bugs.python.org/issue39316


1.  def load_method():
2.      return (
3.          o.
4.          m(
5.              0
6.          )

Currently the call is traced on line 4.

PEP 657 would change the location of the call from line 4 to the range3-6, which would mean that the line of call is no longer tracedseparately (or traced several times). PEP 657 makes no mention of thischange.

The PEP claims that these changes are improvements. Maybe they are, butthey are quite impactful changes which the PEP glosses over. The impacton tools like coverage.py and debuggers should be made clearer. Forexample, how would one set a breakpoint on line 4 above?

There are other languages (e.g. jinja templates) that compile to PythonAST and bytecode. These *might* produce locations that overlap, but arenot nested. The behavior of tracing and debuggers needs to be describedfor those locations.


Backwards Compatibility
-----------------------

PEP 657 claims it is fully backwards compatible, but it cannot be bothbackwards compatible and consistent.There are fundamental differences between using ranges and points aslocations.


Impact on startup time
----------------------

The PEP 657 suggests the impact on startup would be negligible. That isnot quite true. The impact on startup is probably acceptable, but aproper analysis needs to be made.

The increase in size of pyc files ~20% puts an upper bound on theincrease of startup time, but I would expect it to be much less thanthat as loading files from disk is only a fraction of startup.

Currently, startup is dominated by inefficiencies in interpretercreation, unmarshalling and module loading.We plan to reduce these a lot for 3.11, so that the impact of PEP 657 onstartup will be larger (as a ratio) than experiments with 3.10 suggest.


The API
-------

The C API adds three new functions, one each for the end line, startcolumn and end column.This is either slow, as any compressed table needs to be parsed fourtimes, or space inefficient using an uncompressed table.


Opt-out
-------

Allowing opt-out prevents consistent compression of location data,resulting in larger pyc files for those that do not opt-out.The exact semantics, in terms of error formatting, tracing, etc is notdescribed should the user opt-out.


Summary
-------

Overall, there is nothing that blocks acceptance of the PEP inprinciple, but there are quite a few issues that need resolving.



Suggestions
-----------

1. Clarify, in detail, the impact on line-based tools like profilers,coverage.py and debuggers. This should include help on how to use thenew APIs and where using the old APIs might result in behavioral changes.


2. Change the C API to a single function:

int PyCode_Addr2Location(PyCodeObject *co, int addr, int *startline, int*startcolumn, int *endline, int *endcolumn)


3. Drop the opt-out option.

If the extra information is optional, then the compression scheme mustallow for that; making the code more complex and potentially lessefficient. Does opting out use the start of the range, or the old line,as the location?


4. Drop the limitation on column offsets.

The data needs to be compressed anyway, so allowing arbitrary columnoffsets is effectively free.

6. Store all location information in a single table (this applies moreto the implementation than the PEP)Using four separate objects to hold the location info adds a lot ofoverhead for most functions.



Cheers,
Mark.

_______________________________________________
Python-Dev mailing list -- python-dev@python.org
To unsubscribe send an email to python-dev-le...@python.org
https://mail.python.org/mailman3/lists/python-dev.python.org/
Message archived at 
https://mail.python.org/archives/list/python-dev@python.org/message/XNSFU7NTF3EWFQJEGTLCIFNX23BCF7QR/
Code of Conduct: http://python.org/psf/codeofconduct/

[Python-Dev] Critique of PEP 657

Reply via email to