[Python-Dev] Critique of PEP 657 -- Include Fine Grained Error Locations in Tracebacks

Mark Shannon Mon, 17 May 2021 06:15:59 -0700

Hi everyone,

I fully agree with the rationale for the PEP, better error messagesalways help.However, I think the proposed implementation could be misleading in somecases and wastes memory.



Use one position, not one-and-a-half positions.
-----------------------------------------------

The main problem with PEP 657, IMO, is that it wants to present thelocation of an error as a pair of positions, but without the line numberof the second position.

Consequently it ends up with one and a half positions, not two.
This does not work well for a number of reasons.

1.  Errors spanning multiple lines.

Consider:

    (i1 + i2 +
     s1
     )

Where i1, i2 are integers and s1 is a string.

With a single location, the error points to the second `+`. PEP 657would highlight the whole line, `i1 + i2 +`, making it unclear where theerror occurred.



2. Repeated binary operations on the same line.

A single location can also be clearer when all the code is on one line.

i1 + i2 + s1

PEP 657:

i1 + i2 + s1
^^^^^^^^^^^^

Using a single location:

i1 + i2 + s1
        ^

3. Tracking locations in the compiler.

While nicer locations for errors is great, it won't be popular if it hasa negative impact on performance.Locations need to tracked through the compiler. The simpler thelocation, the easier this is to do correctly without a negativeperformance impact.It is already tricky to do this correctly with just line numbers becausewe have both source that compiles to no bytecodes (try, pass) andbytecodes that have no source (implicit return None and except cleanups).

A single location can still be presented as a whole token, as tokenizinga single line of code is easy and fast. So when presenting an error, thewhole token can be highlighted.


E.g:

NameError
    name
    ^^^^

Compression
-----------

PEP 657 proposes that no compression be used, but also mandates the useof a lossy compression scheme (truncating all offsets over 255).I think it would be better to provide an API like PEP 626 and notrestrict the internal format used. In fact, extending or replacing theAPI of PEP 626 seems the best approach to me.


I wouldn't worry about memory consumption.

The code object layout and unmarshalling process needs to bere-implemented for reduced memory use and faster startup:https://github.com/markshannon/faster-cpython/blob/master/tiers.md#tier-0

A few bytes more or less in the location table(s) is inconsequential.

Cheers,
Mark.
_______________________________________________
Python-Dev mailing list -- [email protected]
To unsubscribe send an email to [email protected]
https://mail.python.org/mailman3/lists/python-dev.python.org/
Message archived at 
https://mail.python.org/archives/list/[email protected]/message/KR7ACFCUNMHT4M7R4XNHGRFV27HZBDFD/
Code of Conduct: http://python.org/psf/codeofconduct/

[Python-Dev] Critique of PEP 657 -- Include Fine Grained Error Locations in Tracebacks

Reply via email to