Re: [Python-Dev] What does a double coding cookie mean?

Glenn Linderman Sat, 19 Mar 2016 10:38:53 -0700

On 3/19/2016 8:19 AM, Serhiy Storchaka wrote:

On 16.03.16 08:03, Serhiy Storchaka wrote:
On 15.03.16 22:30, Guido van Rossum wrote:
I came across a file that had two different coding cookies -- one on
the first line and one on the second. CPython uses the first, but mypy
happens to use the second. I couldn't find anything in the spec or
docs ruling out the second interpretation. Does anyone have a
suggestion (apart from following CPython)?
Reference: https://github.com/python/mypy/issues/1281
There is similar question. If a file has two different coding cookies on
the same line, what should win? Currently the last cookie wins, in
CPython parser, in the tokenize module, in IDLE, and in number of other
code. I think this is a bug.
I just tested with Emacs, and it looks that when specify differentcodings on two different lines, the first coding wins, but whenspecify different codings on the same line, the last coding wins.
Therefore current CPython behavior can be correct, and the regularexpression in PEP 263 should be changed to use greedy repetition.

Just because emacs works that way (and even though I'm an emacs user),that doesn't mean CPython should act like emacs.

(1) CPython should not necessarily act like emacs, unless the codingsyntax exactly matches emacs, rather than the generic coding thatCPython interprets, that matches emacs, vim, and other similar thingsthat both emacs and vim would ignore.(1a) Maybe if a similar test were run on vim with its syntax, and italso works the same way, then one might think it is a trend worthfollowing, but it is not clear to this non-vim user that vim syntaxallows more than one coding specification per line.

(2) emacs has no requirement that the coding be placed on the first twolines. It specifically looks at the second line only if the first linehas a “ #! ” or a “ '\" ” (for troff). (according to docs, notexperimentation)

(3) emacs also allows for Local Variables to be specified at the end ofthe file. If CPython were really to act like emacs, then it would needto allow for that too.

(4) there is no benefit to specifying the coding twice on a line, itonly adds confusion, whether in CPython, emacs, or vim.(4a) Here's an untested line that emacs would interpret as utf-8, andCPython with the greedy regulare expression would interpret as latin-1,because emacs looks only between the -*- pair, and CPython ignores that.

  # -*- coding: utf-8 -*- this file does not use coding: latin-1

_______________________________________________
Python-Dev mailing list
[email protected]
https://mail.python.org/mailman/listinfo/python-dev
Unsubscribe: 
https://mail.python.org/mailman/options/python-dev/archive%40mail-archive.com

Re: [Python-Dev] What does a double coding cookie mean?

Reply via email to