None of this is even about relative imports. Absolute imports are also
broken between the two modes of execution, as I tried to demonstrate
with my project structure. The *whole* import system breaks.
On 2020-01-12 3:12 p.m., Brendan Barnwell wrote:
On 2020-01-11 23:34, Steven D'Aprano wrote:
On Sun, Jan 12, 2020 at 11:59:20AM +1100, Chris Angelico wrote:
>The biggest difference is that scripts can't do relative imports.
How is that relevant? People keep mentioning minor differences between
different ways of executing different kinds of entities (scripts,
packages, submodules etc) but not why those differences are important or
why they would justify any change in the way -m works.
I don't think I endorse the details of the OP's proposal, but I do
agree that the process for executing Python files has some irritating
warts. In fact, I would say the problem is precisely that a
difference exists between running a "script" and a "module". So let
me explain why I think this is annoying.
The pain point is relative imports. The docs at
https://docs.python.org/3/reference/import.html#packages say:
"You can think of packages as the directories on a file system and
modules as files within directories, but don’t take this analogy too
literally since packages and modules need not originate from the file
system."
The basic problem is that the overwhelming majority of packages
and modules DO originate from the filesystem, and so people naturally
want to be able to use the filesystem directly to represent package
structure, REGARDLESS OF HOW OR WHETHER THE FILES ARE RUN OR IMPORTED.
I'm sorry to put that in caps but that is really the fundamental
issue. People want to be able to write something like "from . import
stuff" in a file, and know that that will work purely based on the
filesystem location in which that file is situated, regardless of how
the file is "accessed" by Python (i.e., as a module, script, program,
whatever you want to call it).
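For instance, here is a minimal sketch (the package and module names
are made up) of the same file behaving differently depending on how it
is reached:

    pkg/
        __init__.py
        stuff.py
        main.py          # contains: from . import stuff

    $ python -m pkg.main     # works: main.py runs as the module pkg.main
    $ python pkg/main.py     # fails on recent CPython with:
                             # ImportError: attempted relative import
                             # with no known parent package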
In other words, what non-expert users expect is that if there is a
directory called `foo` with a subdirectory `bar` with some more files,
that alone should be sufficient to establish that `foo` is a package
with `bar` as a subpackage and the other files available as modules
like `foo.stuff` and `foo.bar.morestuff`. (Some users perhaps
understand that the folders should have an __init__.py to be
considered part of the package, but I think even this is less well
understood in the era of namespace packages.) It should not matter
exactly how you "get to" these files in the first place --- that is,
it should not matter whether you are importing a file or running one
"as a script" or "as a module", nor should it matter precisely which
file you run. The mere fact that a file "a.py" exists and is in the
same directory with a file called "b.py" should be enough for "a.py"
to use "from . import b" and have it work, always.
Now, I realize that there are various reasons why it doesn't work
this way. Basically these reasons boil down to the fact that although
most packages are transparently represented by their file/directory
structure, there also exist namespace packages, which can have a
more diffuse file/directory structure, and it's also possible to
create "virtual" packages that have no filesystem representation at all.
But the documentation is a long, long way from making this clear.
For instance, it says this:
"For example, the following file system layout defines a top level
parent package with three subpackages:"
But that's not true! The filesystem layout itself does not define
the package! For relative import purposes, it only "counts" as a
package if it's imported, not if a file in it is run directly.
Otherwise it's just some files on disk, and if you run one of them "as
a script", no package exists as far as Python is concerned.
The documentation does go on to describe how __main__ works and
how the file's __name__ is set if it's run, and so on. But it does
all this using the term "package", which is a trap for the unwary,
because they already think package means "a directory with a certain
structure" and not "something you get via the `import` statement".
Ultimately, the problem is that users (especially beginners) want
to be able to put some files in a folder and have it work as a package
as long as they are working locally in that folder --- without messing
with sys.path or "installing" anything. In other words they want to
create a directory and put "my_script.py" in there, and then put
"mylib.py" in there and have the former use relative imports to get
stuff from the latter. But they can't.
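That is, a sketch of the failure, given nothing but:

    project/
        my_script.py     # contains: from . import mylib
        mylib.py

    $ cd project
    $ python my_script.py
    ImportError: attempted relative import with no known parent package

    # The absolute form "import mylib" does work here, because the
    # script's own directory is prepended to sys.path; but that only
    # holds while everything stays in one flat directory.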
Personally, I am in agreement that this behavior is extremely
bothersome. (In particular, the fact that __name__ becomes __main__
when the script is run, but is set to its usual name when it is
imported, was a poor design decision that creates confusing
asymmetries between the run and import cases.) It makes it
unnecessarily difficult to write small, self-contained programs which
make use of relative imports. Yes, it is better to write a setup.py
and specify the dependencies, and blah blah, but for small tasks
people often simply don't want to do that. They want to unzip their
files into a directory and have it work, without notifying Python
about installing anything or putting anything on the path.
As far as solutions, I think an idea worth considering would be a
new command-line option similar to "-m" which effectively says "run
this FILE that I am telling you, but pretend it is in whatever package
it seems to be in based on the directory structure". So suppose
the option is -f for "file as module". It means if I do "python -f
script.py", it would run that file, but correctly set up __package__
and so on so that "script.py" (and other files it imports) would be
able to use relative imports. Maybe that would mean they could
unexpectedly import higher than their level (i.e., use relative-import
dots going above the actual top level of the package), or maybe the
relative imports would be local to the directory where "script.py" is
located, or maybe you could even specify the relative import "root" in
a separate option, like "python -f script.py -r my/package/root".
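To make the idea concrete, here is a rough, purely illustrative sketch
of what such an option might do internally (no such flag exists, and
run_file_as_module is a made-up name): climb the directory tree while
__init__.py files mark package directories, put the resulting root on
sys.path, and then run the file as a submodule so relative imports work:

    import os
    import sys
    import runpy

    def run_file_as_module(path):
        """Run `path` as if invoked with -m under its on-disk package name."""
        path = os.path.abspath(path)
        parts = [os.path.splitext(os.path.basename(path))[0]]
        pkg_dir = os.path.dirname(path)
        # climb while each directory looks like a package on disk
        while os.path.exists(os.path.join(pkg_dir, "__init__.py")):
            parts.insert(0, os.path.basename(pkg_dir))
            pkg_dir = os.path.dirname(pkg_dir)
        sys.path.insert(0, pkg_dir)    # make the package root importable
        runpy.run_module(".".join(parts), run_name="__main__", alter_sys=True)

    if __name__ == "__main__":
        run_file_as_module(sys.argv[1])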
The basic point is that people want to use relative imports
without including boilerplate code to put themselves on sys.path, and
without caring about whether the file is run directly or imported as a
module, and without "installing" anything, and in general without
thinking about anything except the local directory structure in which
the file they are running is situated.
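(Today the usual substitute is boilerplate along these lines at the top
of the entry-point file; a sketch, with mypkg standing in for whatever
the real package is called:)

    import os
    import sys
    # put the package's parent directory on sys.path by hand
    sys.path.insert(0, os.path.dirname(os.path.dirname(os.path.abspath(__file__))))
    from mypkg import mylib    # the absolute import now works; relative
                               # imports still fail, since __package__ is unset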
I realize that in many ways this is sloppy and you could say
"don't do that", but I think if that is the position, the
documentation needs to be seriously tightened up. In particular it
needs to be made clear --- at every single mention! --- that "package"
refers only to something that is imported and not to a file's
"identity" based on its filesystem location.
Just over six years ago I wrote an answer about this on
StackOverflow
(https://stackoverflow.com/questions/14132789/relative-imports-for-the-billionth-time/14132912#14132912)
that continues to get upvotes and comments of the form "wow why isn't
this explained in the documentation" almost daily. I hope it is clear
that, even if we want to leave the behavior exactly as it is, there is
a major problem with how people think they can use relative imports
based on the official documentation.