PEP-8, Line Length, And All That

Thomas Passin Fri, 20 Jan 2023 22:35:27 -0800

In another thread ("Improvement to imports, what is a better way ?")there was a lot of talk about line length, PEP-8, etc. I realized thatone subject did not really come up, yet it can greatly affect the thingswe were talking about.

I'm referring to the design of the functions, methods, and classes.When they are well designed, or more likely, refactored over and overagain, they can lead to code that reads almost like pseudo-code. Here'san example from one of my little projects. You don't need to knowanything about the details of the functions or exactly what a "root" isto see what I mean.


    fileinfo = language, path, ext = getExeKind(root)
    processor = getProcessor(*fileinfo)
    runfile(path, processor, ext)

[Please don't try to guess exactly what this code needs to do or explainhow it could be done differently. That's not the point.]

In words, given a source of information (root), we can get someinformation about a file. Given that information, we can find asuitable processor for it. And given that processor, we can "run" it.

When I first put this together, the functionality was not cleanlyseparated, the various functions did more than one thing, and I had somelong lines. Each line did not necessarily convey cleanly what it was doing.

It took many iterations while I learned how to make the details workbefore I was able to see how to structure this part of the functionalityinto these three nice, clear lines.

In fact, the restructuring isn't quite finished, because the near-finalversion of runfile() does not actually use "ext" (the extension of afile) any more. Its presence is leftover from when runfile() tried todo too much.

Why did I assign the (language, path, ext) tuple to fileinfo? Becauseit was easier and shorter when used as the argument for getProcessor(),and I thought it conveyed the intent more clearly than the tuple.


Some people might think to write

processor = getProcessor(*getExeKind(root))

Oops, that doesn't expose path and ext. Well, not if we are going toavoid using a walrus operator, anyway, and if we used it, well,readability would go out the window.

In a different context, a "fluent" style can be very readable andpleasant to work with. Here are some lines from a Windows batch filethat invokes a small Python library written in a fluent style. Thefirst line defines a processing pipeline, and the second passes a datafile to it ("xy1" is the feeble name for a batch file that calls thefluent processing library):


    set process=low(30).diff().norm_last_n(%N%).write()
    type "c:\data\%1" |xy1 %process% >> %temp%

In words, we smooth the data with a LOWESS smooth using a window widthof 30, differentiate it, normalize it in a specific way (norm_last_n),and write it out (to stdout).


Not shown here, we eventually pipe it to a plotting program.

I intended from the start that this library would work in a "fluent"manner. It took a lot of iterations before I worked out how to designit so the it could be invoked in a clean, simple way by a batch file.

All this is to say that program design and refactoring can play a largerole in writing code that can be understood relatively easily, followthe style guidelines as closely as possible, and be as easy as possibleto maintain.

--
https://mail.python.org/mailman/listinfo/python-list

PEP-8, Line Length, And All That

Reply via email to