Re: [caiman-discuss] AI Multi-homed Final Code Review

Keith Mitchell Tue, 10 Aug 2010 10:19:23 -0700

 On 08/ 9/10 11:51 AM, [email protected] wrote:

Hi Keith,

Thank you for the sanity-check. More below and to address some comments which came up in talking with you on the phone and in e-mail:

1) The doctest implies one must overload open() to use WANBootInfo(). To
   prevent any confused reading of the doctest, I've explicitly commented
   that overloading open() is only for testing. Further, I've tried using
   open("/some_file_which_does_not_exist"); however, doctest seems to
   execute in Python restricted mode so open()/file() are not available.

It's good to know that about doctest (though it makes sense, in a way, that it would be restricted). The comment will work.


3) For those who might believe a class for "querying" wanboot.conf might
   also allow writing, I've made an explicit comment that the class does
   not allow writing. I tried setting __setattr__ to None but that caused
   other issues which I could not easily resolve.

The comment is valuable; long term, defining __slots__[1] at the class level should prevent any attempts to write arbitrary attributes (but the comment is good enough for now).


[1]http://docs.python.org/reference/datamodel.html#slots
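
A minimal sketch of the __slots__ idea, assuming a hypothetical ReadOnlyInfo class (it is not the WANBootInfo code from ai_sd.py, and the attribute names are made up for illustration):

```python
class ReadOnlyInfo(object):
    """Hypothetical stand-in for a query-only class."""

    # Only the names listed here may ever be set on an instance.
    __slots__ = ("_path",)

    def __init__(self, path):
        self._path = path


info = ReadOnlyInfo("wanboot.conf")
try:
    info.root_server = "http://example.com"  # not listed in __slots__
except AttributeError as err:
    print("rejected:", err)
```

Note that __slots__ only blocks attributes that are not listed; a listed slot such as _path can still be reassigned, so this complements the comment rather than replacing it.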

                            Thank you,
                            Clay

On Fri, 30 Jul 2010, Keith Mitchell wrote:

[snip...]
All my prior comments on shell scripts have been addressed. All comments below are from ai_sd.py, which seems to have gotten rather complicated suddenly.
120,129: Should this be... "method"? (Why is method listed at all? It's right above the line, and any pydoc parser should have a reference to the method name)
This was a carry-over I didn't pay much attention to; but, indeed, these are misspelled and redundant. I've removed them.
405, 418: platform.processor() should provide a more parseable result, particularly for sparc.
Ah yes, indeed. Thank you, I'll use processor().
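
As a rough illustration of that check (is_sparc is a hypothetical helper, not a function from ai_sd.py; the string returned by platform.processor() is platform-dependent and may be empty where it cannot be determined):

```python
import platform


def is_sparc():
    """Return True when platform.processor() reports a SPARC CPU."""
    # Assumption: a SPARC system reports a name beginning with "sparc".
    return platform.processor().lower().startswith("sparc")


print(platform.processor(), is_sparc())
```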
469-509: Would probably be best to move this doctest to a separate PyUnit test. It's a bit verbose to read which significantly reduces its value as a test-that-also-documents-usage - ditto for the __getattr__ function's doctest. Given what's being tested - the opening of a file - there may not really be a need to test this. I trust that the implementation of "open" in Python works.
Ah yes, but the API usage to understand what err and f are and when is useful to know IMO. If anyone doesn't want to see it, even vim supports code folds to hide the doctest - let's keep code and its tests in one place for now.

I don't particularly like to rely on folding to hide the significant number of "noise" lines (especially since a fold will hide everything in the docstring), but I suppose that it will be fine. I think I'm too picky these days...

558-559: Between these lines, and the previous function, that's a lot of effort to: open a file, catch the error, tell the caller there was an error, then have the caller re-raise an error. Let Python's builtin exceptions do what they're made for, and just open the file directly (using a "with" block to automatically close it):
with open(a_file) as the_file:
    my_data = the_file.readlines()
    # process my_data, etc.
# (File closes automatically at end of 'with' block, much like a finally clause)
Yes, but this keeps the actual implementation of the service discovery code consistent be it SPARC or X86 (lines 400-423) which I think is more important. (I'd rather abstract out the differences and present a consistent API than have architecture specific checks for IOError, etc; here if this class has an error it will propagate it and the calling code can then report it as it wishes.)

While I wasn't necessarily implying to do that all "in-line" (within the concept of a platform-generic function or class was my thought), it's not enough of a sticking point that I wish to continue pressing on it. I do feel that the nested exception is harder for a caller to parse, but not significantly so in this case given the usage.

563-564, 576: A __getattr__ function that reads a file twice is very expensive for attribute access. I appreciate the magic of Python used here, but given the limited usage of this class, it seems a bit expensive. A simple method (rather than a class), that reads the file and returns a dictionary of the key/value pairs found in the file would probably be far more efficient and valuable. If it makes it more natural to store that data in a class that has a "read from file" function, then that'd be ok too.
If a full fledged class is required, perhaps the data could be cached in __dict__, and only re-read the file if the file has been updated since the last read. (os.stat(filename).st_mtime)
Yes, this is not very efficient of me; I've simplified the function, however, I still believe caching is a bit overkill since at this time the function is only called once anyway.

That's fine. I do have a tendency to extrapolate and look for a "better solution" when one is not yet needed. But keep the concept in mind for future reference!
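
For future reference, the st_mtime caching idea could be sketched roughly as follows. CachedConf and its get() method are hypothetical names, and the parsing assumes simple key=value lines; it is not the code under review:

```python
import os


class CachedConf(object):
    """Hypothetical key=value reader that re-parses only when the file changes."""

    def __init__(self, path):
        self._path = path
        self._mtime = None  # modification time seen at the last parse
        self._data = {}

    def _load(self):
        mtime = os.stat(self._path).st_mtime
        if mtime != self._mtime:
            # File is new to us or has changed since the last read: re-parse.
            data = {}
            with open(self._path) as conf:
                for line in conf:
                    key, sep, value = line.strip().partition("=")
                    if sep:
                        data[key] = value
            self._data = data
            self._mtime = mtime
        return self._data

    def get(self, key):
        return self._load()[key]
```

Each lookup costs one os.stat() call instead of a full read, which only pays off once the file is queried more than a handful of times.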

Additionally, regardless of implementation, use str.partition rather than split. It's a far 'safer' function for cases of the form "key=value", as it will always return a tuple of length 3. Also, is it truly technically invalid to have multiple entries of the same name in a wanboot.conf file? bootconfchk(1M) seems to accept it, so for consistency, if it's valid, we should as well.
Hrm, that's an interesting question. I hadn't realized that it would accept multiple values. For the places we use this, it can certainly cause us issues. (For example multiple root_server keys causes my fix to net-fs-root to try to download solaris.zlib, et al. multiple times too). As we generate the wanboot.conf file at this time, this is a potential situation but one I see as very unlikely. I've filed bug 16751 - "Multiple value per key in wanboot.conf causes AI clients confusion" to track this.


Sounds good.
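
To illustrate the str.partition point above, a small sketch (parse_line is a hypothetical helper and the sample values are made up):

```python
def parse_line(line):
    """Split a 'key=value' line into (key, value); value is '' if no '='."""
    # partition always returns a 3-tuple: (before, separator, after).
    key, _sep, value = line.strip().partition("=")
    return key, value


print(parse_line("root_server=http://host/path"))
print(parse_line("url=http://host/?a=b"))   # splits on the first '=' only
print(parse_line("no_separator"))           # no '=': value is empty
```

By contrast, "no_separator".split("=") returns a one-element list and "url=http://host/?a=b".split("=") returns three elements, so unpacking either result into two names raises ValueError.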

580: If this stays as a class, please wrap this into the class definition (setting file=None, and then setting wanboot_conf = WANBootInfo.default_wanboot_conf when file is None should do it). Also, use "file_" rather than overriding the builtin "file" (similarly, for readability above when creating a file, use "file_" - or a more descriptive name - rather than "f").
I've added a doc string mention that the file still acquires a value despite being left as None. Certainly since Drew too stumbled on the out-of-class indentation level init() I think you're right, this should stay with the class.


Thanks.
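
The default-path suggestion might look something like this sketch. WANBootInfoSketch is an illustrative stand-in, and the default path value is a placeholder since the real one is not shown in this thread:

```python
class WANBootInfoSketch(object):
    """Illustrative only; the real WANBootInfo in ai_sd.py may differ."""

    # Placeholder default, kept on the class so it lives with its user.
    default_wanboot_conf = "wanboot.conf"

    def __init__(self, file_=None):
        # 'file_' avoids shadowing the Python 2 builtin 'file'.
        if file_ is None:
            file_ = self.default_wanboot_conf
        self.wanboot_conf = file_


print(WANBootInfoSketch().wanboot_conf)
print(WANBootInfoSketch("other.conf").wanboot_conf)
```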

590/600: Perhaps just create a function?
This is to keep parity across SPARC and X86 for discovering properties; so WANBootInfo and DHCPClientInfo are reasonably similar in API.

There's a very distinct difference between the two classes in that for one, you create an instance and access an attribute, and for the other, you're not allowed to create an instance. If this is meant to be a consistent API, then DHCPClientInfo.boot_server perhaps should be "root_server" to match WANBootInfo.root_server. The DHCPClientInfo should probably also be able to be instantiated and r/boot_server should be a property.

(Those are just nits though, or comments to file away to take note of as these classes evolve)

655-680: Expanding on Drew's comments on use of Popen - if it raises an OSError, that means a required program (dhcpinfo) is missing; if it's missing then either there's something corrupt on the image/system, or there's an IPS dependency error. Both are situations that the ai_sd can't do anything about; so the best course of action is to propagate up (at the highest level, e.g. in the if __name__ == "__main__" block, all exceptions should be logged and printed to the user with as much information as possible to assist in evaluating the root cause). Masking the OSError is a bad idea - let it show through, as that will be easier to diagnose if the situation ever arises, and has far richer data than the DHCPError will.
No doubt there are some good ideas here to aid in debugging. However, the code is already structured to use an AI_LOG device and report Popen OSErrors this way. But you have a very valid point I should pass the OSError's message through to aid in diagnosis (does dhcpinfo(1) not exist, have restrictive permissions, etc.?).
I still believe that this should be catchable as coming from the DHCPClientInfo class, as if the consumer isn't requiring the DHCP info, it may wish to simply note and continue; if it wishes to propagate it as long as the necessary info is there it can pass it along to the user (as ai_sd.py does). I think it's good hygiene for the class to report that it had an issue, not potentially cause a higher-level consumer to believe it had an issue (i.e. each layer should catch its own errors -- follow an encapsulating design).

I know I said I'd back off on this, but something about this usage in particular bothers me even more than the WANBootInfo case. I'll try to present my case a little more clearly (this is the last time for this code review - even if nothing changes in this code immediately, perhaps I can sway you away from this type of 'encapsulation' for the future).

By lumping all the exceptions that occur during this function call into a single bucket, you're simply begging for bugs to be masked. There's a reason the Python exception hierarchy is not a flat listing of "ListError", "DictError", "TupleError", etc. It's because callers don't care what the class is that originally raised the error - they care what the *cause* is. That's why the hierarchy has "TypeError," "ValueError," "OSError," and so forth.

Callers will handle errors (or not) based on the cause, not based on the source class. They may need to handle a missing binary separately from a failed call into that binary - even if that binary call is wrapped up in a class definition. A consumer of a class needs to know enough about that class to know the different exceptions it could raise and why, and be ready for those cases it can handle (and let those situations it can't handle propagate up).
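
A sketch of that distinction, assuming a hypothetical run() helper and CommandFailed exception (neither is from ai_sd.py). The OSError raised by Popen for a missing program is left to propagate, while a non-zero exit is reported separately:

```python
import subprocess


class CommandFailed(Exception):
    """Hypothetical error for a command that ran but exited non-zero."""


def run(cmd):
    """Run cmd and return its stdout; OSError propagates if cmd is missing."""
    proc = subprocess.Popen(cmd, stdout=subprocess.PIPE,
                            stderr=subprocess.PIPE)
    out, err = proc.communicate()
    if proc.returncode != 0:
        raise CommandFailed("%s exited %d: %r" % (cmd[0], proc.returncode, err))
    return out


try:
    run(["no-such-binary-for-this-sketch"])
except OSError as err:
    # Cause: the program is missing or cannot be executed.
    print("cannot run program:", err)
except CommandFailed as err:
    # Cause: the program ran but reported failure.
    print("program failed:", err)
```

The caller decides per cause; nothing in run() forces both situations into one exception type.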

If a caller truly doesn't care about the cause, Python has a way to handle multiple exceptions in a single clause (or one can handle all exceptions unilaterally with an "except StandardError" though that should really be used sparingly). But the class shouldn't be making the decision for the caller that the caller will handle all error cases the same way - and re-wrapping every exception into a single class is forcing the caller to handle them all the same way.
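
For illustration, a caller that chooses to group causes itself might look like this. read_port and port_or_default are made-up names; the sketch is written for Python 3, where IOError is an alias of OSError and Exception takes the place of Python 2's StandardError:

```python
def read_port(path):
    """Read an integer port number from the first line of a file."""
    with open(path) as port_file:          # may raise OSError
        return int(port_file.readline())   # may raise ValueError


def port_or_default(path, default=80):
    # The caller, not read_port, decides these two causes are equivalent here.
    try:
        return read_port(path)
    except (OSError, ValueError):
        return default


print(port_or_default("/no/such/file/for/this/sketch"))
```

Another caller remains free to catch only ValueError, or neither, because read_port has not merged them.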

Finally, any value of lumping exceptions together by class goes away *completely* when you consider that, underneath it all, Python is dynamically typed, between subclasses, duck-typed objects, mixins, metaclasses, etc. The moment you step away from "I created this object" and into "I got an object as a parameter, and it should be a <class> or <class-like> object with a foobar function that raises XYZ for a given error condition", encapsulation of error conditions starts looking really ugly.

For ValueError, it should only arise in one of two cases: programming error, or passing in a dynamic (e.g. user-selected) value. In the former case (which is the case for the usage scenarios in this file), again, masking with a DHCPError would make it hard to debug; in the latter, it would be acceptable to catch, but only by the portion of the program "in control of" the dynamic value.
I appreciate your concern about ValueError being a programming error not a run time issue, and as ValueError can only be returned under 2 cases from subprocess.Popen() -- and we aren't going near them -- I've pulled that exception check out of both the new and old code.


Thank you.

Overall, the original implementation was much cleaner; really, it just needed to check the retcode of the Popen object and assert that stdout returned a usable value (and raise an exception as needed).
No, unfortunately, that was not possible. My original version was not modular enough -- we needed to be able to pull functionality out to test it.

You're correct. Half my brain (simple, direct code) is sometimes at odds with the other half (direct, modular code) - the modular side wins in this case.


- Keith

_______________________________________________
caiman-discuss mailing list
[email protected]
http://mail.opensolaris.org/mailman/listinfo/caiman-discuss

