Re: [caiman-discuss] Review: ManifestParser/ManifestWriter Design Spec

Keith Mitchell Fri, 25 Jun 2010 12:02:44 -0700

Hi Dermot,

Your responses below and in your other follow-up email detailingdecisions from the meeting all sound good to me.


- Keith

On 06/23/10 05:37 AM, Dermot McCluskey wrote:

Keith,

Thanks for reviewing.  Responses below.

On 06/23/10 00:26, Keith Mitchell wrote:
Hi Dermot,
First comment - the link given points to the wiki "preview" pagewhich requires edit privileges. The correct link would be:http://hub.opensolaris.org/bin/view/Project+caiman/OpenSolarisInstaller+-+ManifestParser+and+ManifestWriter
Sorry for the error and thanks for correcting this.
My comments are below. Overall, the design looks good; I have a fewquestions about the behavior of the interfaces, as well as theexceptions exposed. (Thanks for calling out the differences from theprior ManifestParser - that was interesting and a valuable point ofcomparison)
3.4.1.1: The behavior and defaults for validate_from_docinfo leavessomething to be desired in the scenario of the XML not having aDOCINFO section with a listed DTD. Better default behavior could beachieved by setting the default to None, and then, "ifvalidate_from_docinfo is None" AND the supplied XML file has a givenDTD, perform the validation. If the caller explicitly wantsvalidation from docinfo, they pass in True (in which case, absence ofthe DOCINFO/DTD data would be an error, and if they explicitly don'twant validation, then pass in False.
I fully expected that this bit would need to be re-worked, so I'm gladyou
commented on this.  I was basically trying to map onto the functionality
provided by lxml and allow the document to be parsed and validated in
one pass, instead of having to parse it first and see if it has a DTDbefore
validating.

In any event, the time taken for ManifestParser is miniscule compared to
other checkpoints, so it's not really significant, but I like yoursuggestion,
where validate_from_docinfo = [None|True|False] which allows for a
sensible default and still supports one-pass parsing -and-validation,if desired.
I'll change to do as you suggest.
[btw, the error returned by lxml for DTD validation failure differsslightlydepending of whether single-pass or two-pass parsing and validation isdone.So, perhaps this is a reason not to support single-pass: to keep errorreporting
consistent?]
page 13 - execute/dry_run behavior: I think a dry_run forManifestParser or ManifestWriter would still add items to the DOC.Consider for example, a full dry-run of AI - the subsequentcheckpoints would need the data to perform their dry_run actions.
Makes sense. I'll change it so dry_run will be ignored forManifestParser.
page 13: execute/raises: For an application attempting to recoverfrom such an error, how would they differentiate between thedifferent error cases and inform the user? If the manifest doesn'tvalidate, how can the app inform the user exactly where in the XMLfile the parsing failed? I think ManifestParser needs a very solid,usable method for informing the user of how to fix an error in theirmanifests, which means the errors raised need to be readilyunderstandable by the application and provide plenty of data, so Ithink this section should be expanded. For example, right now,depending on the error one has in an AI or DC manifest, it's possibleto end up with a generic, and useless "could not parse manifest"error with no additional data - that doesn't really help an end-userfigure out that they had a typo on line 78 and forgot to close the"<pkg_repo_addl_authority" tag.
In particular, subclasses of ManifestError that group the errors intodifferent cases would be valuable. And for some of those cases, theerror may benefit from providing more than simply string data. (e.g.,line number, etc.)
lxml returns a fairly useful text description for XML syntax errors,etc. My intention was that allManifestParser errors, from cannot-find-file to does-not-validatewould return the same exception,
but the exception's message would give a detailed description, eg:

Unable to read Manifest file [dc.xml]: No such file
or
XML syntax error in Manifest file [dc.xml]: Opening and ending tagmismatch: software_spec line 59 and transfer, line 66, column 12
or
Validation against DTD [dc.dtd] failed: No declaration for attributemod_name of element checkpoint, line 43, column 27
and the application would just present this message to the user.
If more finely grained control is needed, then I can certainlysub-class ManifestError as you suggest,although offering the line number in a separate value seems excessive- would we really use this
in our applications?
An indication of how the ManifestErrors wrap around or enhance theerrors raised by lxml would be valuable as well.
The exceptions being caught from lxml are:
   IOError
   etree.XMLSyntaxError
   etree.DTDParseError
In all cases, a ManifestError is raised, taking its message from theabove exception
(plus some additional text).
When DTD validation fails, lxml returns an error code and puts thereason in anerror log. ManifestParser raises ManifestError, using the text fromthe error log
(plus some additional text)

I'll add a summary of the above to the document.
page 14: What factors, if any, would affect the value returned fromget_progress_estimate() for this checkpoint? For example,transfer/IPS depends on the number and size of packages beinginstalled - are there any similar factors at play here, or is itexpected to be static (I'm mostly just curious - I can't think ofanything off-hand).
- size of manifest and DTD files, but this would not be significant
- network access is disabled for loading XML and DTD files,
 so that should not be a factor
I can't imagine get_progress_estimate() will return anything otherthan 1, really.
Also, I should add a comment to the spec to state that ManifestParserand ManifestWriterwill not be providing progress updates during their execution.Execution time isexpected to be minimal (c. 0.12 seconds in my prototype), so updatingprogress
would not be very useful.
page 16: ManifestWriter/execute/dry_run behavior: For theManifestWriter, on the one hand it could be argued that a dry_runcould still benefit from presenting the user with an output manifest(perhaps just on stdout?). Depending on how a user or developerrunning an app specifies dry_run and an output file, it may be bestto just always dump to XML.
How about if I write it to the log, but do not create the output file,when dry_run = True?
page 16: execute/raises: Again, more detail would be valuable,although the ManifestWriter is probably better protected fromend-user errors/typos then the parser so this is perhaps less critical.
As per ManifestParser, I can sub-class the exception class if moreprecise
errors are required?


Thanks,
- Dermot
Thanks,
Keith

On 06/17/10 06:02 AM, Dermot McCluskey wrote:
Hi,

I hope there is still some enthusiasm left for reviewing design specs?
I'd like to request a review for my design for ManifestParser andManifestWriter.
The latest version of the document can be found here:
http://hub.opensolaris.org/bin/preview/Project+caiman/OpenSolarisInstaller+-+ManifestParser+and+ManifestWriter
As this component is slightly less complex than others that haverecently beenreviewed, I'd like to request that comments be sent by Friday June25th. If this
is not practical, I will extend this date.
Please let me know if you intend to review this so I can be sure Ihave enoughreviewers. I would particularly appreciate comments from some ofthe following:
   Jack, Alok, Karen, Darren, Sarah.


Thanks,
- Dermot

_______________________________________________
caiman-discuss mailing list
[email protected]
http://mail.opensolaris.org/mailman/listinfo/caiman-discuss
_______________________________________________
caiman-discuss mailing list
[email protected]
http://mail.opensolaris.org/mailman/listinfo/caiman-discuss


_______________________________________________
caiman-discuss mailing list
[email protected]
http://mail.opensolaris.org/mailman/listinfo/caiman-discuss

Re: [caiman-discuss] Review: ManifestParser/ManifestWriter Design Spec

Reply via email to