Re: [caiman-discuss] Install Engine Design Document review

Karen Tung Thu, 10 Jun 2010 16:27:54 -0700

Hi Jean,

Thank you again for reviewing the document.  My responses are inline.


On 06/09/10 10:07, jean.mccormack wrote:

Part 2.....

7.4 Implementation
---------------------------------
Why do you change where the snapshots of the DOC are stored? If forthe first n checkpoints they are in /tmpwhy even bother with putting the last x snapshots somewhere else?Also, what if there are multiple install targets?Where do the snapshots go then? Also, sometimes the snapshots are in/tmp/doc_<checkpoint name>_<pid>. Whathappens in the case of DC on a resume when the pid is now different.How do you know where to get the info from?

* DOC snapshots are stored in /tmp before the install target isavailable for storing them. When the installtarget is available, I store the DOC snapshots in the ZFS datasets sothey are included in the ZFS snapshots of

the install target too.

* If there are multiple install targets, I was planning to just use thefirst install target. Dave made a pointin his review comment that the engine should provide a function for theapplication to explicitlyspecify what ZFS dataset to use for storing the snapshots. I thinkthat's a great idea. If wedo it that way, we also won't need to worry about having multipleinstall targets.

* The DOC snapshots stored in /tmp, before the ZFS dataset is availableis only used for in-process resume.For the case of DC, where the pid is different on a subsequentinvocation of DC, users are not allowedto resume at checkpoints before the ZFS dataset is available, becauselike you said, we won't be able

to access the files from /tmp anymore.

You say this:
If ZFS dataset is not available, the engine will query theDataObjectCache to see if the install
  target is set, and if so, whether it is created.
That kind of leaves me hanging. If it is created what happens? Andlikewise, what if it isn't there?

If it is created and available, it will be used. If it is not ready tobe used, we will nottake any ZFS snapshot. We will just take the DOC snapshot and put it in/tmp.

 7.4.1 Determining which checkpoint can be resumed to
----------------------------------------------------------------------------------------------
This is misleading:
The checkpoint must be registered at exactly the same position in thecheckpoint list as the
previous invocation of the application.
Then you go on to explain which is in my mind correct. But the firstbullet I mention above is
not the same as your explanation.

Does my explaination in the first bullet above help?

The DOC contains information on all the successfully executedcheckpoints. So, havinga snapshot of it will help determine what checkpoints was last executed,and in what

order are they executed in.

page 16
7.5 Function Definition
---------------------------------------
when talking about resume_execute_checkpoint you say:
This function can only be called once in each invocation of theapplication.
Why? I would think that if you had checkpoints a,b,c,d,e,f you could
resume from b and pause at c and then resume from c and pause at e ifyou wanted.

resume_execute_checkpoint() is used for cases where one wants to resumefrom a previousinvocation of the program. So, it does the check to see whether thespecified checkpoint_nameis resumable based on the "rules", does the rollback of the ZFS snapshotand DOC snapshot...etc..

Let me clarify this in the function description.

To do the example you had above, in the same process,
you don't need to use resume_execute_checkpoint().  You can
just do it with multiple execute_checkpoints(), for example:

execute_checkpoints(start_from="a", pause_at="c")
execute_checkpoints(start_from="c")

page 18

10.1 Progress Estimates
-------------------------------------------
You say this:
"The checkpoint developer should run their checkpoint on thisstandardized machine, and based on some metric, the amount of time ittakes to execute that checkpoint will be converted to a value thatwill be returned as weight."
My issue with this is that it doesn't take into account things likenumber of packages being installed or size of the area being cpio'detc. For some checkpoints your statement will work, for others itwon't. Specifically, I think it won't for target discovery or transfer.

Yes, all the things you mentioned above should be included in theprogress estimate calculation by the checkpoint.

Speed of the network, speed of the disk, processor speed...etc.. variesgreatly and would affect performance,even if we were to install 1 package or discovery 1 target. Therefore,I talked aboutusing a standardized machine to fix those unknown values, and let theother "measurable"variables like number of packages to install, and size of image to cpiobe calculated based on that.


Let me add more detail to this section to clarify.

Thanks again for your review.

--Karen



On 05/28/10 04:12 PM, Karen Tung wrote:

Hi,

The draft install engine design doc is ready for review.
I pushed the doc to the caiman-docs repo.  You can
find them under the install_engine directory in
ssh://[email protected]/hg/caiman/caiman-docs

If you don't want to clone the gate, you can also access
the file directly here:

http://cvs.opensolaris.org/source/xref/caiman/caiman-docs/install_engine

You will find 2 files in the directory:
* engine-design-doc.odt is the design document

* engine-sequence-diag-0528.png is the sequence diagram inserted intochapter 14 of the document. You might want to see the bigger imagedirectly.


Please send your feedback by 6/11/2010.

If you plan to review the document, please send me an email privately,
and preferably also let me know when you will complete the review, so
I can plan accordingly and ensure the document is reviewed thoroughly.

Thanks,

--Karen
_______________________________________________
caiman-discuss mailing list
[email protected]
http://mail.opensolaris.org/mailman/listinfo/caiman-discuss


_______________________________________________
caiman-discuss mailing list
[email protected]
http://mail.opensolaris.org/mailman/listinfo/caiman-discuss

Re: [caiman-discuss] Install Engine Design Document review

Reply via email to