Re: [Pvfs2-developers] bmi questions

Phil Carns Fri, 18 Aug 2006 06:10:48 -0700

I have some questions related to the design semantics of BMI.
* timeouts. It looks like the timeout for bmi test calls is the maxamount of time spent _idling_ in the test call (as apposed to the maxtime spent in the test call).

This is correct. The name of the argument is max_idle_time_ms. Themain reason it was put there is to give an opportunity to prevent BMIfrom busy spinning when it is polling for completion. The moretraditional timeout semantics (where you wait up to N seconds forsomething specific to finish before giving up, whether busy or not) isimplemented at the job level. When the job level doesn't want BMI toblock, it sets max_idle_time_ms to 0, but when it is doesn't really havemuch else to do it will set it to a few milliseconds. This is enough toprevent high cpu usage, but still low enough for us to pop out and doother occasional book keeping at the job level.

In other words, if operations are beingcompleted continuously, then the timeout is never triggered, and thecall can block for much longer than the actual timeout specified.

I don't think this is true in practice, because we never loop (withinbmi) over a function that can idle. The bmi_tcp and bmi_gm methods takethis approach to implementing the max idle time:


- check completion queue: if find something, return immediately

- call a generic progress function that may idle for as long asmax_idle_time_ms but will exit as soon as it gets any work done (thework may or may not be related to what the caller tested for)

- check completion queue: if find something, return immediately

So the only way that this function can block much longer thanmax_idle_time_ms is if checking the completion queue takes a long time.Completion checking is typically very fast though; testsome() andtest() map ids directoy to operations so there is no data structuresearching, while testcontext() just takes the first N available itemsfrom the completion queue.

Isthis the desired behavior? The concern would be that the bmioperations would be completed at a constant rate, causing a burstybehavior of completed bmi jobs.

I don't think it is particularly bursty, but the test functions willalways return as much as they can from the completion queue when theycheck, on the theory that the caller can do a better job of figuring outwhat to do with them. There isn't much reason for the BMI layer tothrottle completed operations.


> The incount constrains this,  but for
> both bmi api users and bmi method implementors we should  probably
> document all those semantics.

This stuff could definitely stand to have much better documentation.

-Phil
_______________________________________________
Pvfs2-developers mailing list
[email protected]
http://www.beowulf-underground.org/mailman/listinfo/pvfs2-developers

Re: [Pvfs2-developers] bmi questions

Reply via email to