The test environment is simple. There are two nodes, one node A creat a checkpoint and do some write in a tight loop, then the physical connection is dropped, at the same time(about 10ms later than A's physical connection dropped) if the other node B wanna do saCkptCheckpointOpen or saCkptCheckpointRead of the opened checkpoint, it will be blocked at saRecvRetry for 3 seconds. After that saRecvRetry will return with OK. The checkpoint is about 3MB, and the network connection between A and B is 1Gbps.
Thanks. > Subject: Re: [Openais] Need help to reduce the time wait of saRecvRetry() > From: [EMAIL PROTECTED] > To: [EMAIL PROTECTED] > CC: [email protected] > Date: Mon, 20 Oct 2008 13:14:58 -0700 > > > On Mon, 2008-10-20 at 14:20 +0800, Ratbag Patrick wrote: >> How To Reproduce: >> Using the latest version of whitetank, creat one ckpt, then if one >> node's physical network connection is dropped(like unplug the RJ45 >> jacket), then at the same time the other node will be blocked at >> saRecvRetry() for about 3 seconds. >> >> >> Actually it's not acceptable in my envronment. Could anyone tell me >> how to reduce the blocked time(like 3s to 100ms) or tell me if it's a >> bug? >> > > I'll take a look when I have an opportunity. > > What is the other node doing in saRecvRetry? Reading the checkpoint in > a tight loop? > > If you have many checkpoints that call should still not block but might > return SA_AIS_ERR_TRY_AGAIN for some long period of time. > > Regards > -steve > >> Thanks! >> >> _______________________________________________ >> Openais mailing list >> [email protected] >> https://lists.linux-foundation.org/mailman/listinfo/openais > _________________________________________________________________ Discover the new Windows Vista http://search.msn.com/results.aspx?q=windows+vista&mkt=en-US&form=QBRE _______________________________________________ Openais mailing list [email protected] https://lists.linux-foundation.org/mailman/listinfo/openais
