[Pvfs2-developers] bmi testcontext/testunexpected

Sam Lang Mon, 22 Dec 2008 13:07:10 -0800


Hi All,

I think Nawab has found a bug (or untested code path) in the BMI tcpmethod. He's running a daemon that both receives unexpected requests(as a server), and receives expected responses (as a client).

In the BMI_testcontext call, if there aren't any completed (expected)operations, and there are completed unexpected receives, we returnimmediately, assuming that BMI_testunexpected will be called in turn.I think the idea here is that we want to keep our latency down forunexpected messages, instead of doing work on expected messages whileunexpected messages are waiting in the hopper. But the daemon issingle threaded, and making blocking PVFS_sys_* calls, so weessentially spin forever calling BMI_testcontext over and over.

I'm not sure of the best way to fix this. Easy fixes would be toremove the check for completed unexpected receives, and/or dotcp_do_work for a shorter timeout.

It seems like we have a special case for blocking PVFS_sys_* calls.We want to ignore unexpected receives just in that case, and actuallycall tcp_do_work. In other contexts, I think we want the behaviorthat we have now, where we assume that a BMI_testunexpected call willfollow a BMI_testcontext call. We could modify the testcontext callto take a separate parameter, but that seems messy. We might also beable to handle this with separate BMI contexts somehow...


-sam
_______________________________________________
Pvfs2-developers mailing list
[email protected]
http://www.beowulf-underground.org/mailman/listinfo/pvfs2-developers

[Pvfs2-developers] bmi testcontext/testunexpected

Reply via email to