On Sun, Oct 07, 2007 at 07:10:20PM -0500, Sam Lang wrote: >>> pvfs2-client-core: src/client/sysint/sys-io.sm:1860: >>> io_post_write_ack_recv: Assertion `ret == 0' failed. >> >> Interesting.. It looks like job_bmi_recv() was returning 1 indicating >> immediate completion for the final write ack's receive which seems >> impossible.. >> Sam, Phil: That shouldn't happen, right? > > Right. It should be returning an error if it fails -- never 1. > > Anthony, which BMI method (tcp, ib, gm) are you using with your setup?
I'm using tcp BMI. It is returning an immediate successful receive of 0x18 bytes. I dont know enough about pvfs internals to be sure this is the correct fix, but for this state it seems to work if I set write_ack_has_been_posted and return success.. I get successful writes with correct data, no io hangs, no crashes.. >> I don't see this on my setup even with -threaded client-cores... >> >> Sam: WHat is the reason for building the -threaded version of >> pvfs2-client-core? > > Multiple threads allows for a device thread to handle unexpected operations > from the kernel module. We were seeing minor performance improvements with > some tests we did, especially for smaller IOs. > > -sam > >> I forget now..BMI on client drives progress using threads? >> The main thread/event loop is certainly uni threaded as far as I can >> tell. >> thanks, >> Murali >> _______________________________________________ >> Pvfs2-users mailing list >> [email protected] >> http://www.beowulf-underground.org/mailman/listinfo/pvfs2-users >> _______________________________________________ Pvfs2-users mailing list [email protected] http://www.beowulf-underground.org/mailman/listinfo/pvfs2-users
