Hello, I am trying to use PAV to run PVFS with the MX protocol. I've updated PAV so that the servers start and ping correctly. But when I try to run an MPI code, I get client timeouts, as if the client cannot contact the servers:
Lots of this stuff:

[E 19:11:02.573509] job_time_mgr_expire: job time out: cancelling bmi operation, job_id: 3.
[E 19:11:02.583659] msgpair failed, will retry: Operation cancelled (possibly due to timeout)

I have no problem acknowledging that I've done something wrong, but I don't know how to debug MX at all. Any pointers to at least get me started?

Cheers,
brad
