Hi, I am testing an IA64 PVFS client using IB. The client seems to behave and work well most of the time. For some reason under certain uses the pvfs IA64 client hangs. Can't quite reproduce the hang but it happens often enough to be annoying.
On the client and on the Metadata server I find messages like this: IA64 client ---------------- [E 11:49:06.762417] job_time_mgr_expire: job time out: cancelling bmi operation, job_id: 2425168. [E 11:49:06.762762] msgpair failed, will retry: Connection timed out [E 11:49:06.762796] *** msgpairarray_completion_fn: msgpair to server ib://hpcxe001:3337,tcp://hpcxe001:3336 failed: Connection timed out [E 11:49:06.762810] *** Non-BMI failure. [E 11:49:06.762823] getattr_object_getattr_failure : Connection timed out [E 11:54:07.121232] job_time_mgr_expire: job time out: cancelling bmi operation, job_id: 2425476. [E 11:54:07.121277] job_time_mgr_expire: job time out: cancelling bmi operation, job_id: 2425478. pvfs metadata server --------------------- hpcxe001: [E 10/11 11:44] job_time_mgr_expire: job time out: cancelling bmi operation, job_id: 4432802. pvfs i/o server ---------------- hpcxe005: [E 10/11 11:28] job_time_mgr_expire: job time out: cancelling bmi operation, job_id: 5946725. Anyone know what this means? Anyway to get pvfs-client started in a more verbose or debug mode so it can log more info for me to look at? Thanks Rene _______________________________________________ Pvfs2-users mailing list [email protected] http://www.beowulf-underground.org/mailman/listinfo/pvfs2-users
