Hi,

I am testing an IA64 PVFS client over InfiniBand (IB).  The client
works well most of the time, but under certain uses it hangs.  I can't
reproduce the hang reliably, but it happens often enough to be
annoying.

On the client, the metadata server, and an I/O server I find messages
like these:

IA64 client
----------------
[E 11:49:06.762417] job_time_mgr_expire: job time out: cancelling bmi operation, job_id: 2425168.
[E 11:49:06.762762] msgpair failed, will retry: Connection timed out
[E 11:49:06.762796] *** msgpairarray_completion_fn: msgpair to server ib://hpcxe001:3337,tcp://hpcxe001:3336 failed: Connection timed out
[E 11:49:06.762810] *** Non-BMI failure.
[E 11:49:06.762823] getattr_object_getattr_failure : Connection timed out
[E 11:54:07.121232] job_time_mgr_expire: job time out: cancelling bmi operation, job_id: 2425476.
[E 11:54:07.121277] job_time_mgr_expire: job time out: cancelling bmi operation, job_id: 2425478.



pvfs metadata server
---------------------
hpcxe001: [E 10/11 11:44] job_time_mgr_expire: job time out: cancelling bmi operation, job_id: 4432802.


pvfs i/o server
----------------
hpcxe005: [E 10/11 11:28] job_time_mgr_expire: job time out: cancelling bmi operation, job_id: 5946725.


Does anyone know what this means?  Is there any way to start pvfs-client
in a more verbose or debug mode so it logs more information for me to
look at?

Thanks
Rene
 