Hi Uday,

Could you shed some light on what sort of workload(s) you were running over the 4-5 day period after which this situation was encountered? Also, if you could run

lsof -p `pidof pvfs2-server`

that might tell us where and how all of the server's fds are being used.

Thanks for the report!
Murali
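If the lsof output turns out to be very long, a rough per-type count of the open descriptors may be easier to read. The following is only a minimal sketch, assuming a Linux host where /proc/<pid>/fd is readable by the user running it; the helper names (classify, summarize, read_fd_targets) are made up for illustration and are not part of PVFS2 or lsof.

```python
import os
from collections import Counter


def classify(target: str) -> str:
    """Map a /proc/<pid>/fd symlink target to a coarse category."""
    if target.startswith("socket:"):
        return "socket"
    if target.startswith("pipe:"):
        return "pipe"
    if target.startswith("anon_inode:"):
        return "anon_inode"
    if target.startswith("/"):
        return "file"
    return "other"


def summarize(targets):
    """Count descriptor targets per category."""
    return Counter(classify(t) for t in targets)


def read_fd_targets(pid: int):
    """Return the symlink targets of all open descriptors of `pid` (Linux only)."""
    fd_dir = f"/proc/{pid}/fd"
    targets = []
    for name in os.listdir(fd_dir):
        try:
            targets.append(os.readlink(os.path.join(fd_dir, name)))
        except OSError:
            # The descriptor was closed between listdir() and readlink().
            continue
    return targets


# Example use, with the pid taken from `pidof pvfs2-server`:
#   for kind, count in summarize(read_fd_targets(pid)).most_common():
#       print(count, kind)
```

A count dominated by sockets would point toward connections that are never closed, while a count dominated by regular files would point toward the storage side; either way it only narrows down where to look.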
> Hi,
>
> We are noticing a few problems when running PVFS2 version pre1.3.1.
>
> - pvfs2-client-core memory keeps increasing after using the system for
>   4-5 days. We have seen it go as high as 900M, at which point we have to
>   kill it and remount the file system.
> - The pvfs2 client log shows the following errors. The server does not
>   have any corresponding error.
>
> [E 23:31:07.186802] *** msgpairarray_completion_fn: msgpair failed, no retry: Connection reset by peer
> [E 23:31:07.186856] Error: state machine using an invalid termination path.
>
> - We have also seen an error in the pvfs2-server log indicating that the
>   pvfs2-server ran out of max open file descriptors. I was unable to
>   capture the log message as the server clobbered the log (instead of
>   appending to it) when I restarted the pvfs2 server. The kernel
>   parameters for max open files are quite high.
>
> - One pvfs2-server in a group of 8 keeps running at 99.9% CPU.
>   Incidentally, this is the same server that ran out of max open file
>   descriptors.
>
> Has anyone seen these or similar errors? I would appreciate it if anyone
> can share their experience in resolving these errors.
>
> thanks,
> uday
>
> _______________________________________________
> PVFS2-users mailing list
> [email protected]
> http://www.beowulf-underground.org/mailman/listinfo/pvfs2-users
