Lee Whatley, Contractor wrote:
Pete Wyckoff wrote:
I think it's best if I get around to doing the event-driven bmi_ib
rather than polling and see if that magically fixes it.  Playing
thread scheduling tricks will get us in trouble, as Nathan points
out.

Well, I'm hoping to upgrade this cluster from RHEL3 (2.4 kernel) to RHEL4 (2.6 kernel) sometime in the next few months. I have a feeling a lot of my problems will go away once that is done.

Hey Pete,

FYI, I was finally cleared to upgrade my cluster to RHEL4 (2.6 kernel). Unfortunately, it doesn't look like this fixed my problem. Any operation on a pvfs2 filesystem over native InfiniBand (i.e. not tcp or IPoIB) is extremely slow. Even a simple "ls" on a pvfs2 filesystem with a handful of files and directories takes 5-10 seconds, and the pvfs2-server process takes up 98% of the CPU while it runs.

Because of the operational demands of the users on this cluster, I can't switch the filesystems back from tcp to ib to get you some debug info right this moment. I'm hoping I can set up some playspace where I can give you more details later this week.

I'll keep you posted,
-Lee

_______________________________________________
Pvfs2-developers mailing list
[email protected]
http://www.beowulf-underground.org/mailman/listinfo/pvfs2-developers
