We face this problem on the Lustre servers on our cluster with GigE network. We found that increasing the following value in /etc/sysctl.conf forces the kswapd to kick in a lot earlier and prevent the scenario that you are talking about. Our servers have only 8GB memory, you might want to bump it up to 2GB or even 4GB with 32GB system memory.
# Control the min_free_kbytes vm.min_free_kbytes = 1048576 Hope this helps. Nirmal _______________________________________________ Lustre-discuss mailing list [email protected] http://lists.lustre.org/mailman/listinfo/lustre-discuss
