Setting the stack size limit to the vmem limit was fixed long time ago in 2009 for SGE 6.2u3 hence it should work in all later versions as well as in all SGE 6.2u5 successors like Univa Grid Engine. Hence the exact version number would be interesting.
Daniel Am 26.07.2012 um 18:02 schrieb Christoph van Wüllen: > It is a long-standing problem that due to a bug in Sun GridEngine > (setting the stack size limit equal to the address space limit) > using qrsh from within OpenMPI fails if a large memory is requested > but the stack size not explicitly set to a reasonably small value. > > The best solution were if SGE just would not touch the stack > size limit and leave it at INFINITY. > > However I have tested that just reducing the stack size limit in > file orte/mca/plm/rsh/plm_rsh_module.c, function ssh_child() before > execv'ing qrsh circumvents the problem, so just after exec_patch is set > by strdup(...) I inserted the lines > > { > struct rlimit rlim; > int l; > > l=strlen(exec_path); > if (l > 5 && !strcmp("/qrsh", exec_path + (l-5))) { > getrlimit(RLIMIT_STACK, &rlim); > if (rlim.rlim_max > 10000000L) rlim.rlim_max=10000000L; > if (rlim.rlim_cur > 10000000L) rlim.rlim_cur=10000000L; > setrlimit(RLIMIT_STACK, &rlim); > } > } > > > It looks quick-and-dirty and it certainly is, but it solves a severe > problem many users have with OpenMPI and SGE. Feel free to use this > information as you like. Note that MPI worker jobs eventually > spawned off on "distant" nodes do not suffer from the reduced stack > size limit, it is only the qrsh command. > > Is this (still) of interest? > > +---------------------------------+----------------------------------+ > | Prof. Christoph van Wüllen | Tele-Phone (+49) (0)631 205 2749 | > | TU Kaiserslautern, FB Chemie | Tele-Fax (+49) (0)631 205 2750 | > | Erwin-Schrödinger-Str. | | > | D-67663 Kaiserslautern, Germany | vanwul...@chemie.uni-kl.de | > | | > | HomePage: http://www.chemie.uni-kl.de/vanwullen | > +---------------------------------+----------------------------------+ > > > _______________________________________________ > devel mailing list > de...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/devel