I've started a github page for some tools I've put together from various
bits of code, how-tos, etc. to simplify the setup of parallel
environments so that they work universally for all MPI implementations
(on x86_64 Linux) w/ tight-integration support (no support for ssh yet).
The syntax for submitting parallel jobs becomes more similar to
LSF/PBS/Torque and provides for easy configuration of your task layout
(ppn,nodes,pcpus,pcpus_min,pcpus_max). We use a JSV to make the magic
happen. We create PEs tied to queues since our queues often delineate
changes in the underlying communication fabrics available.
A lot of the start/stop pe code comes from various how-tos put together
by the GE gurus on this list, though it has been modified. I'm curious
to see if anyone would find this useful or would want to test/use it in
their environments. I've already had some success with beta testers,
greatly simplifying their submit scripts (for more complex cases) and
shortening our documentation :)
The project page is at: https://github.com/brichsmith/gepetools
Suggestions for added features would be appreciated!
Brian Smith
Senior Systems Administrator
IT Research Computing, University of South Florida
4202 E. Fowler Ave. SVC4010
Office Phone: +1 813 974-1467
Organization URL: http://rc.usf.edu
_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users