> -----Original Message----- > From: devel-boun...@open-mpi.org > [mailto:devel-boun...@open-mpi.org] On Behalf Of Matthijs Richard Koot > Sent: Wednesday, June 14, 2006 1:04 AM > To: de...@open-mpi.org > Subject: [OMPI devel] Q: Job scheduling of MPI applications? > (in general) > > I'm new to this list, and have a question regarding the how > MPI jobs are scheduled by JMSs. If I understand correctly, to > have decent management/scheduling of MPI jobs, there are > requirements for both the MPI implementation and JMS > implementation, for them to be 'integrated': > > - the JMS needs to be 'parallel-aware', i.e. implement the PSCHED API;
It is probably more precise to say that the JMS ("Job Management System"?) needs to provide a mechanism to start jobs on allocated nodes. If it provides a parallel mechanism (e.g., a caller can invoke one command to launch many processes), so much the better -- but if the mechanism is serial, that's ok too. All common resource managers provide *some* way of launching jobs on allocated notes -- indeed, that is one of their main purposes (to start / stop jobs). The PSCHED API is one of several such interfaces. A subset of the PSCHED API is only in common use in the PBS line of resource managers (Torque, PBS Pro, etc.). I doubt that TM is the native interface that the PBS flavors use to launch jobs (i.e., I doubt that PBS uses TM internally for launching processes), but I have not dived into the implementation enough to know. Other resource managers have different interfaces. > - the MPI needs to be 'JMS-aware', i.e. call the PSCHED > functions at the JMS. That's correct in spirit, but a little more precise would be to say that the MPI needs to be aware of and properly utilize the mechanism that the resource manager provides to start jobs. > My questions: > 1. Is this correct? > 2. Which question should is valid: "does OpenMPI support > SGE?", or: "does SGE support OpenMPI"? It's probably more correct to ask if Open MPI supports a given resource manager. > 3. How do I know which JMSs (Torque/OpenPBS, SGE, LSF, ...) > are compatible with which MPI implementations (OpenMPI, > MPICH, MPICH-G2, ...), and vice versa? I can't speak for the other MPI implementations, but for Open MPI, you can look here: http://www.open-mpi.org/faq/?category=supported-systems#rte > 4. Is it true that the PSCHED API is the 'de facto' for such > integration? No. It was an attempt to standardize such things, but it never really caught on outside of the PBS family. -- Jeff Squyres Server Virtualization Business Unit Cisco Systems