Use the command "scontrol show job --detail". The output will contain a line like this for each node allocated to each job:
     Nodes=tux123 CPU_IDs=2-5 Mem=2048
While the data does exist, that's not going to be particularly simple to parse and work with. There has been talk about adding an "--xml" option for XML output from scontrol, but that has never been done. Since SLURM is open source, you could modify scontrol to add an "--xml" option or build a new tool for your particular application.
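
For example, here is a minimal sketch in Python that shells out to scontrol and collects the Nodes/CPU_IDs/Mem fields for each job. It assumes the line format shown above; adjust the patterns if your SLURM version prints the fields differently, and note that a real tool would also need to expand node list expressions like "tux[1-4]".

    import re
    import subprocess

    def expand_ids(spec):
        # Expand a CPU_IDs string such as "2-5,7" into [2, 3, 4, 5, 7].
        ids = []
        for part in spec.split(","):
            if "-" in part:
                lo, hi = part.split("-")
                ids.extend(range(int(lo), int(hi) + 1))
            else:
                ids.append(int(part))
        return ids

    def job_core_map():
        # Returns {job_id: [(node_spec, [core, ...], mem), ...]}.
        out = subprocess.check_output(["scontrol", "show", "job", "--detail"],
                                      universal_newlines=True)
        jobs, job_id = {}, None
        for line in out.splitlines():
            m = re.search(r"\bJobId=(\d+)", line)
            if m:
                job_id = m.group(1)
                jobs[job_id] = []
            m = re.search(r"Nodes=(\S+)\s+CPU_IDs=(\S+)\s+Mem=(\S+)", line)
            if m and job_id is not None:
                nodes, cpu_ids, mem = m.groups()
                jobs[job_id].append((nodes, expand_ids(cpu_ids), mem))
        return jobs

    if __name__ == "__main__":
        for job_id, allocs in job_core_map().items():
            for nodes, cores, mem in allocs:
                print(job_id, nodes, cores, mem)
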

Moe Jette
SchedMD

Quoting Mark Nelson <mdnels...@gmail.com>:

Hi there,

My colleague came up with the question below about running jobs on a normal x86-based cluster. Hopefully someone here can shed some light on this.

When running SLURM on a multi-core/multi-socket cluster, is there any way of finding out which cores are allocated to a particular job? Using "scontrol show job" I can find out which nodes are allocated and the total number of cores, but I have no way of knowing how these cores might be distributed across the nodes. While the system seems to allocate cores consecutively, across multiple jobs there is no way of knowing which cores are assigned to which job. For example, on an 8-core multi-node system, if I ask for 3 cores across 2 nodes (salloc -n 3 -N 2), how do I know whether 2 cores are allocated from the first node and 1 core from the second, or vice versa? Also, as nodes fill up with other jobs, and jobs finish at different times, there is no way of mapping jobs to particular cores. I've seen from other postings that SLURM core numbering might not match the physical hardware core numbering, but for my purposes this is not a problem, as long as the numbering is consistent.

The reason I'm asking this question is that I'm trying to integrate SLURM with PTP (the Eclipse Parallel Tools Platform) system monitoring, which expects to map jobs to nodes and cores in a graphical interface. Therefore, for jobs on a multi-core cluster, I need to report which nodes and cores a particular job is running on, in a specified XML format.
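
To give a rough idea, something along the lines of the sketch below is the kind of output I would generate; the element and attribute names here are just placeholders, not the actual PTP schema.

    import xml.etree.ElementTree as ET

    def to_xml(jobs):
        # "jobs" maps a job id to a list of (node, [core, ...]) pairs,
        # e.g. {"1234": [("tux123", [2, 3, 4, 5])]}.  Placeholder layout only.
        root = ET.Element("jobs")
        for job_id, allocs in jobs.items():
            job_el = ET.SubElement(root, "job", id=str(job_id))
            for node, cores in allocs:
                node_el = ET.SubElement(job_el, "node", name=node)
                for core in cores:
                    ET.SubElement(node_el, "core", id=str(core))
        return ET.tostring(root)

    print(to_xml({"1234": [("tux123", [2, 3, 4, 5])]}))
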


Many thanks!
Mark.



