Currently I have 2 types of nodes:
old = 2 sockets, 4 cores per socket, 64GB mem
new = 2 sockets, 6 cores per socket, 128GB mem

Since I'm using select/cr_cons and using CR_CPU_Memory, I thought I'd assign as default the relative amount of memory per core,
old - DefMemPerCPU = 8000
new - DefMemPerCPU = 20000

However, those values are part of the partition, not node, definition.

How can I assign those defaults to the cluster, yet define a single global partition to allow jobs to utilize the entire cluster?
Assume tux[001-100]=old, tux[101-200]=new

I assume something like
PartitionName=Default Nodes=tux[001-100] DefMemPerCPU=8000
PartitionName=Default Nodes=tux[101-200] DefMemPerCPU=20000
PartitionName=compute Nodes=tux[101-200] Default=yes State=up

will not work.

What is the correct way to represent/use this cluster?
The other option I could think of was set DefMemPerCPU=1 to the entire cluster, and force users to always use --mem, but I'm hoping to avoid this kind of solution.

Reply via email to