Currently I have 2 types of nodes:
old = 2 sockets, 4 cores per socket, 64GB mem
new = 2 sockets, 6 cores per socket, 128GB mem
Since I'm using select/cons_res with CR_CPU_Memory, I thought I'd
assign as a default the amount of memory per core for each node type:
old - DefMemPerCPU = 8000
new - DefMemPerCPU = 20000
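As a sanity check on those figures, here is a rough sketch of the per-core arithmetic. It assumes 1 GB = 1024 MB and reserves nothing for the OS. The helper name mem_per_core_mb is made up for illustration and is not part of Slurm. By this calculation the new nodes come out nearer 10900 MB per core than 20000, so that second value may need rechecking.

```python
def mem_per_core_mb(total_gb: int, sockets: int, cores_per_socket: int) -> int:
    """Return whole MB of memory per core (assumes 1 GB = 1024 MB, no OS reserve)."""
    total_mb = total_gb * 1024
    cores = sockets * cores_per_socket
    return total_mb // cores


# Node specs taken from the description above.
print("old:", mem_per_core_mb(64, 2, 4))    # old: 8192
print("new:", mem_per_core_mb(128, 2, 6))   # new: 10922
```

Rounding down from these numbers (for example 8192 to 8000) leaves some headroom for the OS on each node.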
However, DefMemPerCPU is set in the partition definition, not the node
definition. How can I assign those per-node-type defaults, yet still
define a single global partition that lets jobs use the entire cluster?
Assume tux[001-100]=old, tux[101-200]=new
I assume something like
PartitionName=Default Nodes=tux[001-100] DefMemPerCPU=8000
PartitionName=Default Nodes=tux[101-200] DefMemPerCPU=20000
PartitionName=compute Nodes=tux[001-200] Default=yes State=up
will not work.
What is the correct way to represent/use this cluster?
The other option I could think of was to set DefMemPerCPU=1 for the
entire cluster and force users to always use --mem, but I'm hoping to
avoid that kind of solution.