Hi all, I've recently upgraded the Ubuntu OS on a few independent (i.e., they are not part of the same cluster) servers. This brought me up from version 14.11 of SLURM to 15.08 .
While things have gone fine, on at least two occasions, I've noticed a problem. At the end of slurm.conf, I have an entry like this: NodeName=mynode CPUs=24 SocketsPerBoard=2 CoresPerSocket=6 ThreadsPerCore=2 RealMemory=32057 State=UNKNOWN After using the web-based configuration tool, some web site (sorry, I forgot where I saw it) suggested I run "slurmd -C" and then copy the values to the end of the file. That is what I've done. Anyway, after upgrading the OS, something strange happened. The queue failed to start and I got error messages in the log about the RealMemory being insufficient. I ran "slrumd -C" again, and the value of RealMemory changed! For example, from 32058 to 32057. So...I changed it in slurm.conf and restarted slurm and all is fine now. But, I was wondering ... is this something I should be alarmed about? How could I "lose" memory during an OS upgrade. Is it an approximation and there is some kind of rounding error? Ray
