Am 17. Oktober 2017 23:12:35 MESZ, schrieb Daniel Barker <danba...@umich.edu>: >Hi, All, > >I am gathering hardware requirements for head nodes for my next >cluster. >The new cluster will have ~1500 nodes. We ran 5 million jobs last year. >I >plan to run the slurmctld on one node and the slurmdbd on another. I >also >plan to write the StateSaveLocation to an NFS appliance. Does the >following >configuration look sufficient? > > >Node1: >slurmctld >128GB ram >2TB local disk >12 core high clock rate CPU > >Node2: >slurmdbd >slurmctld backup >128GB ram >2TB local disk >mirrored 500GB SSD for database >12 core high clock rate CPU > > >Do I need more RAM in either node? Is12 cores enough? Is 500GB large >enough >for the Slurm database?
What do you do when Node2 goes down? Regards, BR -- FSU Jena | JULIELab.de/Staff/Benjamin+Redling.html vox: +49 3641 9 44323 | fax: +49 3641 9 44321