Mark/Carl, slurmctld is running now and talking to the bluegen. I'm currently trying out some test jobs.
Thanks very much to you both for your help and expertise. James On 01/08/2012 02:31, Mark Nelson wrote: > > Hi Carl, > > On 01/08/12 05:47, Carl Schmidtmann wrote: >> >> ----- Original Message ----- >>> Carl, 2.3 does not work on a Q so you should avoid it. >> >> I was about to say the same thing for 2.4.1 ;). >> >> James, >> >> I think I have a partial answer to your problem. You need to specify a >> FrontendNode. Without that slurmd uses the NodeName as a FrontendNode and >> tries to add it to the hash table creating the duplicate error. After I >> added a FrontendNode that problem went away. > > Ah yes, that is indeed your problem. Sorry, I skimmed the slurm.conf and > it didn't jump out at me that you were missing the definition of the > frontend node. If I comment out the appropriate line in my slurm.conf > (FrontendName=slurm-dev State=UNKNOWN), I get exactly the Duplicated > NodeName error that you're seeing. > >> >> Now I am trying to figure out how a bluegene.conf file, or just the blocks >> section, can be generated by smap. Several places say to let smap create the >> file but there is nothing in the smap man page nor the bluegene.conf man >> page about how to do this. Slurmd won't run without bluegene.conf and smap >> won't run without slurmd. So how can it generate the block specifications to >> put into the bluegene.conf file? > > You should be able to start smap using the -Dc option without having any > of the daemons running (although you will need a valid slurm.conf). And > then when the curses interface opens up you can use the create option to > create blocks. For example, call "create 1" eight times to create eight > midplane blocks. Then you can use "save bluegene.conf". > > Oh, also, on our system we had to set the MloaderImage to be > /bgsys/drivers/ppcfloor/boot/firmware rather than uloader in the > bluegene.conf. > > Let us know how you go. > > Thanks, and good luck! > Mark > >> >> I am going to create a file based on Mark's example file just to get slurmd >> running and then see what smap will show me. >> > -- The University of Edinburgh is a charitable body, registered in Scotland, with registration number SC005336.
