Hi,

I'm trying to set up a new cluster with stateless nodes, but whatever I do,
I can only seem to get it to boot to the genesis image.

I set up my node manually (no discovery); the node does not have IPMI, BMC
- just plain PXE. The node does get the correct IP from the DHCP server.

My hunch is that my problem is related to the node's chain attribute, but I
am not sure what to set there. Most people seem to set
"runcmd=bmcsetup,shell" but since I don't have a BMC, that's probably not
the correct setting for me.

Or maybe my problem is somewhere different?


Here is my node definition:

Object name: hpccn-001
    arch=x86_64
    groups=compute,all
    initrd=xcat/osimage/hpccn-netboot-image/initrd-stateless.gz
    installnic=mac
    ip=192.168.101.3
    kernel=xcat/osimage/hpccn-netboot-image/kernel
    mac=52:54:00:f2:e6:39
    netboot=xbna
    nichostnamesuffixes.eth0=-compute
    nichostnamesuffixes.eth2=-imm
    nicips.eth0=192.168.100.3
    nicips.eth2=192.168.101.3
    os=rhels7.4
    postbootscripts=otherpkgs,confignics
    postscripts=syslog,remoteshell,syncfiles,setupntp
    primarynic=mac
    profile=hpccn
    provmethod=hpccn-netboot-image
    serialport=0
    serialspeed=115200
    status=shell
    statustime=10-30-2017 08:37:51

And the osimage definition:

Object name: hpccn-netboot-image
    exlist=/install/custom/netboot/rh/hpccn.rhels7.4.x86_64.exlist
    imagetype=linux
    osarch=x86_64
    osdistroname=rhels7.4-x86_64
    osname=Linux
    osvers=rhels7.4
    otherpkgdir=/install/post/otherpkgs/rhels7.4/x86_64

otherpkglist=/install/custom/netboot/rh/hpccn.rhels7.4.x86_64.otherpkgs.pkglist

partitionfile=/install/custom/netboot/rh/hpccn.rhels7.4.x86_64.cfglocaldisk
    permission=755
    pkgdir=/install/rhels7.4/x86_64
    pkglist=/install/custom/netboot/rh/hpccn.rhels7.4.x86_64.pkglist
    postinstall=/install/custom/netboot/rh/hpccn.rhels7.4.x86_64.postinstall
    profile=hpccn
    provmethod=netboot
    rootimgdir=/install/netboot/rhels7.4/x86_64/hpccn
    synclists=/install/custom/netboot/rh/hpccn.rhels7.4.x86_64.synclist

The node gets the correct IP address from DHCP, and xnba booting works, but
then goes to the genesis kernel (which either stops booting, or goes to the
shell if I add chain=shell), as the Apache log file shows:

192.168.101.3 - - [30/Oct/2017:08:34:55 -0700] "GET
/tftpboot/xcat/xnba/nets/192.168.101.0_24 HTTP/1.1" 200 245 "-"
"iPXE/1.0.3-131028 (d603e)"
192.168.101.3 - - [30/Oct/2017:08:34:55 -0700] "GET
/tftpboot/xcat/genesis.kernel.x86_64 HTTP/1.1" 200 4152336 "-"
"iPXE/1.0.3-131028 (d603e)"
192.168.101.3 - - [30/Oct/2017:08:34:55 -0700] "GET
/tftpboot/xcat/genesis.fs.x86_64.gz HTTP/1.1" 200 36402928 "-"
"iPXE/1.0.3-131028 (d603e)"

-- 

_______________________________________________________________________
Kevin Keane | Systems Architect | University of San Diego ITS |
kke...@sandiego.edu
Maher Hall, 192 |5998 Alcalá Park | San Diego, CA 92110-2492 | 619.260.6859
<%28619%29%20260-2298>
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user

Reply via email to