I have a number of compute nodes, two service nodes (SNs), and an xCAT
management node (MN, xCAT 2.5.2) on a flat network. I would like to have
all available services running off the service nodes, but I am running
into some problems.

When a compute node PXE boots for a stateful install (RHEL 6.1), it gets a
DHCP response from the xCAT MN rather than the SNs. Looking on the SNs, I
see an empty dhcpd.leases file. Running makedhcp doesn't resolve this.

After the DHCP response, the node pulls its boot image and installation
from the correct SN, but it fails to update its status and reinstalls
after rebooting. If I run nodeset node boot from the MN, some of the
postscripts don't appear to run correctly. For example, remoteshell doesn't
run, and when I run it using updatenode from the MN, I get these errors:

<error>Unable to read private DSA key from /etc/xcat/hostkeys</error>
<error>Unable to read private RSA key from /etc/xcat/hostkeys</error>

Looking on the SNs, I don't see any /etc/xcat/hostkeys directory. What's
supposed to set this up?

On sharing the /install directory: currently, my SNs are configured to
NFS-mount the /install directory from the MN on boot. Is this correct, or
should they be syncing that directory? I may have missed it, but the wiki
page was unclear to me on this point.

Finally, looking on the node that was installed by the SN, I see syslog is
configured to log to the SN, but I don't see any of its messages arriving
there.

nodels sn01 servicenode
qservice01: servicenode.dhcpserver: 1
qservice01: servicenode.tftpserver: 1
qservice01: servicenode.node: sn01
qservice01: servicenode.nameserver: 1
qservice01: servicenode.nimserver: 1
qservice01: servicenode.ftpserver: 1
qservice01: servicenode.conserver: 1
qservice01: servicenode.monserver: 1
qservice01: servicenode.nfsserver: 1
qservice01: servicenode.comments:
qservice01: servicenode.ldapserver:
qservice01: servicenode.ntpserver:
qservice01: servicenode.ipforward:
qservice01: servicenode.disable:

tabdump site
#key,value,comments,disable
"xcatdport","3001",,
"xcatiport","3002",,
"tftpdir","/tftpboot",,
"master","mn01",,
"domain","cluster.net",,
"installdir","/install",,
"timezone","America/Chicago",,
"forwarders","XXX",,
"dhcpinterfaces","bond0",,
"ntpservers","mn01",,
"consoleondemand","yes",,
"sharedtftp","0",,
"nameservers","mn01",,
"installloc","/install",,

nodels node0001 noderes
qgpu0001: noderes.primarynic: eth0
qgpu0001: noderes.xcatmaster: sn02
qgpu0001: noderes.installnic: eth0
qgpu0001: noderes.netboot: pxe
qgpu0001: noderes.servicenode: sn02
qgpu0001: noderes.node: node0001
qgpu0001: noderes.nfsserver: sn02
qgpu0001: noderes.tftpserver:
qgpu0001: noderes.comments:
qgpu0001: noderes.nfsdir:
qgpu0001: noderes.disable:
qgpu0001: noderes.discoverynics:
qgpu0001: noderes.nimserver:
qgpu0001: noderes.cmdinterface:
qgpu0001: noderes.next_osimage:
qgpu0001: noderes.current_osimage:
qgpu0001: noderes.monserver:

lsdef sn02

Object name: sn02
    arch=x86_64
    bmc=sn02-bmc
    bmcport=0
    currchain=boot
    currstate=boot
    groups=service,ipmi,bnt102-service,x3650m2,all
    initrd=xcat/rhels5.4/x86_64/initrd.img
    installnic=eth0
    interface=eth0
    ip=XXXXXX
    kcmdline=nofb utf8 ks=http://mn01/install/autoinst/qservice02 ksdevice=eth0 console=tty0 console=ttyS0,115200 noipv6
    kernel=xcat/rhels5.4/x86_64/vmlinuz
    mac=E4:1F:13:44:F5:9C
    mgt=ipmi
    mtm=7945AC1
    netboot=pxe
    nfsserver=mn01
    os=rhels5.4
    postbootscripts=otherpkgs,setupntp,setupntp
    postscripts=syslog,remoteshell,syncfiles,nwu.service,servicenode,xcatserver,xcatclient
    primarynic=eth0
    profile=service
    provmethod=install
    serial= 06GA470
    serialport=0
    serialspeed=115200
    servicenode=mn01
    setupconserver=1
    setupdhcp=1
    setupftp=1
    setupnameserver=1
    setupnfs=1
    setupnim=1
    setuptftp=1
    status=booting
    statustime=07-20-2011 16:25:39
    switch=bnt102
    switchport=8
    tftpserver=mn01
    xcatmaster=mn01

lsdef node0001

Object name: node0001
    arch=x86_64
    bmc=node0001-bmc
    bmcport=0
    chain=runcmd=bmcsetup,standby
    currchain=boot
    currstate=boot
    groups=gpu,ipmi,dx360m3,gpubnt01,gpurack01,all,allgpu
    initrd=xcat/rhels6.1/x86_64/initrd.img
    installnic=eth0
    interface=eth0
    ip=XXXXXX
    kcmdline=nofb utf8 ks=http://sn02/install/autoinst/qgpu0001 ksdevice=eth0 console=tty0 console=ttyS0,115200n8r noipv6
    kernel=xcat/rhels6.1/x86_64/vmlinuz
    mac=e4:1f:13:f0:80:9c
    mgt=ipmi
    mtm=6391AC1
    netboot=pxe
    nfsserver=sn02
    ondiscover=nodediscover
    os=rhels6.1
    postbootscripts=otherpkgs,setupntp,nwu.ipoib
    postscripts=syslog,remoteshell,syncfiles,nwu.ofed
    primarynic=eth0
    profile=gpu
    provmethod=install
    serial=06CGM96
    serialflow=hard
    serialport=0
    serialspeed=115200
    servicenode=sn02
    status=booted
    statustime=07-20-2011 22:23:36
    supportedarchs=x86,x86_64
    switch=gpubnt01
    switchinterface=eth0
    switchport=1
    switchvlan=1
    xcatmaster=sn02

Christian D. Caruthers
Linux HPC Consultant
STG Lab Services
757-656-9675

