Verify you do not have dynamic and static networks overlapping for that
network definition. Also verify you have configured the correct MAC
address for that node in xcat and do not have overlapping MACs/IPs.
What does an lsdef for one of the problem nodes look like?
On 11/26/2013 2:54 PM, Damir Krstic wrote:
We have couple of new x3550m4 that are not installing. Basically
after BMC has been programmed and nodes have been set to install, and
for some reason, pxe boot process never goes beyond serving pxelinux.0
(please see the log file below:
ov 26 14:43:12 mgt dhcpd: DHCPACK on 172.20.7.1 to 40:f2:e9:0d:e2:64
via bond0
Nov 26 14:43:12 mgt atftpd[10629]: Serving pxelinux.0 to
172.20.7.1:1929 <http://172.20.7.1:1929>
Nov 26 14:43:12 mgt atftpd[10629]: tsize option -> 13148
Nov 26 14:43:12 mgt atftpd[10629]: blksize option -> 1468
Nov 26 14:43:12 mgt atftpd[10629]: Server thread exiting
Nov 26 14:43:12 mgt atftpd[10629]: Serving pxelinux.0 to
172.20.7.1:1930 <http://172.20.7.1:1930>
Nov 26 14:43:12 mgt atftpd[10629]: blksize option -> 1468
Nov 26 14:43:12 mgt atftpd[10629]: Server thread exiting
Nov 26 14:43:13 mgt atftpd[10629]: Serving pxelinux.0 to
172.20.7.1:1931 <http://172.20.7.1:1931>
Nov 26 14:43:13 mgt atftpd[10629]: blksize option -> 1468
Nov 26 14:43:13 mgt atftpd[10629]: Server thread exiting
Here is the tcpdump from the management node when this happens:
14:33:20.626124 IP (tos 0x0, ttl 64, id 50528, offset 0, flags
[none], proto: UDP (17), length: 68) <new-node>.informatik-lm > <mgt
node>: [udp sum ok] 40 RRQ "pxelinux.0" octet tsize 0 blksize 1468
in the /tftpboot/pxelinux.cfg directory we have a directory that
corresponds to the hex of the ip for the new node:
[root@mgt pxelinux.cfg]# ls -lrt AC140701
lrwxrwxrwx 1 root root 9 Nov 26 09:28 AC140702 -> ttlogin01
here is the content of the file:
root@mgt pxelinux.cfg]# cat ttlogin01
#install rhels6.2-x86_64-ttlogin6
DEFAULT xCAT
LABEL xCAT
KERNEL xcat/rhels6.2/x86_64/vmlinuz
APPEND initrd=xcat/rhels6.2/x86_64/initrd.img
repo=http://172.20.0.1/install/rhels6.2/x86_64/
ks=http://172.20.0.1/install/autoinst/ttlogin01 ksdevice=eth0 cmdline
console=tty0 console=ttyS0,115200
IPAPPEND 2
For some reason, tftpboot process never proceeds to the pxelinux.cfg
directory after pxelinux.0 is served.
Stateless nodes on this cluster boot fine so I think our tftpboot
environment is OK. It's just these two nodes that have to be
installed that are problematic.
Any help is appreciated.
Thanks,
Damir.
------------------------------------------------------------------------------
Rapidly troubleshoot problems before they affect your business. Most IT
organizations don't have a clear picture of how application performance
affects their revenue. With AppDynamics, you get 100% visibility into your
Java,.NET, & PHP application. Start your 15-day FREE TRIAL of AppDynamics Pro!
http://pubads.g.doubleclick.net/gampad/clk?id=84349351&iu=/4140/ostg.clktrk
_______________________________________________
xCAT-user mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/xcat-user
------------------------------------------------------------------------------
Rapidly troubleshoot problems before they affect your business. Most IT
organizations don't have a clear picture of how application performance
affects their revenue. With AppDynamics, you get 100% visibility into your
Java,.NET, & PHP application. Start your 15-day FREE TRIAL of AppDynamics Pro!
http://pubads.g.doubleclick.net/gampad/clk?id=84349351&iu=/4140/ostg.clktrk
_______________________________________________
xCAT-user mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/xcat-user