Verify you do not have dynamic and static networks overlapping for that network definition. Also verify you have configured the correct MAC address for that node in xcat and do not have overlapping MACs/IPs.

What does an lsdef for one of the problem nodes look like?


On 11/26/2013 2:54 PM, Damir Krstic wrote:
We have couple of new x3550m4 that are not installing. Basically after BMC has been programmed and nodes have been set to install, and for some reason, pxe boot process never goes beyond serving pxelinux.0 (please see the log file below:

ov 26 14:43:12 mgt dhcpd: DHCPACK on 172.20.7.1 to 40:f2:e9:0d:e2:64 via bond0 Nov 26 14:43:12 mgt atftpd[10629]: Serving pxelinux.0 to 172.20.7.1:1929 <http://172.20.7.1:1929>
Nov 26 14:43:12 mgt atftpd[10629]: tsize option -> 13148
Nov 26 14:43:12 mgt atftpd[10629]: blksize option -> 1468
Nov 26 14:43:12 mgt atftpd[10629]: Server thread exiting
Nov 26 14:43:12 mgt atftpd[10629]: Serving pxelinux.0 to 172.20.7.1:1930 <http://172.20.7.1:1930>
Nov 26 14:43:12 mgt atftpd[10629]: blksize option -> 1468
Nov 26 14:43:12 mgt atftpd[10629]: Server thread exiting
Nov 26 14:43:13 mgt atftpd[10629]: Serving pxelinux.0 to 172.20.7.1:1931 <http://172.20.7.1:1931>
Nov 26 14:43:13 mgt atftpd[10629]: blksize option -> 1468
Nov 26 14:43:13 mgt atftpd[10629]: Server thread exiting

Here is the tcpdump from the management node when this happens:
14:33:20.626124 IP (tos 0x0, ttl 64, id 50528, offset 0, flags [none], proto: UDP (17), length: 68) <new-node>.informatik-lm > <mgt node>: [udp sum ok] 40 RRQ "pxelinux.0" octet tsize 0 blksize 1468

in the /tftpboot/pxelinux.cfg directory we have a directory that corresponds to the hex of the ip for the new node:

[root@mgt pxelinux.cfg]# ls -lrt AC140701
lrwxrwxrwx 1 root root 9 Nov 26 09:28 AC140702 -> ttlogin01

here is the content of the file:
root@mgt pxelinux.cfg]# cat ttlogin01
#install rhels6.2-x86_64-ttlogin6
DEFAULT xCAT
LABEL xCAT
 KERNEL xcat/rhels6.2/x86_64/vmlinuz
APPEND initrd=xcat/rhels6.2/x86_64/initrd.img repo=http://172.20.0.1/install/rhels6.2/x86_64/ ks=http://172.20.0.1/install/autoinst/ttlogin01 ksdevice=eth0 cmdline console=tty0 console=ttyS0,115200
  IPAPPEND 2

For some reason, tftpboot process never proceeds to the pxelinux.cfg directory after pxelinux.0 is served.

Stateless nodes on this cluster boot fine so I think our tftpboot environment is OK. It's just these two nodes that have to be installed that are problematic.

Any help is appreciated.

Thanks,
Damir.



------------------------------------------------------------------------------
Rapidly troubleshoot problems before they affect your business. Most IT
organizations don't have a clear picture of how application performance
affects their revenue. With AppDynamics, you get 100% visibility into your
Java,.NET, & PHP application. Start your 15-day FREE TRIAL of AppDynamics Pro!
http://pubads.g.doubleclick.net/gampad/clk?id=84349351&iu=/4140/ostg.clktrk


_______________________________________________
xCAT-user mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/xcat-user

------------------------------------------------------------------------------
Rapidly troubleshoot problems before they affect your business. Most IT 
organizations don't have a clear picture of how application performance 
affects their revenue. With AppDynamics, you get 100% visibility into your 
Java,.NET, & PHP application. Start your 15-day FREE TRIAL of AppDynamics Pro!
http://pubads.g.doubleclick.net/gampad/clk?id=84349351&iu=/4140/ostg.clktrk
_______________________________________________
xCAT-user mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/xcat-user

Reply via email to