Yes, that is correct. Below is the output:

[root@hpc3-14-03 ~]# ip a
1: lo: <LOOPBACK,UP,LOWER_UP> mtu 65536 qdisc noqueue state UNKNOWN group default qlen 1000
    link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
    inet 127.0.0.1/8 scope host lo
       valid_lft forever preferred_lft forever
    inet6 ::1/128 scope host
       valid_lft forever preferred_lft forever
2: eno1: <BROADCAST,MULTICAST> mtu 1500 qdisc noop state DOWN group default qlen 1000
    link/ether 08:f1:ea:9e:c7:60 brd ff:ff:ff:ff:ff:ff
3: eno2: <BROADCAST,MULTICAST> mtu 1500 qdisc noop state DOWN group default qlen 1000
    link/ether 08:f1:ea:9e:c7:61 brd ff:ff:ff:ff:ff:ff
4: eno3: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc mq state UP group default qlen 1000
    link/ether 08:f1:ea:e4:35:52 brd ff:ff:ff:ff:ff:ff
    inet 10.240.58.16/23 brd 10.240.59.255 scope global eno3
       valid_lft forever preferred_lft forever
    inet6 fe80::af1:eaff:fee4:3552/64 scope link
       valid_lft forever preferred_lft forever
....

What I have noticed is that the dhcpd.leases file does not have an entry with a "fixed-address" line like the following:

host hpc3-gpu-16-03 {
  dynamic;
  hardware ethernet 20:67:7c:10:ba:86;
  uid 20:67:7c:10:ba:86;
  fixed-address 10.240.58.61;
  supersede server.ddns-hostname = "hpc3-gpu-16-03";
  supersede host-name = "hpc3-gpu-16-03";
  if option user-class-identifier = "xNBA" and option client-architecture = 00:00 {
    supersede server.filename = "http://${next-server}:80/tftpboot/xcat/xnba/nodes/hpc3-gpu-16-03";
  } elsif option client-architecture = 00:00 {
    supersede server.filename = "xcat/xnba.kpxe";
  } else {
    supersede server.filename = "";
  }
}

I am assuming the lease file got messed up somehow. What are your thoughts on reconstructing the file (programmatically) and using the modified file? Or is there another way from within xCAT to add entries to the dhcpd leases file?

Thanks.

On Wed, Sep 1, 2021 at 9:20 AM Russell Jones <arjone...@gmail.com> wrote:

> Is the mac correct for the node?
>
> On Wed, Sep 1, 2021 at 11:02 AM Imam Toufique <techie...@gmail.com> wrote:
>
>> Hi,
>>
>> Need your helpful thoughts here with a problem we have, please.
>>
>> We have nodes that were provisioned with xcat, they are running, OS is
>> working and installed. The boot order is set to PXE first, SSD 2nd.
>>
>> Several days ago, when I rebooted one of the nodes, it went straight to
>> PXE discovery mode - attempting for an install. This is a node that is
>> built, it should have exited the PXE boot mode and boot off the disk, but
>> it never did.
>>
>> I am not sure what's going on, it looks like xcat has lost the status of
>> the node, whether it is installed or not ( need provisioning?)
>>
>> Here is the 'lsdef' output of the node:
>>
>> ```
>> [root@mn] lsdef -t node hpc3-14-03
>> Object name: hpc3-14-03
>>     arch=x86_64
>>     cpucount=40
>>     cputype=Intel(R) Xeon(R) Gold 6148 CPU @ 2.40GHz
>>     currchain=boot
>>     currstate=boot
>>     disksize=sda:224GB,sdb:224GB
>>     groups=centos78
>>     ip=10.240.58.16
>>     mac=08:f1:ea:e4:35:52
>>     memory=193122MB
>>     mtm=HPE:ProLiant XL170r Gen10
>>     netboot=xnba
>>     nichostnamesuffixes.ib0=-ib0
>>     nichostnamesuffixes.ipmi=-ipmi
>>     nicips.ib0=10.240.60.16
>>     nicips.ipmi=10.240.62.16
>>     os=centos7.7
>>     postbootscripts=otherpkgs,hpc3-postscripts/hpc3postbootscript
>>     postscripts=syslog,remoteshell,syncfiles,setupntp,hpc3-postscripts/hpc3postscript.1,confignetwork -s
>>     profile=compute
>>     provmethod=centos7.8-x86_64-install-compute
>>     serial=2M294204L9
>>     status=booted
>>     statustime=06-21-2021 16:41:45
>>     supportedarchs=x86,x86_64
>> ```
>>
>> ```
>> [root@mn]# nodediscoverls |grep 14-03
>> 38363730-3535-324D-3239-343230344C39  hpc3-14-03  manual  HPE:ProLiant XL170r Gen10  2M294204L9
>> ```
>>
>> ```
>> [root@mn]# lsdef -t network compute_net_1
>> Object name: compute_net_1
>>     domain=local
>>     dynamicrange=10.240.58.221-10.240.58.240
>>     gateway=10.240.58.1
>>     mask=255.255.254.0
>>     mgtifname=eno1
>>     mtu=1500
>>     nameservers=10.240.58.4,8.8.8.8,128.200.192.202
>>     net=10.240.58.0
>>     staticrange=10.240.58.4-10.240.59.220
>>     tftpserver=<xcatmaster>
>> ```
>>
>> Any idea what might be going on here? Why an already setup/installed
>> node is going back to discovery ( and wanting to be installed) mode?
>>
>> Can someone please shed some light?
>>
>> thanks a lot!
>>
>>
>> _______________________________________________
>> xCAT-user mailing list
>> xCAT-user@lists.sourceforge.net
>> https://lists.sourceforge.net/lists/listinfo/xcat-user
>>
> _______________________________________________
> xCAT-user mailing list
> xCAT-user@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/xcat-user
>

--
Regards,
*Imam Toufique*
*213-700-5485*
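
On the question above about rebuilding dhcpd.leases programmatically: before rewriting anything, it may help to list which host entries actually lack a fixed-address line. The following is a minimal Python sketch under stated assumptions, not an xCAT tool. The function names (host_blocks, fixed_addresses) and the SAMPLE text are made up for illustration, and the second sample entry is hypothetical; it is not copied from the real lease file.

```python
import re

# "host NAME {" at the start of a declaration; the lookbehind keeps
# words such as "ddns-hostname" from being mistaken for a host keyword.
HOST_START = re.compile(r'(?<![\w.-])host\s+([^\s{]+)\s*\{')
FIXED_ADDR = re.compile(r'\bfixed-address\s+([0-9.]+)\s*;')


def host_blocks(text):
    """Yield (name, body) for every `host NAME { ... }` block in text."""
    for match in HOST_START.finditer(text):
        depth, in_string, i = 1, False, match.end()
        while i < len(text) and depth > 0:
            ch = text[i]
            if in_string:
                if ch == '\\':
                    i += 1              # skip the escaped character
                elif ch == '"':
                    in_string = False
            elif ch == '"':
                in_string = True        # braces inside quotes are ignored
            elif ch == '{':
                depth += 1
            elif ch == '}':
                depth -= 1
            i += 1
        if depth == 0:                  # only yield blocks that are closed
            yield match.group(1), text[match.end():i - 1]


def fixed_addresses(text):
    """Map each host name to its fixed-address, or None if the line is absent.

    A later block for the same host replaces an earlier one in the result.
    """
    result = {}
    for name, body in host_blocks(text):
        found = FIXED_ADDR.search(body)
        result[name] = found.group(1) if found else None
    return result


# Made-up sample text: the first entry is abridged from the one quoted in
# this thread, the second is a hypothetical entry without fixed-address.
SAMPLE = '''
host hpc3-gpu-16-03 {
  dynamic;
  hardware ethernet 20:67:7c:10:ba:86;
  fixed-address 10.240.58.61;
  if option client-architecture = 00:00 {
    supersede server.filename = "http://${next-server}:80/tftpboot/xcat/xnba/nodes/hpc3-gpu-16-03";
  } else {
    supersede server.filename = "";
  }
}
host hpc3-14-03 {
  dynamic;
  hardware ethernet 08:f1:ea:e4:35:52;
}
'''

if __name__ == "__main__":
    for name, address in fixed_addresses(SAMPLE).items():
        print(name, address or "NO fixed-address")
```

To try it on a real file, read the file's text and pass it to fixed_addresses(); where dhcpd.leases lives depends on the distribution. The sketch only reads and reports. It does not write a new lease file, and it does not handle "#" comment lines that happen to contain quotes or braces.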