"dhcpinterfaces","mgmt|eno1",, <<<<<<<<<< --- is this correct?
from your previous output, xcat-test is node name and eth0 is the interface name [A[root@xcat-test ~]# xcatprobe xcatmn -i eth0 Thanks, Casandra Qiu ................................................................... Casandra Hong Qiu Phone: (845) 433-9291, t/l 293-9291 Office: Building 8, 3-B-04 cxh...@us.ibm.com From: John Roche <john.ro...@crick.ac.uk> To: xCAT Users Mailing list <xcat-user@lists.sourceforge.net> Date: 05/28/2020 10:43 AM Subject: [EXTERNAL] Re: [xcat-user] PXE-E18: Server response timeout. Hi Casandra Here is the site table The master address is set to the test server #key,value,comments,disable "blademaxp","64",, "domain","xxxxxxxxxxxxxx",, "fsptimeout","0",, "installdir","/install",, "ipmimaxp","64",, "ipmiretries","3",, "ipmitimeout","2",, "consoleondemand","yes",, "master","10.28.9.183",, "maxssh","8",, "ppcmaxp","64",, "ppcretry","3",, "ppctimeout","0",, "powerinterval","0",, "syspowerinterval","0",, "sharedtftp","1",, "SNsyncfiledir","/var/xcat/syncfiles",, "nodesyncfiledir","/var/xcat/node/syncfiles",, "tftpdir","/tftpboot",, "xcatdport","3001",, "xcatiport","3002",, "xcatconfdir","/etc/xcat",, "timezone","Europe/London",, "useNmapfromMN","no",, "enableASMI","no",, "db2installloc","/mntdb2",, "databaseloc","/var/lib",, "sshbetweennodes","ALLGROUPS",, "dnshandler","ddns",, "vsftp","n",, "cleanupxcatpost","no",, "dhcplease","43200",, "auditnosyslog","0",, "xcatsslversion","TLSv1",, "dhcpinterfaces","mgmt|eno1",, "nmapoptions","--min-rtt-timeout 1s",, "FQDNfirst","y",, "xcatdebugmode","0",, did u run `rinstall` again after you clean up dhcp lease on the prod cluster? Yes I did this, how did u pull the node out from prod cluster to this new server? same ip address/hostname, right? Correct yes Thanks for your help on this JR From: Casandra H Qiu <cxh...@us.ibm.com> Sent: 27 May 2020 18:25 To: xCAT Users Mailing list <xcat-user@lists.sourceforge.net> Subject: Re: [xcat-user] PXE-E18: Server response timeout. if you add master ip address to the /etc/resolv.conf, the hostname should be resolved. did u run `rinstall` again after you clean up dhcp lease on the prod cluster? can u show me node definition here? and master/domain/dhcpinterface on the site table? if you can run rinstall, the mac address should be set and should not get to the dodiscover phase. how did u pull the node out from prod cluster to this new server? same ip address/hostname, right? Thanks, Casandra Qiu ................................................................... Casandra Hong Qiu Phone: (845) 433-9291, t/l 293-9291 Office: Building 8, 3-B-04 cxh...@us.ibm.com Inactive hide details for John Roche ---05/27/2020 01:00:20 PM---Hi Casandra nslookup works for the hostnameJohn Roche ---05/27/2020 01:00:20 PM---Hi Casandra nslookup works for the hostname From: John Roche <john.ro...@crick.ac.uk> To: xCAT Users Mailing list <xcat-user@lists.sourceforge.net> Date: 05/27/2020 01:00 PM Subject: [EXTERNAL] Re: [xcat-user] PXE-E18: Server response timeout. Hi Casandra nslookup works for the hostname but it doesn’t for the IP address could this be a problem? JR From: Casandra H Qiu <cxh...@us.ibm.com> Sent: 27 May 2020 17:14 To: xCAT Users Mailing list <xcat-user@lists.sourceforge.net> Subject: Re: [xcat-user] PXE-E18: Server response timeout. this is example of detect_dhcpd output: # xcatprobe detect_dhcpd -i enP50p3s0f0 -m 70:e2:84:14:29:10 Start to detect DHCP, please wait 10 seconds [INFO] ++++++++++++++++++++++++++++++++++ [INFO] There are 1 servers replied to dhcp discover. [INFO] Server:172.20.253.31 assign IP [172.20.226.1]. The next server is [172.20.253.31]! [INFO] ++++++++++++++++++++++++++++++++++ [INFO] can u check your hostname/ip address? Server: XX.XX.X.XXX assign IP [XX.XX.X.XXX]. The next server is [XX.XX.X.XXX]! [INFO] ß--- this is the prod xcat server, how do I remove this?? Server: XX.XX.X.XXX assign IP [ZZ.ZZ.Z.ZZZ]. The next server is [ZZ.ZZ.Z.ZZZ]! [INFO] <- THIS IS THE NEW TEST XCAT SERVER ^^^^^^ ^^^^^^ |------------------------------------those two should be same --------------| what's the output from nslookup? Thanks, Casandra Qiu ................................................................... Casandra Hong Qiu Phone: (845) 433-9291, t/l 293-9291 Office: Building 8, 3-B-04 cxh...@us.ibm.com Inactive hide details for John Roche ---05/27/2020 11:29:13 AM---This is what I see when doing the rinstall xcat.genesis.dodiscJohn Roche ---05/27/2020 11:29:13 AM---This is what I see when doing the rinstall xcat.genesis.dodiscovery: My findme request is still unde From: John Roche <john.ro...@crick.ac.uk> To: xCAT Users Mailing list <xcat-user@lists.sourceforge.net> Date: 05/27/2020 11:29 AM Subject: [EXTERNAL] Re: [xcat-user] PXE-E18: Server response timeout. This is what I see when doing the rinstall xcat.genesis.dodiscovery: My findme request is still under processing, do not send new request xcat.genesis.dodiscovery: My findme request is still under processing, do not send new request xcat.genesis.minixcatd: The request is already processed by xCAT master, but not matched. xcat.genesis.dodiscovery: Beginning echo information to discovery packet file... xcat.genesis.dodiscovery: Discovery packet file is ready. xcat.genesis.dodiscovery: Sending the discovery packet to xCAT (XX.XX.X.XXX:3001)... Thanks JR From: John Roche <john.ro...@crick.ac.uk> Sent: 27 May 2020 16:01 To: xCAT Users Mailing list <xcat-user@lists.sourceforge.net> Subject: Re: [xcat-user] PXE-E18: Server response timeout. Hi Casandra Thanks for the reply, Ran those commands you And also did a nodepurge cloud001 on the prod server And It’s still picking up the prod servers address when doing an rinstall ( John From: Casandra H Qiu <cxh...@us.ibm.com> Sent: 27 May 2020 15:42 To: xCAT Users Mailing list <xcat-user@lists.sourceforge.net> Subject: Re: [xcat-user] PXE-E18: Server response timeout. did u run this command on XX.XX.X.XXX server? makedhcp -d nodename and two more command to clean up from prod cluster, you should remove the node definition too, at lease need to remove mac address. makedns -d nodename makegocons -d nodename then on the new test cluster: makedns nodename makegocons nodename makedhcp nodename nslookup nodename <<<<<--- make sure it return correct address thanks, Casandra Qiu ................................................................... Casandra Hong Qiu Phone: (845) 433-9291, t/l 293-9291 Office: Building 8, 3-B-04 cxh...@us.ibm.com Inactive hide details for John Roche ---05/27/2020 10:22:47 AM---Hi ddj, Yes I’m pulling a few nodes from the prod cluster,John Roche ---05/27/2020 10:22:47 AM---Hi ddj, Yes I’m pulling a few nodes from the prod cluster, From: John Roche <john.ro...@crick.ac.uk> To: xCAT Users Mailing list <xcat-user@lists.sourceforge.net> Date: 05/27/2020 10:22 AM Subject: [EXTERNAL] Re: [xcat-user] PXE-E18: Server response timeout. Hi ddj, Yes I’m pulling a few nodes from the prod cluster, I did makedhcp -d nodename and this sort of helped It does a pxe boot, but still picks up the prod xcat server address xcatprobe detect_dhcpd -i <MN interface name> -m <node's mac address> Server: XX.XX.X.XXX assign IP [XX.XX.X.XXX]. The next server is [XX.XX.X.XXX]! [INFO] ß--- this is the prod xcat server, how do I remove this?? Server: XX.XX.X.XXX assign IP [ZZ.ZZ.Z.ZZZ]. The next server is [ZZ.ZZ.Z.ZZZ]! [INFO] <- THIS IS THE NEW TEST XCAT SERVER Is there any way to remove this, will this help? JR From: David D Johnson <david_john...@brown.edu> Sent: 27 May 2020 12:23 To: xCAT Users Mailing list <xcat-user@lists.sourceforge.net> Subject: Re: [xcat-user] PXE-E18: Server response timeout. When you have two dhcp servers on the same wire/subnet, need to make sure that the MAC address is only recognized by one of them. If the "production" server is running xcat, use makedhcp -d nodename to get rid of the association over there, and presume (or check, with makedhcp -q nodename) that it's registered in your new cluster. My guess is that you pulled out a compute node from the production cluster to play with. -- ddj On May 27, 2020, at 5:08 AM, John Roche < john.ro...@crick.ac.uk> wrote: [A[root@xcat-test ~]# xcatprobe xcatmn -i eth0 [mn]: Checking all xCAT daemons are running... [ OK ] [mn]: Checking xcatd can receive command request... [ OK ] [mn]: Checking 'site' table is configured... [ OK ] [mn]: Checking provision network is configured... [ OK ] [mn]: Checking 'passwd' table is configured... [ OK ] [mn]: Checking important directories (installdir,tftpdir) are configured... [ OK ] [mn]: Checking SELinux is disabled... [ OK ] [mn]: Checking HTTP service is configured... [ OK ] [mn]: Checking TFTP service is configured... [ OK ] [mn]: Checking DNS service is configured... [ OK ] [mn]: Checking DHCP service is configured... [ OK ] [mn]: Checking NTP service is configured... [ OK ] [mn]: Checking rsyslog service is configured... [ OK ] [mn]: Checking firewall is disabled... [ OK ] [mn]: Checking minimum disk space for xCAT ['/var' needs 1GB;'/install' needs 10GB;'/tmp' needs 1GB]... [ OK ] [mn]: Checking Linux ulimits configuration... [ OK ] [mn]: Checking network kernel parameter configuration... [ OK ] [mn]: Checking xCAT daemon attributes configuration... [ OK ] [mn]: Checking xCAT log is stored in /var/log/xcat/cluster.log... [ OK ] [mn]: Checking xCAT management node IP: <10.28.9.183> is configured to static... [ OK ] [mn]: Checking dhcpd.leases file is less than 100M... [ OK ] =================================== SUMMARY ==================================== [MN]: Checking on MN... [ OK ] That’s all good xcatprobe detect_dhcpd -i <MN interface name> -m <node's mac address> Start to detect DHCP, please wait 10 seconds [INFO] +++++++++++++++++++++++++++ +++++++ [INFO] There are 3 servers replied to dhcp discover. [INFO] Server: XX.XX.X.XXX assign IP [XX.XX.X.XXX]. The next server is [XX.XX.X.XXX]! [INFO] ß--- this is the prod xcat server, how do I remove this?? Server: XX.XX.X.XXX assign IP [ZZ.ZZ.Z.ZZZ]. The next server is [ZZ.ZZ.Z.ZZZ]! [INFO] <- THIS IS THE NEW TEST XCAT SERVER Server:JJ.JJ.JJ.J assign IP [JJ.JJ.J.J]. The next server is [ZZ.ZZ.Z.ZZZ]! [INFO] +++++++++++++++++++++++++++ +++++++ [INFO] [root@xcat-test ~]# From: Casandra H Qiu < cxh...@us.ibm.com> Sent: 26 May 2020 17:17 To: xCAT Users Mailing list < xcat-user@lists.sourceforge.net > Subject: Re: [xcat-user] PXE-E18: Server response timeout. try those two command to diagnosis if there are configure issues xcatprobe xcatmn -i <MN interface name> xcatprobe detect_dhcpd -i <MN interface name> -m <node's mac address> Thanks, Casandra Qiu ................................................................... Casandra Hong Qiu Phone: (845) 433-9291, t/l 293-9291 Office: Building 8, 3-B-04 cxh...@us.ibm.com <image001.gif>John Roche ---05/26/2020 12:11:36 PM---Tried this No luck From: John Roche < john.ro...@crick.ac.uk> To: xCAT Users Mailing list < xcat-user@lists.sourceforge.net > Date: 05/26/2020 12:11 PM Subject: Re: [xcat-user] [External] PXE-E18: Server response timeout. Tried this No luck How did people set up the dhpc part for doing a pxe boot? John From: Jarrod Johnson < jjohns...@lenovo.com> Sent: 22 May 2020 17:25 To: xCAT-user@lists.sourceforge.net Subject: Re: [xcat-user] [External] PXE-E18: Server response timeout. Hmm, nodeset <nodename> stat Rinstall thould have taken care of that, but checking... From: John Roche <john.ro...@crick.ac.uk> Sent: Friday, May 22, 2020 8:40 AM To: xCAT-user@lists.sourceforge.net < xCAT-user@lists.sourceforge.net> Subject: [External] [xcat-user] PXE-E18: Server response timeout. Hi, I recently build an xcat server Rpower works But doing an rinstall, it keeps failing at the PXE boot I’m doing rinstal <node name> -u It uses UEFI I’m starting to pull my hair now on what’s the issue Here is the error I see when I’m doing an rcons. Thanks Boot Failed - PXE Network: PciRoot(0x0)/Pci (0x2,0x0)/Pci(0x0,0x0)/Ctrl(0x1)/MAC (E41D2DDE7931,0x0)/IPv4 (0.0.0.0,0x0,DHCP,0.0.0.0,0.0.0.0,0.0.0.0) >>Start PXE over IPv4. PXE-E18: Server response timeout. >>Start PXE over IPv4. PXE-E18: Server response timeout. Boot Failed - PXE Network: PciRoot(0x0)/Pci (0x2,0x0)/Pci(0x0,0x0)/Ctrl(0x2)/MAC (E41D2DDE7932,0x0)/IPv4 (0.0.0.0,0x0,DHCP,0.0.0.0,0.0.0.0,0.0.0.0) >>Start PXE over IPv4. The Francis Crick Institute Limited is a registered charity in England and Wales no. 1140062 and a company registered in England and Wales no. 06885462, with its registered office at 1 Midland Road London NW1 1AT The Francis Crick Institute Limited is a registered charity in England and Wales no. 1140062 and a company registered in England and Wales no. 06885462, with its registered office at 1 Midland Road London NW1 1AT _______________________________________________ xCAT-user mailing list xCAT-user@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/xcat-user The Francis Crick Institute Limited is a registered charity in England and Wales no. 1140062 and a company registered in England and Wales no. 06885462, with its registered office at 1 Midland Road London NW1 1AT _______________________________________________ xCAT-user mailing list xCAT-user@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/xcat-user The Francis Crick Institute Limited is a registered charity in England and Wales no. 1140062 and a company registered in England and Wales no. 06885462, with its registered office at 1 Midland Road London NW1 1AT _______________________________________________ xCAT-user mailing list xCAT-user@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/xcat-user The Francis Crick Institute Limited is a registered charity in England and Wales no. 1140062 and a company registered in England and Wales no. 06885462, with its registered office at 1 Midland Road London NW1 1AT The Francis Crick Institute Limited is a registered charity in England and Wales no. 1140062 and a company registered in England and Wales no. 06885462, with its registered office at 1 Midland Road London NW1 1AT _______________________________________________ xCAT-user mailing list xCAT-user@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/xcat-user The Francis Crick Institute Limited is a registered charity in England and Wales no. 1140062 and a company registered in England and Wales no. 06885462, with its registered office at 1 Midland Road London NW1 1AT _______________________________________________ xCAT-user mailing list xCAT-user@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/xcat-user The Francis Crick Institute Limited is a registered charity in England and Wales no. 1140062 and a company registered in England and Wales no. 06885462, with its registered office at 1 Midland Road London NW1 1AT _______________________________________________ xCAT-user mailing list xCAT-user@lists.sourceforge.net https://urldefense.proofpoint.com/v2/url?u=https-3A__lists.sourceforge.net_lists_listinfo_xcat-2Duser&d=DwICAg&c=jf_iaSHvJObTbx-siA1ZOg&r=n1LR_Py9TQX0dVqfGTbLHUMGx25-C8VtBDS0nCzyNXY&m=PLRL8iFiPji4r_6iqudmAElVB25ug0riOQi7R83WiXA&s=IBUXAcujswvqsEBYLnkFX3OUi6MxnP3QGdrWVDKrEsM&e=
_______________________________________________ xCAT-user mailing list xCAT-user@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/xcat-user