Thanks very much again Russell. nodeset indeed was the missing piece. Since I looked at that command as more of a way to connect a node to an image I didn't think it needed to be run again. I now understand the process much better thanks to your help.
Best Regards, Steve On Wed, Nov 4, 2015 at 9:38 AM, Russell Auld <[email protected]> wrote: > The missing piece is nodeset. That will finally update the leases file. > So, in general, you would want to: > makehosts > makedhcp > nodeset > > > On Nov 4, 2015 9:26 AM, Stephen Cousins <[email protected]> wrote: > > > > Hi Russell, > > > > About the master and nameserver: they were both unchanged from prior to > the IP change and the manager is set up as a name server. I didn't change > the entries showing the 127.0.0.1 address and it was working previously so > I figured it would continue to work. I'll give it a try with 172.20.0.1 > though. > > > > The /var/lib/dhcp/dhcpd.leases file had entries with the old IP's so I > figured that maybe that was the problem. I stopped dhcpd and moved the > dhcpd.leases to /tmp just to get it out of there thinking that when I > started dhcpd up again it would create a new one and populate it as nodes > booted up. I created a blank dhcpd.leases file and started up dhcpd but > still the same thing when. > > > > I thought about it more and realized that that file was the missing > piece that I was asking about. That file is what maps the MAC addresses to > the static IP's. So, I moved the original dhcpd.leases file back and ran > > > >> sed -i -e s/172.0.10/172.20.10/g dhcpd.leases > > > > > > and then restarted dhcpd. Success! The nodes boot up with the correct > new IP. > > > > So, I guess my new question is: What is supposed to create or recreate > the dhcpd.leases file? Presumably it would be makedhcp but that doesn't > seem to be doing it. > > > > For completeness, I do have entries in the mac table and I did run chdef > for each node. > > > > Thanks very much for your help. It's always helpful to have someone else > look at things. Especially when the original eyes (mine) don't know exactly > what to look for. > > > > Best regards, > > > > Steve > > > > On Wed, Nov 4, 2015 at 8:34 AM, Russell Auld <[email protected]> > wrote: > >> > >> You will definitely need to change "master" to the IP address that the > nodes will see - 172.20.0.1 > >> You should also set the "nameservers" to something that actually serves > DNS. It wouldn't be a bad thing to run a DNS server on the xCAT master > node, but you don't absolutely have to. > >> Everything else looks ok, but yes, you're right that you do need to > tell xCAT about the nodes and their MACs. > >> Do you see anything in /var/lib/dhcpd.leases? > >> Do you have any nodes defined? (nodels command). > >> You either need to statically define the nodes, via something like > "chdef n101 mac=00:25:90:29:4b:40 groups=all" > >> or use auto node discovery - but something has to set the MAC addresses > in xCAT. > >> > >> > >> ________________________________ > >> From: "Stephen Cousins" <[email protected]> > >> To: "xCAT Users Mailing list" <[email protected]> > >> Sent: Tuesday, November 3, 2015 11:46:48 PM > >> Subject: Re: [xcat-user] {Disarmed} Re: Network change -> dhcpd: > DHCPDISCOVER: no free leases > >> > >> > >> The OS for the head node is CentOS 7 > >> > >> ifconfig -a: > >>> > >>> enp11s0f0: flags=4163<UP,BROADCAST,RUNNING,MULTICAST> mtu 9000 > >>> inet 10.54.200.200 netmask 255.255.0.0 broadcast > 10.54.255.255 > >>> inet6 fe80::225:90ff:fe18:c30e prefixlen 64 scopeid > 0x20<link> > >>> ether 00:25:90:18:c3:0e txqueuelen 1000 (Ethernet) > >>> RX packets 4361702 bytes 3262025237 (3.0 GiB) > >>> RX errors 0 dropped 170 overruns 170 frame 0 > >>> TX packets 2824086 bytes 1195929007 (1.1 GiB) > >>> TX errors 0 dropped 0 overruns 0 carrier 0 collisions 0 > >>> device memory 0xfbe20000-fbe3ffff > >>> > >>> enp11s0f1: flags=4163<UP,BROADCAST,RUNNING,MULTICAST> mtu 1500 > >>> inet 172.20.0.1 netmask 255.255.0.0 broadcast 172.20.255.255 > >>> inet6 fe80::225:90ff:fe18:c30f prefixlen 64 scopeid > 0x20<link> > >>> ether 00:25:90:18:c3:0f txqueuelen 1000 (Ethernet) > >>> RX packets 4376023 bytes 669599743 (638.5 MiB) > >>> RX errors 0 dropped 0 overruns 0 frame 0 > >>> TX packets 11575987 bytes 2473097440 (2.3 GiB) > >>> TX errors 0 dropped 0 overruns 0 carrier 0 collisions 0 > >>> device memory 0xfbee0000-fbefffff > >>> > >>> enp8s0f0: flags=4098<BROADCAST,MULTICAST> mtu 1500 > >>> ether 00:e0:ed:17:70:00 txqueuelen 1000 (Ethernet) > >>> RX packets 0 bytes 0 (0.0 B) > >>> RX errors 0 dropped 0 overruns 0 frame 0 > >>> TX packets 0 bytes 0 (0.0 B) > >>> TX errors 0 dropped 0 overruns 0 carrier 0 collisions 0 > >>> > >>> enp8s0f1: flags=4098<BROADCAST,MULTICAST> mtu 1500 > >>> ether 00:e0:ed:17:70:01 txqueuelen 1000 (Ethernet) > >>> RX packets 0 bytes 0 (0.0 B) > >>> RX errors 0 dropped 0 overruns 0 frame 0 > >>> TX packets 0 bytes 0 (0.0 B) > >>> TX errors 0 dropped 0 overruns 0 carrier 0 collisions 0 > >>> > >>> ib0: flags=4163<UP,BROADCAST,RUNNING,MULTICAST> mtu 65520 > >>> inet 10.55.50.100 netmask 255.255.0.0 broadcast 10.55.255.255 > >>> inet6 fe80::202:c903:7:f353 prefixlen 64 scopeid 0x20<link> > >>> Infiniband hardware address can be incorrect! Please read BUGS section > in ifconfig(8). > >>> infiniband > 80:00:00:48:FE:80:00:00:00:00:00:00:00:00:00:00:00:00:00:00 txqueuelen 256 > (InfiniBand) > >>> RX packets 2407037 bytes 68324440907 (63.6 GiB) > >>> RX errors 0 dropped 0 overruns 0 frame 0 > >>> TX packets 2894334 bytes 95158435534 (88.6 GiB) > >>> TX errors 0 dropped 4 overruns 0 carrier 0 collisions 0 > >>> > >>> lo: flags=73<UP,LOOPBACK,RUNNING> mtu 65536 > >>> inet 127.0.0.1 netmask 255.0.0.0 > >>> inet6 ::1 prefixlen 128 scopeid 0x10<host> > >>> loop txqueuelen 0 (Local Loopback) > >>> RX packets 7685623 bytes 2146959369 (1.9 GiB) > >>> RX errors 0 dropped 0 overruns 0 frame 0 > >>> TX packets 7685623 bytes 2146959369 (1.9 GiB) > >>> TX errors 0 dropped 0 overruns 0 carrier 0 collisions 0 > >> > >> > >> > >> Networks table: > >>> > >>> [root@gromit ~]# tabdump networks > >>> > #netname,net,mask,mgtifname,gateway,dhcpserver,tftpserver,nameservers,ntpservers,logservers,dynamicrange,staticrange,staticrangeincrement,nodehostname,ddnsdomain,vlanid,domain,comments,disable > >>> > "clusternet","172.20.0.0","255.255.0.0","enp11s0f1","<xcatmaster>","172.20.0.1","172.20.0.1","<xcatmaster>","172.20.0.1",,,"172.20.10.101-172.20.10.254",,,,,,, > >>> "publicnet","10.54.0.0","255.255.0.0","enp11s0f0",,,,,,,,,,,,,,, > >>> > "172_21_0_0-255_255_0_0","172.21.0.0","255.255.0.0","ib0","<xcatmaster>",,"172.21.0.1",,,,,,,,,,,, > >> > >> > >> Site: > >>> > >>> master is 127.0.0.1 which was not changed. Should it be 172.20.0.1? > >>> > >>> nameserver too is 127.0.0.1 and unchanged. > >>> > >> xCAT version: 2.9.1 > >> > >> . > >> > >> Do you know how the DHCP server picks up the information about what MAC > addresses are mapped to the IP addresses? I think that ultimately the > information comes from the mac and node and hosts tables but I mean what > files are the DHCP using to pick up that information. I don't see it in the > usual place. > >> > >> Thanks a lot for your help Russell. > >> > >> Steve > >> > >> > >> On Tue, Nov 3, 2015 at 10:41 PM, Russell Auld <[email protected]> > wrote: > >>> > >>> What OS is the head node running? > >>> Can you show what the network settings on the head node are? E.g.: > "ifconfig -a" or "ip addr show" > >>> Can you show the full output of the networks table? > >>> You will definitely need to change some things in the site table. > nameservers and master to start with. > >>> Also, which version of xcat are you using? > >>> > >>> ________________________________ > >>> From: "Stephen Cousins" <[email protected]> > >>> To: "xCAT Users Mailing list" <[email protected]> > >>> Sent: Tuesday, November 3, 2015 3:16:09 PM > >>> Subject: Re: [xcat-user] Network change -> dhcpd: DHCPDISCOVER: no > free leases > >>> > >>> > >>> Anything to look for specifically? Here is what tabdump site shows: > >>> > >>>> #key,value,comments,disable > >>>> "blademaxp","64",, > >>>> "fsptimeout","0",, > >>>> "installdir","/install",, > >>>> "ipmimaxp","64",, > >>>> "ipmiretries","3",, > >>>> "ipmitimeout","2",, > >>>> "consoleondemand","no",, > >>>> "master","127.0.0.1",, > >>>> "forwarders","$our-name-server",, > >>>> "nameservers","127.0.0.1",, > >>>> "maxssh","8",, > >>>> "ppcmaxp","64",, > >>>> "ppcretry","3",, > >>>> "ppctimeout","0",, > >>>> "powerinterval","0",, > >>>> "syspowerinterval","0",, > >>>> "sharedtftp","1",, > >>>> "SNsyncfiledir","/var/xcat/syncfiles",, > >>>> "nodesyncfiledir","/var/xcat/node/syncfiles",, > >>>> "tftpdir","/tftpboot",, > >>>> "xcatdport","3001",, > >>>> "xcatiport","3002",, > >>>> "xcatconfdir","/etc/xcat",, > >>>> "timezone","America/New_York",, > >>>> "useNmapfromMN","no",, > >>>> "enableASMI","no",, > >>>> "db2installloc","/mntdb2",, > >>>> "databaseloc","/var/lib",, > >>>> "sshbetweennodes","ALLGROUPS",, > >>>> "dnshandler","ddns",, > >>>> "vsftp","n",, > >>>> "cleanupxcatpost","no",, > >>>> "dhcplease","43200",, > >>>> "auditnosyslog","0",, > >>>> "dhcpinterfaces","enp11s0f1",, > >>>> "domain","cluster",, > >>> > >>> > >>> Thanks, > >>> > >>> Steve > >>> > >>> On Tue, Nov 3, 2015 at 2:11 PM, Russell Auld <[email protected]> > wrote: > >>>> > >>>> Check the site table. > >>>> > >>>> On Nov 3, 2015 1:14 PM, Stephen Cousins <[email protected]> > wrote: > >>>> > > >>>> > Hi, > >>>> > > >>>> > I had improperly set my network to 172.0.0.0/16 thinking that that > was in private space. I later found that 172.16.0.0/12 is the real > private space so I changed entries in all of the tables to 172.20.*.*. I > also changed entries in /etc/hosts, and then reran makedns -n and makedhcp > -n. After restarting everything DHCP starts up fine but when I boot a node > it comes up with entries like: > >>>> > > >>>> > Nov 3 12:50:49 gromit dhcpd: DHCPDISCOVER from > 00:25:90:29:4b:40 via enp11s0f1: network enp11s0f1: no free leases > >>>> > > >>>> > So, I seem to not have gotten everything changed. The networks > table has a staticrange of: 172.20.10.101-172.20.10.254: > >>>> > > >>>> > > "clusternet","172.20.0.0","255.255.0.0","enp11s0f1","<xcatmaster>","172.20.0.1","172.20.0.1","<xcatmaster>","172.20.0.1",,,"172.20.10.101-172.20.10.254",,,,,,, > >>>> > > >>>> > > >>>> > Here is what the generated dhcpd.conf file looks like: > >>>> > > >>>> >> #xCAT generated dhcp configuration > >>>> >> > >>>> >> authoritative; > >>>> >> option conf-file code 209 = text; > >>>> >> option space isan; > >>>> >> option isan-encap-opts code 43 = encapsulate isan; > >>>> >> option isan.iqn code 203 = string; > >>>> >> option isan.root-path code 201 = string; > >>>> >> option space gpxe; > >>>> >> option gpxe-encap-opts code 175 = encapsulate gpxe; > >>>> >> option gpxe.bus-id code 177 = string; > >>>> >> option user-class-identifier code 77 = string; > >>>> >> option gpxe.no-pxedhcp code 176 = unsigned integer 8; > >>>> >> option tcode code 101 = text; > >>>> >> option iscsi-initiator-iqn code 203 = string; > >>>> >> ddns-update-style interim; > >>>> >> ignore client-updates; > >>>> >> option client-architecture code 93 = unsigned integer 16; > >>>> >> option tcode "America/New_York"; > >>>> >> option gpxe.no-pxedhcp 1; > >>>> >> > >>>> >> omapi-port 7911; > >>>> >> key xcat_key { > >>>> >> algorithm hmac-md5; > >>>> >> secret "==============my_secret==================="; > >>>> >> }; > >>>> >> omapi-key xcat_key; > >>>> >> class "pxe" { > >>>> >> match if substring (option vendor-class-identifier, 0, 9) = > "PXEClient"; > >>>> >> ddns-updates off; > >>>> >> max-lease-time 600; > >>>> >> } > >>>> >> shared-network enp11s0f1 { > >>>> >> subnet 172.20.0.0 netmask 255.255.0.0 { > >>>> >> max-lease-time 43200; > >>>> >> min-lease-time 43200; > >>>> >> default-lease-time 43200; > >>>> >> option routers 172.20.0.1; > >>>> >> next-server 172.20.0.1; > >>>> >> option log-servers 172.20.0.1; > >>>> >> option ntp-servers 172.20.0.1; > >>>> >> option domain-name "cluster"; > >>>> >> option domain-name-servers 172.20.0.1; > >>>> >> option domain-search "cluster"; > >>>> >> zone cluster. { > >>>> >> primary 172.20.0.1; key xcat_key; > >>>> >> } > >>>> >> zone 20.172.IN-ADDR.ARPA. { > >>>> >> primary 172.20.0.1; key xcat_key; > >>>> >> } > >>>> >> if option user-class-identifier = "xNBA" and option > client-architecture = 00:00 { #x86, xCAT Network Boot Agent > >>>> >> always-broadcast on; > >>>> >> filename = "MailScanner has detected a possible fraud > attempt from "172.20.0.1" claiming to be > http://172.20.0.1/tftpboot/xcat/xnba/nets/172.20.0.0_16"; > >>>> >> } else if option user-class-identifier = "xNBA" and option > client-architecture = 00:09 { #x86, xCAT Network Boot Agent > >>>> >> filename = "MailScanner has detected a possible fraud > attempt from "172.20.0.1" claiming to be > http://172.20.0.1/tftpboot/xcat/xnba/nets/172.20.0.0_16.uefi"; > >>>> >> } else if option client-architecture = 00:00 { #x86 > >>>> >> filename "xcat/xnba.kpxe"; > >>>> >> } else if option vendor-class-identifier = "Etherboot-5.4" { > #x86 > >>>> >> filename "xcat/xnba.kpxe"; > >>>> >> } else if option client-architecture = 00:07 { #x86_64 uefi > >>>> >> filename "xcat/xnba.efi"; > >>>> >> } else if option client-architecture = 00:09 { #x86_64 uefi > alternative id > >>>> >> filename "xcat/xnba.efi"; > >>>> >> } else if option client-architecture = 00:02 { #ia64 > >>>> >> filename "elilo.efi"; > >>>> >> } else if option client-architecture = 00:0e { #OPAL-v3 > >>>> >> option conf-file = "MailScanner has detected a possible > fraud attempt from "172.20.0.1" claiming to be > http://172.20.0.1/tftpboot/pxelinux.cfg/p/172.20.0.0_16"; > >>>> >> } else if substring(filename,0,1) = null { #otherwise, provide > yaboot if the client isn't specific > >>>> >> filename "/yaboot"; > >>>> >> } > >>>> >> } # 172.20.0.0/255.255.0.0 subnet_end > >>>> >> } # enp11s0f1 nic_end > >>>> >> > >>>> > I'm not sure how the static MAC addresses come into play with xCAT. > I am used to static entries being like: > >>>> > > >>>> >> > >>>> >> host n101 { > >>>> >> hardware ethernet 00:25:90:29:4b:40; > >>>> >> fixed-address 172.20.10.101; > >>>> >> } > >>>> >> > >>>> > > >>>> > I see that I'll likely have issues with tftpd as well. I'll come > try to work through those when I get there. > >>>> > > >>>> > I guess I have two questions after all of this: > >>>> > > >>>> > 1. What do I need to do now to get things fixed? > >>>> > 2. What would have been the correct way to do what I wanted to do. > >>>> > > >>>> > Thanks for your continued help with this project. > >>>> > > >>>> > Steve > >>>> > > >>>> > -- > >>>> > ________________________________________________________________ > >>>> > Steve Cousins Supercomputer Engineer/Administrator > >>>> > Advanced Computing Group University of Maine System > >>>> > 244 Neville Hall (UMS Data Center) (207) 561-3574 > >>>> > Orono ME 04469 steve.cousins at maine.edu > >>>> > > >>>> > ------------------------------------------------------------------------------ > >>>> _______________________________________________ > >>>> xCAT-user mailing list > >>>> [email protected] > >>>> https://lists.sourceforge.net/lists/listinfo/xcat-user > >>> > >>> > >>> > >>> > >>> -- > >>> ________________________________________________________________ > >>> Steve Cousins Supercomputer Engineer/Administrator > >>> Advanced Computing Group University of Maine System > >>> 244 Neville Hall (UMS Data Center) (207) 561-3574 > >>> Orono ME 04469 steve.cousins at maine.edu > >>> > >>> > >>> > ------------------------------------------------------------------------------ > >>> > >>> _______________________________________________ > >>> xCAT-user mailing list > >>> [email protected] > >>> https://lists.sourceforge.net/lists/listinfo/xcat-user > >>> > >>> > >>> > ------------------------------------------------------------------------------ > >>> > >>> _______________________________________________ > >>> xCAT-user mailing list > >>> [email protected] > >>> https://lists.sourceforge.net/lists/listinfo/xcat-user > >>> > >> > >> > >> > >> -- > >> ________________________________________________________________ > >> Steve Cousins Supercomputer Engineer/Administrator > >> Advanced Computing Group University of Maine System > >> 244 Neville Hall (UMS Data Center) (207) 561-3574 > >> Orono ME 04469 steve.cousins at maine.edu > >> > >> > >> > ------------------------------------------------------------------------------ > >> > >> _______________________________________________ > >> xCAT-user mailing list > >> [email protected] > >> https://lists.sourceforge.net/lists/listinfo/xcat-user > >> > >> > >> > ------------------------------------------------------------------------------ > >> > >> _______________________________________________ > >> xCAT-user mailing list > >> [email protected] > >> https://lists.sourceforge.net/lists/listinfo/xcat-user > >> > > > > > > > > -- > > ________________________________________________________________ > > Steve Cousins Supercomputer Engineer/Administrator > > Advanced Computing Group University of Maine System > > 244 Neville Hall (UMS Data Center) (207) 561-3574 > > Orono ME 04469 steve.cousins at maine.edu > > > > ------------------------------------------------------------------------------ > _______________________________________________ > xCAT-user mailing list > [email protected] > https://lists.sourceforge.net/lists/listinfo/xcat-user > -- ________________________________________________________________ Steve Cousins Supercomputer Engineer/Administrator Advanced Computing Group University of Maine System 244 Neville Hall (UMS Data Center) (207) 561-3574 Orono ME 04469 steve.cousins at maine.edu
------------------------------------------------------------------------------
_______________________________________________ xCAT-user mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/xcat-user
