XCATBYPASS means you are not running with xcatd. If the problem is running with xcatd, I would like to understand why. That would be a bug.
Lissa K. Valletta 8-3/B10 Poughkeepsie, NY 12601 (tie 293) 433-3102 From: Michael Robbert <mrobb...@mines.edu> To: <xcat-user@lists.sourceforge.net>, Date: 08/16/2013 01:10 PM Subject: Re: [xcat-user] New nodes not deploying Lissa, When I tried steps 1 and 2 it worked. I haven't had time to look at it deeper so I don't know if something else changed since the last time I tried or if XCATBYPASS is what fixed it. I'll try to let you know if I do find anything, but for now don't worry about it. Mike On 8/14/13 4:00 AM, Lissa Valletta wrote: > I have a couple of debug ideas. > 1) export XCATBYPASS=1 ( that makes you run without the daemon for > debugging) > 2) run your nodeset command -- do you see errors > > 3) unset XCATBYPASS > service xcatd stop > /opt/xcat/sbin/xcatd -f ( that runs the daemon in the foreground > In another window, run nodeset, Do you see anything errors being > displayed the daemon > > 4) Get out of this by service xcatd start in the other window. That > should shutdown the xcatd -f and bring you up normal. If not, ps -ef > | grep xcatd and kill -9 any xcatd process and then service xcatd restart. > > If none of this works. Can we get a copy of your database. I would > tabprune -a auditlog ( if you are using auditlog) and then > dumpxCATdb, tar and compress it and send it to lis...@us.ibm.com. > The only other option is can I get on the system. You can send a note > to that same userid. > > Are you just using sqlite ( the default database)? > > Lissa K. Valletta > 8-3/B10 > Poughkeepsie, NY 12601 > (tie 293) 433-3102 > > > > Inactive hide details for Michael Robbert ---08/09/2013 11:33:00 > AM---Here is a diff comparing a working and non-working node. Michael > Robbert ---08/09/2013 11:33:00 AM---Here is a diff comparing a working > and non-working node. [root@mgmt ~]# diff /tmp/compute004.def /tm > > From: Michael Robbert <mrobb...@mines.edu> > To: xCAT Users Mailing list <xcat-user@lists.sourceforge.net>, > Date: 08/09/2013 11:33 AM > Subject: Re: [xcat-user] New nodes not deploying > > ------------------------------------------------------------------------ > > > > Here is a diff comparing a working and non-working node. > > [root@mgmt ~]# diff /tmp/compute004.def /tmp/compute084.def > 1c1 > < Object name: compute004 > --- > > Object name: compute084 > 3c3 > < bmc=compute004-bmc > --- > > bmc=compute084-bmc > 7c7,8 > < currstate=netboot centos6.3-x86_64-compute > --- > > currchain=runcmd=standby > > currstate=runcmd=standby > 9c10 > < initrd=xcat/osimage/mycomputeimage/initrd-stateless.gz > --- > > initrd=xcat/genesis.fs.x86_64.lzma > 11,14c12,15 > < ip=172.17.8.4 > < > kcmdline=imgurl=http://172.17.0.1:80//install/netboot/centos6.3/x86_64/compute/rootimg.gz > XCAT=!myipfn!:3001 NODE=compute004 ifname=eth0:00:30:48:f2:87:c4 > netdev=eth0 console=tty0 console=ttyS0,115200n8r > < kernel=xcat/osimage/mycomputeimage/kernel > < mac=00:30:48:f2:87:c4 > --- > > ip=172.17.8.84 > > kcmdline=quiet console=tty0 console=ttyS0,115200 > xcatd=172.17.0.1:3001 destiny=runcmd=standby > > kernel=xcat/genesis.kernel.x86_64 > > mac=00:25:90:19:92:52 > 21c22 > < otherinterfaces=compute004-bmc:172.17.32.4 > --- > > otherinterfaces=compute084-bmc:172.17.32.84 > 32,33c33,34 > < status=failed > < statustime=08-05-2013 16:49:52 > --- > > status=configuring > > statustime=08-08-2013 13:35:03 > 35,36d35 > < updatestatus=synced > < updatestatustime=07-29-2013 15:36:16 > > > The only differences I see are IPs and node unique numbering plus the > things that I think nodeset should be changing for me. > I run this: > > [root@mgmt ~]# nodeset compute084 osimage=mycomputeimage > compute084: netboot centos6.3-x86_64-compute > > and nothing changes. > I have tried running xcatdebug to see if I could see what is happening > under the covers, but when I do the daemon stops responding to commands > and needs to be restarted. > Also regarding the switch name, it is being populated by the > nodediscovery process. I'm not using switch discovery though, just > sequential discovery. > > Mike > ________________________________________ > From: Lissa Valletta [lis...@us.ibm.com] > Sent: Friday, August 09, 2013 7:55 > To: xCAT Users Mailing list > Subject: Re: [xcat-user] New nodes not deploying > > It the last suggestion does not work, then I would take the two lsdef's > and remove any attribute in the bad node that is not in the good > node. until you have the exact same attributes defined in the bad node > as in the good node. > > Lissa K. Valletta > 8-3/B10 > Poughkeepsie, NY 12601 > (tie 293) 433-3102 > > > > [Inactive hide details for Michael Robbert ---08/08/2013 03:30:23 > PM---Didn't work. I recreated dhcp with those commands. I had]Michael > Robbert ---08/08/2013 03:30:23 PM---Didn't work. I recreated dhcp with > those commands. I had run makedhcp before, but probably not with > > From: Michael Robbert <mrobb...@mines.edu> > To: <xcat-user@lists.sourceforge.net>, > Date: 08/08/2013 03:30 PM > Subject: Re: [xcat-user] New nodes not deploying > > ________________________________ > > > > Didn't work. I recreated dhcp with those commands. I had run makedhcp > before, but probably not with -n. The nodes are and were showing up in > the leases file so that didn't appear to be the problem. I also made > some changes to the nodehm table so that the serial attributes are > showing up on the new nodes. Still nodeset runs without error, but > doesn't change the tftp config file for the node. > I'm just not sure how to debug this. It is just failing silently. > > Thanks for any tips, > Mike > > On 8/8/13 5:16 AM, Lissa Valletta wrote: > > Thanks for all the good information! > > > > Did you run makedhcp -a to pick up the new nodes . Also you will need > > to run makeconservercf to pick up the new nodes. > > You might need to run makedhcp -n followed by makedhcp -a. > > > > Also these attributes are missing in the new nodes > > serialflow=hard > > serialport=0 > > serialspeed=115200 > > > > > > Lissa K. Valletta > > 8-3/B10 > > Poughkeepsie, NY 12601 > > (tie 293) 433-3102 > > > > > > > > Inactive hide details for Michael Robbert ---08/07/2013 06:30:05 > > PM---I've got a small test cluster with working stateless nodeMichael > > Robbert ---08/07/2013 06:30:05 PM---I've got a small test cluster with > > working stateless nodes. Recently I tried to add 2 more nodes an > > > > From: Michael Robbert <mrobb...@mines.edu> > > To: <xcat-user@lists.sourceforge.net>, > > Date: 08/07/2013 06:30 PM > > Subject: [xcat-user] New nodes not deploying > > > > ------------------------------------------------------------------------ > > > > > > > > I've got a small test cluster with working stateless nodes. Recently I > > tried to add 2 more nodes and I can't get them to deploy the same > > stateless image. For some reason the tftp configuration files are > > getting touched when I run nodeset, but not changed to point to the > > correct boot images. They are staying with genesis boot images. > > I have tried various incarnations of nodeadd and nodediscover, always > > followed by a nodeset $nodename osimage=mycomputeimage > > The nodeset command will change /tftpboot/xcat/xnba/nodes/$nodename file > > for the working nodes and it updates the timestamp for the non-working > > nodes, but the file still points to the genesis boot images. Am I > > missing a step? > > > > Here is my setup. > > xCAT server: > > > > [root@mgmt nodes]# cat /etc/redhat-release > > CentOS release 6.4 (Final) > > > > [root@mgmt nodes]# rpm -qa|grep -i xcat > > ipmitool-xcat-1.8.11-3.x86_64 > > xCAT-buildkit-2.8.2-snap201307222332.noarch > > xCAT-UI-2.8.2-snap201307222329.noarch > > yaboot-xcat-1.3.17-rc1.noarch > > xCAT-client-2.8.2-snap201307222328.noarch > > xCAT-2.8.2-snap201307222333.x86_64 > > xCAT-genesis-base-x86_64-2.8-snap201305300347.noarch > > perl-xCAT-2.8.2-snap201307222328.noarch > > conserver-xcat-8.1.16-10.x86_64 > > xCAT-UI-deps-2.8-2.noarch > > syslinux-xcat-3.86-2.noarch > > xCAT-genesis-scripts-x86_64-2.8.2-snap201307222333.noarch > > elilo-xcat-3.14-4.noarch > > openslp-xcat-1.2.1-1.x86_64 > > xCAT-server-2.8.2-snap201307222328.noarch > > > > [root@mgmt nodes]# lsdef -t osimage mycomputeimage > > Object name: mycomputeimage > > exlist=/install/custom/netboot/centos/compute.exlist > > imagetype=linux > > osarch=x86_64 > > osname=Linux > > osvers=centos6.3 > > otherpkgdir=/install/post/otherpkgs/centos6.3/x86_64 > > > otherpkglist=/install/custom/netboot/centos/compute.otherpkgs.pkglist > > permission=755 > > pkgdir=/install/centos6.3/x86_64 > > pkglist=/install/custom/netboot/centos/compute.pkglist > > postbootscripts=configiba > > postinstall=/install/custom/netboot/centos/compute.postinstall > > postscripts=configiba,syncfiles > > profile=compute > > provmethod=netboot > > rootimgdir=/install/netboot/centos6.3/x86_64/compute > > synclists=/install/custom/netboot/centos/compute.synclist > > > > This is a previously configured and currently working node: > > [root@mgmt nodes]# lsdef compute004 > > Object name: compute004 > > arch=x86_64 > > bmc=compute004-bmc > > bmcport=0 > > chain=runcmd=standby > > cons=ipmi > > currstate=netboot centos6.3-x86_64-compute > > groups=compute,ipmi,all > > initrd=xcat/osimage/mycomputeimage/initrd-stateless.gz > > installnic=eth0 > > ip=172.17.8.4 > > > > > kcmdline=imgurl=http://172.17.0.1:80//install/netboot/centos6.3/x86_64/compute/rootimg.gz > > > > XCAT=!myipfn!:3001 NODE=compute004 ifname=eth0:00:30:48:f2:87:c4 > > netdev=eth0 console=tty0 console=ttyS0,115200n8r > > kernel=xcat/osimage/mycomputeimage/kernel > > mac=00:30:48:f2:87:c4 > > mgt=ipmi > > netboot=xnba > > nfsserver=172.17.0.1 > > nodetype=osi > > ondiscover=nodediscover > > os=centos6.3 > > otherinterfaces=compute004-bmc:172.17.32.4 > > postbootscripts=otherpkgs,setroute > > > > > postscripts=syslog,remoteshell,syncfiles,setupntp,addexternalyum,configiba,orangefs,setroute > > power=ipmi > > primarynic=eth0 > > profile=compute > > provmethod=mycomputeimage > > routenames=ct_gc,ct_gc_eth > > serialflow=hard > > serialport=0 > > serialspeed=115200 > > status=failed > > statustime=08-05-2013 16:49:52 > > tftpserver=172.17.0.1 > > updatestatus=synced > > updatestatustime=07-29-2013 15:36:16 > > > > This is one of the new nodes that isn't working: > > [root@mgmt nodes]# lsdef compute084 > > Object name: compute084 > > arch=x86_64 > > chain=runcmd=standby > > currchain=runcmd=standby > > currstate=runcmd=standby > > groups=compute > > initrd=xcat/genesis.fs.x86_64.lzma > > installnic=eth0 > > ip=172.17.8.84 > > kcmdline=quiet xcatd=172.17.0.1:3001 destiny=runcmd=standby > > kernel=xcat/genesis.kernel.x86_64 > > mac=00:25:90:19:92:52 > > mgt=ipmi > > netboot=xnba > > nfsserver=172.17.0.1 > > nodetype=osi > > ondiscover=nodediscover > > os=centos6.3 > > otherinterfaces=compute084-bmc:172.17.32.84 > > postbootscripts=otherpkgs,setroute > > > > > postscripts=syslog,remoteshell,syncfiles,setupntp,addexternalyum,configiba,orangefs,setroute > > primarynic=eth0 > > profile=compute > > provmethod=mycomputeimage > > routenames=ct_gc,ct_gc_eth > > serial=P1715140 > > status=booting > > statustime=08-07-2013 14:43:40 > > supportedarchs=x86,x86_64 > > switch=Binary file (standard input) matches > > switchport=Binary file (standard input) matches > > tftpserver=172.17.0.1 > > > > Let me know what else you want to see. > > > > Thanks, > > Mike Robbert > > Colorado School of Mines > > > > /(See attached file: > > > smime.p7s)/------------------------------------------------------------------------------ > > Get 100% visibility into Java/.NET code with AppDynamics Lite! > > It's a free troubleshooting tool designed for production. > > Get down to code-level detail for bottlenecks, with <2% overhead. > > Download for free and get started troubleshooting in minutes. > > > http://pubads.g.doubleclick.net/gampad/clk?id=48897031&iu=/4140/ostg.clktrk_______________________________________________ > > xCAT-user mailing list > > xCAT-user@lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/xcat-user > > > > > > > > > ------------------------------------------------------------------------------ > > Get 100% visibility into Java/.NET code with AppDynamics Lite! > > It's a free troubleshooting tool designed for production. > > Get down to code-level detail for bottlenecks, with <2% overhead. > > Download for free and get started troubleshooting in minutes. > > > http://pubads.g.doubleclick.net/gampad/clk?id=48897031&iu=/4140/ostg.clktrk > > > > > > > > _______________________________________________ > > xCAT-user mailing list > > xCAT-user@lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/xcat-user > > > > [attachment "smime.p7s" deleted by Lissa Valletta/Poughkeepsie/IBM] > ------------------------------------------------------------------------------ > Get 100% visibility into Java/.NET code with AppDynamics Lite! > It's a free troubleshooting tool designed for production. > Get down to code-level detail for bottlenecks, with <2% overhead. > Download for free and get started troubleshooting in minutes. > http://pubads.g.doubleclick.net/gampad/clk?id=48897031&iu=/4140/ostg.clktrk_______________________________________________ > xCAT-user mailing list > xCAT-user@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/xcat-user > > [attachment "graycol.gif" deleted by Lissa Valletta/Poughkeepsie/IBM] > ------------------------------------------------------------------------------ > Get 100% visibility into Java/.NET code with AppDynamics Lite! > It's a free troubleshooting tool designed for production. > Get down to code-level detail for bottlenecks, with <2% overhead. > Download for free and get started troubleshooting in minutes. > http://pubads.g.doubleclick.net/gampad/clk?id=48897031&iu=/4140/ostg.clktrk_______________________________________________ > xCAT-user mailing list > xCAT-user@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/xcat-user > > > > ------------------------------------------------------------------------------ > Get 100% visibility into Java/.NET code with AppDynamics Lite! > It's a free troubleshooting tool designed for production. > Get down to code-level detail for bottlenecks, with <2% overhead. > Download for free and get started troubleshooting in minutes. > http://pubads.g.doubleclick.net/gampad/clk?id=48897031&iu=/4140/ostg.clktrk > > > > _______________________________________________ > xCAT-user mailing list > xCAT-user@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/xcat-user > (See attached file: smime.p7s) ------------------------------------------------------------------------------ Get 100% visibility into Java/.NET code with AppDynamics Lite! It's a free troubleshooting tool designed for production. Get down to code-level detail for bottlenecks, with <2% overhead. Download for free and get started troubleshooting in minutes. http://pubads.g.doubleclick.net/gampad/clk?id=48897031&iu=/4140/ostg.clktrk _______________________________________________ xCAT-user mailing list xCAT-user@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/xcat-user
<<inline: graycol.gif>>
smime.p7s
Description: Binary data
------------------------------------------------------------------------------ Get 100% visibility into Java/.NET code with AppDynamics Lite! It's a free troubleshooting tool designed for production. Get down to code-level detail for bottlenecks, with <2% overhead. Download for free and get started troubleshooting in minutes. http://pubads.g.doubleclick.net/gampad/clk?id=48897031&iu=/4140/ostg.clktrk
_______________________________________________ xCAT-user mailing list xCAT-user@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/xcat-user