XCATBYPASS means you are not running with xcatd. If the problem occurs only
when running with xcatd, I would like to understand why. That would be a bug.

Lissa K. Valletta
8-3/B10
Poughkeepsie, NY 12601
(tie 293) 433-3102





From:   Michael Robbert <mrobb...@mines.edu>
To:     <xcat-user@lists.sourceforge.net>,
Date:   08/16/2013 01:10 PM
Subject:        Re: [xcat-user] New nodes not deploying



Lissa,
When I tried steps 1 and 2 it worked. I haven't had time to look at it
deeper so I don't know if something else changed since the last time I
tried or if XCATBYPASS is what fixed it. I'll try to let you know if I
do find anything, but for now don't worry about it.

Mike

On 8/14/13 4:00 AM, Lissa Valletta wrote:
> I have a couple of debug ideas.
> 1) export XCATBYPASS=1    (that makes you run without the daemon, for
> debugging)
> 2) Run your nodeset command. Do you see errors?
>
> 3) unset XCATBYPASS
>     service xcatd stop
>     /opt/xcat/sbin/xcatd -f    (that runs the daemon in the foreground)
> In another window, run nodeset. Do you see any errors being
> displayed by the daemon?
>
> 4) Get out of this by running service xcatd start in the other window. That
> should shut down the xcatd -f and bring you up normally. If not, run ps -ef
> | grep xcatd, kill -9 any xcatd process, and then run service xcatd
> restart.
>
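Steps 1 and 2 above could be wrapped in a small helper. This is only a sketch under assumptions: run_bypassed is a hypothetical function name, not an xCAT command. It sets XCATBYPASS for the single command it runs, so there is nothing to unset afterwards, and it saves anything printed on stderr for later review.

```shell
#!/bin/sh
# Hypothetical helper (not part of xCAT): run one command with XCATBYPASS
# set, so the client runs without the xcatd daemon as in step 1, and
# keep whatever the command prints on stderr in a file.
run_bypassed() {
    errlog=$1
    shift
    # The variable is set only for this one external command; the calling
    # shell's environment is left untouched.
    XCATBYPASS=1 "$@" 2>"$errlog"
}

# Example (assumes an xCAT management node):
#   run_bypassed /tmp/nodeset.err nodeset compute084 osimage=mycomputeimage
#   cat /tmp/nodeset.err
```

If the command behaves differently here than it does through the daemon, that difference is what step 3 is meant to expose.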
> If none of this works, can we get a copy of your database? I would run
> tabprune -a auditlog (if you are using auditlog) and then
> dumpxCATdb, tar and compress the result, and send it to lis...@us.ibm.com.
> The only other option is for me to get on the system. You can send a note
> to that same userid.
>
> Are you just using sqlite (the default database)?
>
> Lissa K. Valletta
> 8-3/B10
> Poughkeepsie, NY 12601
> (tie 293) 433-3102
>
>
>
>
> From: Michael Robbert <mrobb...@mines.edu>
> To: xCAT Users Mailing list <xcat-user@lists.sourceforge.net>,
> Date: 08/09/2013 11:33 AM
> Subject: Re: [xcat-user] New nodes not deploying
>
> ------------------------------------------------------------------------
>
>
>
> Here is a diff comparing a working and non-working node.
>
> [root@mgmt ~]# diff /tmp/compute004.def /tmp/compute084.def
> 1c1
> < Object name: compute004
> ---
>  > Object name: compute084
> 3c3
> <     bmc=compute004-bmc
> ---
>  >     bmc=compute084-bmc
> 7c7,8
> <     currstate=netboot centos6.3-x86_64-compute
> ---
>  >     currchain=runcmd=standby
>  >     currstate=runcmd=standby
> 9c10
> <     initrd=xcat/osimage/mycomputeimage/initrd-stateless.gz
> ---
>  >     initrd=xcat/genesis.fs.x86_64.lzma
> 11,14c12,15
> <     ip=172.17.8.4
> <     kcmdline=imgurl=http://172.17.0.1:80//install/netboot/centos6.3/x86_64/compute/rootimg.gz XCAT=!myipfn!:3001 NODE=compute004 ifname=eth0:00:30:48:f2:87:c4 netdev=eth0  console=tty0 console=ttyS0,115200n8r
> <     kernel=xcat/osimage/mycomputeimage/kernel
> <     mac=00:30:48:f2:87:c4
> ---
>  >     ip=172.17.8.84
>  >     kcmdline=quiet console=tty0 console=ttyS0,115200 xcatd=172.17.0.1:3001 destiny=runcmd=standby
>  >     kernel=xcat/genesis.kernel.x86_64
>  >     mac=00:25:90:19:92:52
> 21c22
> <     otherinterfaces=compute004-bmc:172.17.32.4
> ---
>  >     otherinterfaces=compute084-bmc:172.17.32.84
> 32,33c33,34
> <     status=failed
> <     statustime=08-05-2013 16:49:52
> ---
>  >     status=configuring
>  >     statustime=08-08-2013 13:35:03
> 35,36d35
> <     updatestatus=synced
> <     updatestatustime=07-29-2013 15:36:16
>
>
> The only differences I see are IPs and node unique numbering plus the
> things that I think nodeset should be changing for me.
> I run this:
>
> [root@mgmt ~]# nodeset compute084 osimage=mycomputeimage
> compute084: netboot centos6.3-x86_64-compute
>
> and nothing changes.
> I have tried running xcatdebug to see if I could see what is happening
> under the covers, but when I do the daemon stops responding to commands
> and needs to be restarted.
> Also regarding the switch name, it is being populated by the
> nodediscovery process. I'm not using switch discovery though, just
> sequential discovery.
>
> Mike
> ________________________________________
> From: Lissa Valletta [lis...@us.ibm.com]
> Sent: Friday, August 09, 2013 7:55
> To: xCAT Users Mailing list
> Subject: Re: [xcat-user] New nodes not deploying
>
> If the last suggestion does not work, then I would take the two lsdef
> outputs and remove any attribute in the bad node that is not in the good
> node, until you have exactly the same attributes defined in the bad node
> as in the good node.
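
The attribute comparison described above can be sketched with standard tools. This is a minimal sketch, assuming each node's lsdef output has been saved to a file (for example the /tmp/compute004.def and /tmp/compute084.def files from the diff earlier in this thread). The function names attr_names and extra_attrs are made up for illustration.

```shell
#!/bin/sh
# List attribute names that appear in the bad node's saved lsdef output
# but not in the good node's. Only names are compared, not values.

# Print the sorted, unique attribute names found in one lsdef dump.
# Lines without "name=value" (such as "Object name: ...") are skipped.
attr_names() {
    sed -n 's/^[[:space:]]*\([A-Za-z0-9_.][A-Za-z0-9_.]*\)=.*/\1/p' "$1" | sort -u
}

# extra_attrs GOOD BAD: print names defined only in BAD.
extra_attrs() {
    good=$(mktemp) || return 1
    bad=$(mktemp) || { rm -f "$good"; return 1; }
    attr_names "$1" >"$good"
    attr_names "$2" >"$bad"
    # comm -13 keeps only the lines unique to the second (bad) file.
    comm -13 "$good" "$bad"
    rm -f "$good" "$bad"
}

# Example:
#   extra_attrs /tmp/compute004.def /tmp/compute084.def
```

Each name printed is a candidate for removal from the bad node. Swapping the two arguments lists what the bad node is missing instead.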
>
> Lissa K. Valletta
> 8-3/B10
> Poughkeepsie, NY 12601
> (tie 293) 433-3102
>
>
>
>
> From: Michael Robbert <mrobb...@mines.edu>
> To: <xcat-user@lists.sourceforge.net>,
> Date: 08/08/2013 03:30 PM
> Subject: Re: [xcat-user] New nodes not deploying
>
> ________________________________
>
>
>
> Didn't work. I recreated dhcp with those commands. I had run makedhcp
> before, but probably not with -n. The nodes are and were showing up in
> the leases file so that didn't appear to be the problem. I also made
> some changes to the nodehm table so that the serial attributes are
> showing up on the new nodes. Still nodeset runs without error, but
> doesn't change the tftp config file for the node.
> I'm just not sure how to debug this. It is just failing silently.
>
> Thanks for any tips,
> Mike
>
> On 8/8/13 5:16 AM, Lissa Valletta wrote:
>  > Thanks for all the good information!
>  >
>  > Did you run makedhcp -a to pick up the new nodes? Also, you will need
>  > to run makeconservercf to pick up the new nodes.
>  > You might need to run makedhcp -n followed by makedhcp -a.
>  >
>  > Also, these attributes are missing in the new nodes:
>  >      serialflow=hard
>  >      serialport=0
>  >      serialspeed=115200
>  >
>  >
>  > Lissa K. Valletta
>  > 8-3/B10
>  > Poughkeepsie, NY 12601
>  > (tie 293) 433-3102
>  >
>  >
>  >
>  >
>  > From: Michael Robbert <mrobb...@mines.edu>
>  > To: <xcat-user@lists.sourceforge.net>,
>  > Date: 08/07/2013 06:30 PM
>  > Subject: [xcat-user] New nodes not deploying
>  >
>  > ------------------------------------------------------------------------
>  >
>  >
>  >
>  > I've got a small test cluster with working stateless nodes. Recently I
>  > tried to add 2 more nodes and I can't get them to deploy the same
>  > stateless image. For some reason the tftp configuration files are
>  > getting touched when I run nodeset, but not changed to point to the
>  > correct boot images. They are staying with genesis boot images.
>  > I have tried various incarnations of nodeadd and nodediscover, always
>  > followed by a nodeset $nodename osimage=mycomputeimage.
>  > The nodeset command changes the /tftpboot/xcat/xnba/nodes/$nodename
>  > file for the working nodes, and it updates the timestamp for the
>  > non-working nodes, but the file still points to the genesis boot
>  > images. Am I missing a step?
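
The symptom described above (the per-node boot file still naming the genesis images after nodeset) can be checked with a short script. This is a rough sketch: boot_target is a hypothetical helper name, and the optional second argument for the directory is an assumption added so the check can be tried against a copy of the files. The default directory is the one given in the message.

```shell
#!/bin/sh
# Report whether a node's xNBA boot file still references the genesis
# images. Returns 0 if it does not, 1 if it does, 2 if the file is unreadable.
boot_target() {
    node=$1
    dir=${2:-/tftpboot/xcat/xnba/nodes}
    file=$dir/$node
    if [ ! -r "$file" ]; then
        echo "$node: cannot read $file"
        return 2
    fi
    # Both genesis file names quoted in this thread contain "genesis".
    if grep -q 'genesis' "$file"; then
        echo "$node: still genesis"
        return 1
    fi
    echo "$node: not genesis"
}

# Example, run right after nodeset:
#   for n in compute004 compute084; do boot_target "$n"; done
```

Running it before and after nodeset for a working and a non-working node shows whether the file content changed or only its timestamp.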
>  >
>  > Here is my setup.
>  > xCAT server:
>  >
>  > [root@mgmt nodes]# cat /etc/redhat-release
>  > CentOS release 6.4 (Final)
>  >
>  > [root@mgmt nodes]# rpm -qa|grep -i xcat
>  > ipmitool-xcat-1.8.11-3.x86_64
>  > xCAT-buildkit-2.8.2-snap201307222332.noarch
>  > xCAT-UI-2.8.2-snap201307222329.noarch
>  > yaboot-xcat-1.3.17-rc1.noarch
>  > xCAT-client-2.8.2-snap201307222328.noarch
>  > xCAT-2.8.2-snap201307222333.x86_64
>  > xCAT-genesis-base-x86_64-2.8-snap201305300347.noarch
>  > perl-xCAT-2.8.2-snap201307222328.noarch
>  > conserver-xcat-8.1.16-10.x86_64
>  > xCAT-UI-deps-2.8-2.noarch
>  > syslinux-xcat-3.86-2.noarch
>  > xCAT-genesis-scripts-x86_64-2.8.2-snap201307222333.noarch
>  > elilo-xcat-3.14-4.noarch
>  > openslp-xcat-1.2.1-1.x86_64
>  > xCAT-server-2.8.2-snap201307222328.noarch
>  >
>  > [root@mgmt nodes]# lsdef -t osimage mycomputeimage
>  > Object name: mycomputeimage
>  >      exlist=/install/custom/netboot/centos/compute.exlist
>  >      imagetype=linux
>  >      osarch=x86_64
>  >      osname=Linux
>  >      osvers=centos6.3
>  >      otherpkgdir=/install/post/otherpkgs/centos6.3/x86_64
>  >      otherpkglist=/install/custom/netboot/centos/compute.otherpkgs.pkglist
>  >      permission=755
>  >      pkgdir=/install/centos6.3/x86_64
>  >      pkglist=/install/custom/netboot/centos/compute.pkglist
>  >      postbootscripts=configiba
>  >      postinstall=/install/custom/netboot/centos/compute.postinstall
>  >      postscripts=configiba,syncfiles
>  >      profile=compute
>  >      provmethod=netboot
>  >      rootimgdir=/install/netboot/centos6.3/x86_64/compute
>  >      synclists=/install/custom/netboot/centos/compute.synclist
>  >
>  > This is a previously configured and currently working node:
>  > [root@mgmt nodes]# lsdef compute004
>  > Object name: compute004
>  >      arch=x86_64
>  >      bmc=compute004-bmc
>  >      bmcport=0
>  >      chain=runcmd=standby
>  >      cons=ipmi
>  >      currstate=netboot centos6.3-x86_64-compute
>  >      groups=compute,ipmi,all
>  >      initrd=xcat/osimage/mycomputeimage/initrd-stateless.gz
>  >      installnic=eth0
>  >      ip=172.17.8.4
>  >      kcmdline=imgurl=http://172.17.0.1:80//install/netboot/centos6.3/x86_64/compute/rootimg.gz XCAT=!myipfn!:3001 NODE=compute004 ifname=eth0:00:30:48:f2:87:c4 netdev=eth0  console=tty0 console=ttyS0,115200n8r
>  >      kernel=xcat/osimage/mycomputeimage/kernel
>  >      mac=00:30:48:f2:87:c4
>  >      mgt=ipmi
>  >      netboot=xnba
>  >      nfsserver=172.17.0.1
>  >      nodetype=osi
>  >      ondiscover=nodediscover
>  >      os=centos6.3
>  >      otherinterfaces=compute004-bmc:172.17.32.4
>  >      postbootscripts=otherpkgs,setroute
>  >      postscripts=syslog,remoteshell,syncfiles,setupntp,addexternalyum,configiba,orangefs,setroute
>  >      power=ipmi
>  >      primarynic=eth0
>  >      profile=compute
>  >      provmethod=mycomputeimage
>  >      routenames=ct_gc,ct_gc_eth
>  >      serialflow=hard
>  >      serialport=0
>  >      serialspeed=115200
>  >      status=failed
>  >      statustime=08-05-2013 16:49:52
>  >      tftpserver=172.17.0.1
>  >      updatestatus=synced
>  >      updatestatustime=07-29-2013 15:36:16
>  >
>  > This is one of the new nodes that isn't working:
>  > [root@mgmt nodes]# lsdef compute084
>  > Object name: compute084
>  >      arch=x86_64
>  >      chain=runcmd=standby
>  >      currchain=runcmd=standby
>  >      currstate=runcmd=standby
>  >      groups=compute
>  >      initrd=xcat/genesis.fs.x86_64.lzma
>  >      installnic=eth0
>  >      ip=172.17.8.84
>  >      kcmdline=quiet xcatd=172.17.0.1:3001 destiny=runcmd=standby
>  >      kernel=xcat/genesis.kernel.x86_64
>  >      mac=00:25:90:19:92:52
>  >      mgt=ipmi
>  >      netboot=xnba
>  >      nfsserver=172.17.0.1
>  >      nodetype=osi
>  >      ondiscover=nodediscover
>  >      os=centos6.3
>  >      otherinterfaces=compute084-bmc:172.17.32.84
>  >      postbootscripts=otherpkgs,setroute
>  >      postscripts=syslog,remoteshell,syncfiles,setupntp,addexternalyum,configiba,orangefs,setroute
>  >      primarynic=eth0
>  >      profile=compute
>  >      provmethod=mycomputeimage
>  >      routenames=ct_gc,ct_gc_eth
>  >      serial=P1715140
>  >      status=booting
>  >      statustime=08-07-2013 14:43:40
>  >      supportedarchs=x86,x86_64
>  >      switch=Binary file (standard input) matches
>  >      switchport=Binary file (standard input) matches
>  >      tftpserver=172.17.0.1
>  >
>  > Let me know what else you want to see.
>  >
>  > Thanks,
>  > Mike Robbert
>  > Colorado School of Mines
>  >
>  > (See attached file: smime.p7s)
>  > ------------------------------------------------------------------------------
>  > Get 100% visibility into Java/.NET code with AppDynamics Lite!
>  > It's a free troubleshooting tool designed for production.
>  > Get down to code-level detail for bottlenecks, with <2% overhead.
>  > Download for free and get started troubleshooting in minutes.
>  > http://pubads.g.doubleclick.net/gampad/clk?id=48897031&iu=/4140/ostg.clktrk
>  > _______________________________________________
>  > xCAT-user mailing list
>  > xCAT-user@lists.sourceforge.net
>  > https://lists.sourceforge.net/lists/listinfo/xcat-user
