I notice that you have this set servicenode=servicenode_ip,managements_ip
and nfsserver=managements_ip.   Since before you had
xcatmaster=managements_ip,  is it possible that the nodes were always
installing from the management node and not from your service node.
Another  question - was node8 installed originally by xCAT, or are you
trying to setup to install a node using xCAT for the first time.
What hardware are you using?

Lissa K. Valletta
2-3/T12
Poughkeepsie, NY 12601
(tie 293) 433-3102





From:   Jacob Blevens <1jacobblevens...@gmail.com>
To:     xcat-user@lists.sourceforge.net
Cc:     Lissa Valletta/Poughkeepsie/IBM@IBMUS
Date:   10/23/2012 10:58 AM
Subject:        Re: Fw: [xcat-user] Fwd: On a Previously Working Node and
            Rinstall Fails



Lissa,
Thank you for your response and support on this!

We are running xCAT Version 2.6.11 (svn r11798, built Thur Mar 8 16:09
2012) on both the Management Node and the Service Node.  Management Node
and Service Node are running RHEL 6.1.  The image install for the 'node8'
is at RHEL 6.1.  Both Management and Service nodes are syncing tables
correctly.

I investigated the xcatmaster attribute for the node and corrected the
following entry in the noderes xcat configuration table:

From:
"test","servicenode_ip,managementnode_ip","pxe",,"managementnode_ip",,,"mac","mac",,,
"managementnode_ip",,,,,,"0"

To:
"test","servicenode_ip,managementnode_ip","pxe",,"managementnode_ip",,,"mac","mac",,,
"servicenode_ip",,,,,,"0"

With the xcatmaster attribute pointing to the Service Node, I ran another
install for node8 'rinstall node8' and still received the same hangup when
it was trying to boot as identified below and in the previous note:

- The following command is run # rinstall node8
- At the console of node8, it gets its ip
- mgt(management) and sn(servicenode) servers perform
dhcpdiscover/dhcpoffer/dhcprequest/dhcpack for node8
- mgt server successfully transfers the rhels6.1 vmlinuz and initrd.img
- At the console the "Ready ..." message is displayed and starts to load
- Then if freezes displaying the following last lines outputted:

Initializing network drop monitor service
Freeing unused kernel memory: 1232k freed
Write protecting the kernel read-only data: 10240k
Freeing unused kernel memory: 1112k freed
Freeing unused kernel memory: 1796k freed
usb 5-2: New USB device found, idVendor=04b3, idProduct=4010
usb 5-2: New USB device strings: Mfr=1, Product=2, SerialNumber=0
usb 5-2: Product: RNDISKDC ETHER

Thank you again! Kindly,
Mr. Blevens

  From: Lissa Valletta/Poughkeepsie/IBM@IBMUS
  To: xCAT Users Mailing list <xcat-user@lists.sourceforge.net>,
  Cc: xcat-user@lists.sourceforge.net
  Date: 10/23/2012 07:48 AM
  Subject: Re: [xcat-user] Fwd: On a Previously Working Node and Rinstall
  Fails



  Could you give us the level of xCAT you are running?   Is it the same
  level on the Service node?
  nodels -v  will do it.

  I did notice that on your lsdef of the node8 below,  your xcatmaster
  attribute  is the address of the management node.  If you want that node
  installed by the service node, then it should be the ip address of the
  servicenode as known by the node.


  Lissa K. Valletta
  2-3/T12
  Poughkeepsie, NY 12601
  (tie 293) 433-3102



  Inactive hide details for Jacob Blevens ---10/22/2012 03:55:34
  PM---*Background:* On a previously built x3550 M3 server with a Jacob
  Blevens ---10/22/2012 03:55:34 PM---*Background:* On a previously built
  x3550 M3 server with a stateful install of RHEL 6.1

  From: Jacob Blevens <1jacobblevens...@gmail.com>
  To: xcat-user@lists.sourceforge.net
  Date: 10/22/2012 03:55 PM
  Subject: [xcat-user] Fwd: On a Previously Working Node and Rinstall Fails



  Background:
  On a previously built x3550 M3 server with a stateful install of RHEL 6.1
  that was working fine after an IBM CET install does not work after
  running 'rinstall' on the node.  It is important to note that the cluster
  has a management node and a servicenode.  We tested 'rinstalling' the
  test 'node8' with the following results:

  The sequence of events before problem:
  - The following command is run # rinstall node8
  - At the console of node8, it gets its ip
  - mgt(management) and sn(servicenode) servers perform
  dhcpdiscover/dhcpoffer/dhcprequest/dhcpack for node8
  - mgt server successfully transfers the rhels6.1 vmlinuz and initrd.img
  - At the console the "Ready ..." message is displayed and starts to load
  - Then if freezes displaying the following last lines outputted:

  Initializing network drop monitor service
  Freeing unused kernel memory: 1232k freed
  Write protecting the kernel read-only data: 10240k
  Freeing unused kernel memory: 1112k freed
  Freeing unused kernel memory: 1796k freed
  usb 5-2: New USB device found, idVendor=04b3, idProduct=4010
  usb 5-2: New USB device strings: Mfr=1, Product=2, SerialNumber=0
  usb 5-2: Product: RNDISKDC ETHER

  At during the problem troubleshooting:
  - Can ping node8
  - Can login to the IMM through the browser
  - Cannot ssh or telnet into node8
  - rcons is empty with nothing to display/no logon

  xCAT tables:

  #nodels node8 chain
  chain.ondiscover: nodediscover
  chain.chain: runcmd=bmcsetup,standby
  chain.node:node8
  chain.currstate: install rhels6.1-x86_64-compute
  chain.currchain: boot
  chain.commments:
  chain.disable:

  #lsdef node8
  Object Name: node8
  arch = x86_64
  bmc=node8-bmc
  bmcport=0
  chain=runcmd=bmcsetup,standby
  cons=ipmi
  conserver=#.#.#.# (managements_ip)
  currchain=boot
  currstate=install rhels6.1-x86_64-compute
  groups=rack10,test,intel,ipmi,compute
  initrd=xcat/rhels6.1/x86_64/initrd.img
  installnic=mac
  kcmdline=nofb utf8 ks:http://managements_ip/install/autoinst/node8
  ksdevice=#:#:#:#:#:# console=tty0
  console=tty0,115200n8r noipv6
  kernel=xcat/rhels6.1/x86_64/vmlinuz
  mac=#:#:#:#:#:#
  mgt=ipmi
  mtm=serial##
  netboot=pxe
  nfsserver=managements_ip
  nodetype=osi
  ondiscover=nodediscover
  os=rhels6.1
  postbootscripts=otherpkgs,site.post,site.gpfs
  postscripts=syslog,remoteshell,syncfiles,site.hardeths,setupntp
  power=ipmi
  primarynic=mac
  profile=compute
  provmethod=install
  rack=10
  serial=K######
  serialflow=hard
  serialport=0
  serial speed=115200
  servicenode=servicenode_ip,managements_ip
  status=installing
  statustime=10-22-2012 13:00
  switch=cisco-enet01
  switchport=#/#
  unit=40
  xcatmaster=managements_ip

  #tabdump nodehm
  "ipmi","ipmi","ipmi",,,"managements_ip","0","115200","hard",,,
  "test","ipmi","ipmi","ipmi",,,"managements_ip","0","115200","hard",,,

  #tabdump ipmi
  "ipmi","/\z/-bmc/","0",,,,

  #tabdump nodetype
  "test","rhels6.1","x86_64",,,,

  Any input or advice on how to resolve this issue would be greatly
  appreciated!  Thank you!
  Mr. Blevens





  ------------------------------------------------------------------------------

  Everyone hates slow websites. So do we.
  Make your web apps faster with AppDynamics
  Download AppDynamics Lite for free today:
  http://p.sf.net/sfu/appdyn_sfd2d_oct
  _______________________________________________
  xCAT-user mailing list
  xCAT-user@lists.sourceforge.net
  https://lists.sourceforge.net/lists/listinfo/xcat-user
  ------------------------------------------------------------------------------

  Everyone hates slow websites. So do we.
  Make your web apps faster with AppDynamics
  Download AppDynamics Lite for free today:
  http://p.sf.net/sfu/appdyn_sfd2d_oct
  _______________________________________________
  xCAT-user mailing list
  xCAT-user@lists.sourceforge.net
  https://lists.sourceforge.net/lists/listinfo/xcat-user




<<inline: graycol.gif>>

------------------------------------------------------------------------------
Everyone hates slow websites. So do we.
Make your web apps faster with AppDynamics
Download AppDynamics Lite for free today:
http://p.sf.net/sfu/appdyn_sfd2d_oct
_______________________________________________
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user

Reply via email to