I assume if the cluster was built by the CET with xCAT, then rinstall must have been working to these nodes in the past. Running that command or the equivalent two commands is part of the process of installing the nodes. nodeset <nodename> install rpower <nodename> boot
The only thing I can suggest is check your configuration against our documentation http://sourceforge.net/apps/mediawiki/xcat/index.php?title=XCAT_BladeCenter_Linux_Cluster Maybe someone else on the mailing list will recognize the errors you get. Lissa K. Valletta 2-3/T12 Poughkeepsie, NY 12601 (tie 293) 433-3102 From: Jacob Blevens <1jacobblevens...@gmail.com> To: Lissa Valletta/Poughkeepsie/IBM@IBMUS Cc: xcat-user@lists.sourceforge.net Date: 10/23/2012 01:17 PM Subject: Re: Fw: [xcat-user] Fwd: On a Previously Working Node and Rinstall Fails The hardware for 'node8' is an x3550 M3 and it is likely that this particular node was installed from the Management Node and not from the Service Node based on the configuration. Secondly, 'node8' was originally built by the CET with xCAT. To clarify, node1-8 were all built and were running rhel6.1 fine until an 'rinstall node8' was performed on 'node8'. If i were to run an 'rinstall' on another node in this group 'test', I would assume it would do the same thing as 'node8' which is not a new node install, it was is a working node that 'rinstall node8' was performed on unsuccessfully. Kindly, Mr. Blevens On Tue, Oct 23, 2012 at 12:02 PM, Lissa Valletta <lis...@us.ibm.com> wrote: I notice that you have this set servicenode=servicenode_ip,managements_ip and nfsserver=managements_ip. Since before you had xcatmaster=managements_ip, is it possible that the nodes were always installing from the management node and not from your service node. Another question - was node8 installed originally by xCAT, or are you trying to setup to install a node using xCAT for the first time. What hardware are you using? Lissa K. Valletta 2-3/T12 Poughkeepsie, NY 12601 (tie 293) 433-3102 Inactive hide details for Jacob Blevens ---10/23/2012 10:58:42 AM---Lissa, Thank you for your response and support on this!Jacob Blevens ---10/23/2012 10:58:42 AM---Lissa, Thank you for your response and support on this! From: Jacob Blevens <1jacobblevens...@gmail.com> To: xcat-user@lists.sourceforge.net Cc: Lissa Valletta/Poughkeepsie/IBM@IBMUS Date: 10/23/2012 10:58 AM Subject: Re: Fw: [xcat-user] Fwd: On a Previously Working Node and Rinstall Fails Lissa, Thank you for your response and support on this! We are running xCAT Version 2.6.11 (svn r11798, built Thur Mar 8 16:09 2012) on both the Management Node and the Service Node. Management Node and Service Node are running RHEL 6.1. The image install for the 'node8' is at RHEL 6.1. Both Management and Service nodes are syncing tables correctly. I investigated the xcatmaster attribute for the node and corrected the following entry in the noderes xcat configuration table: From: "test","servicenode_ip,managementnode_ip","pxe",,"managementnode_ip",,,"mac","mac",,, "managementnode_ip",,,,,,"0" To: "test","servicenode_ip,managementnode_ip","pxe",,"managementnode_ip",,,"mac","mac",,, "servicenode_ip",,,,,,"0" With the xcatmaster attribute pointing to the Service Node, I ran another install for node8 'rinstall node8' and still received the same hangup when it was trying to boot as identified below and in the previous note: - The following command is run # rinstall node8 - At the console of node8, it gets its ip - mgt(management) and sn(servicenode) servers perform dhcpdiscover/dhcpoffer/dhcprequest/dhcpack for node8 - mgt server successfully transfers the rhels6.1 vmlinuz and initrd.img - At the console the "Ready ..." message is displayed and starts to load - Then if freezes displaying the following last lines outputted: Initializing network drop monitor service Freeing unused kernel memory: 1232k freed Write protecting the kernel read-only data: 10240k Freeing unused kernel memory: 1112k freed Freeing unused kernel memory: 1796k freed usb 5-2: New USB device found, idVendor=04b3, idProduct=4010 usb 5-2: New USB device strings: Mfr=1, Product=2, SerialNumber=0 usb 5-2: Product: RNDISKDC ETHER Thank you again! Kindly, Mr. Blevens From: Lissa Valletta/Poughkeepsie/IBM@IBMUS To: xCAT Users Mailing list <xcat-user@lists.sourceforge.net>, Cc: xcat-user@lists.sourceforge.net Date: 10/23/2012 07:48 AM Subject: Re: [xcat-user] Fwd: On a Previously Working Node and Rinstall Fails Could you give us the level of xCAT you are running? Is it the same level on the Service node? nodels -v will do it. I did notice that on your lsdef of the node8 below, your xcatmaster attribute is the address of the management node. If you want that node installed by the service node, then it should be the ip address of the servicenode as known by the node. Lissa K. Valletta 2-3/T12 Poughkeepsie, NY 12601 (tie 293) 433-3102 Inactive hide details for Jacob Blevens ---10/22/2012 03:55:34 PM---*Background:* On a previously built x3550 M3 server with a Jacob Blevens ---10/22/2012 03:55:34 PM---*Background:* On a previously built x3550 M3 server with a stateful install of RHEL 6.1 From: Jacob Blevens <1jacobblevens...@gmail.com> To: xcat-user@lists.sourceforge.net Date: 10/22/2012 03:55 PM Subject: [xcat-user] Fwd: On a Previously Working Node and Rinstall Fails Background: On a previously built x3550 M3 server with a stateful install of RHEL 6.1 that was working fine after an IBM CET install does not work after running 'rinstall' on the node. It is important to note that the cluster has a management node and a servicenode. We tested 'rinstalling' the test 'node8' with the following results: The sequence of events before problem: - The following command is run # rinstall node8 - At the console of node8, it gets its ip - mgt(management) and sn(servicenode) servers perform dhcpdiscover/dhcpoffer/dhcprequest/dhcpack for node8 - mgt server successfully transfers the rhels6.1 vmlinuz and initrd.img - At the console the "Ready ..." message is displayed and starts to load - Then if freezes displaying the following last lines outputted: Initializing network drop monitor service Freeing unused kernel memory: 1232k freed Write protecting the kernel read-only data: 10240k Freeing unused kernel memory: 1112k freed Freeing unused kernel memory: 1796k freed usb 5-2: New USB device found, idVendor=04b3, idProduct=4010 usb 5-2: New USB device strings: Mfr=1, Product=2, SerialNumber=0 usb 5-2: Product: RNDISKDC ETHER At during the problem troubleshooting: - Can ping node8 - Can login to the IMM through the browser - Cannot ssh or telnet into node8 - rcons is empty with nothing to display/no logon xCAT tables: #nodels node8 chain chain.ondiscover: nodediscover chain.chain: runcmd=bmcsetup,standby chain.node:node8 chain.currstate: install rhels6.1-x86_64-compute chain.currchain: boot chain.commments: chain.disable: #lsdef node8 Object Name: node8 arch = x86_64 bmc=node8-bmc bmcport=0 chain=runcmd=bmcsetup,standby cons=ipmi conserver=#.#.#.# (managements_ip) currchain=boot currstate=install rhels6.1-x86_64-compute groups=rack10,test,intel,ipmi,compute initrd=xcat/rhels6.1/x86_64/initrd.img installnic=mac kcmdline=nofb utf8 ks:http://managements_ip/install/autoinst/node8 ksdevice=#:#:#:#:#:# console=tty0 console=tty0,115200n8r noipv6 kernel=xcat/rhels6.1/x86_64/vmlinuz mac=#:#:#:#:#:# mgt=ipmi mtm=serial## netboot=pxe nfsserver=managements_ip nodetype=osi ondiscover=nodediscover os=rhels6.1 postbootscripts=otherpkgs,site.post,site.gpfs postscripts=syslog,remoteshell,syncfiles,site.hardeths,setupntp power=ipmi primarynic=mac profile=compute provmethod=install rack=10 serial=K###### serialflow=hard serialport=0 serial speed=115200 servicenode=servicenode_ip,managements_ip status=installing statustime=10-22-2012 13:00 switch=cisco-enet01 switchport=#/# unit=40 xcatmaster=managements_ip #tabdump nodehm "ipmi","ipmi","ipmi",,,"managements_ip","0","115200","hard",,, "test","ipmi","ipmi","ipmi",,,"managements_ip","0","115200","hard",,, #tabdump ipmi "ipmi","/\z/-bmc/","0",,,, #tabdump nodetype "test","rhels6.1","x86_64",,,, Any input or advice on how to resolve this issue would be greatly appreciated! Thank you! Mr. Blevens ------------------------------------------------------------------------------ Everyone hates slow websites. So do we. Make your web apps faster with AppDynamics Download AppDynamics Lite for free today: http://p.sf.net/sfu/appdyn_sfd2d_oct _______________________________________________ xCAT-user mailing list xCAT-user@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/xcat-user ------------------------------------------------------------------------------ Everyone hates slow websites. So do we. Make your web apps faster with AppDynamics Download AppDynamics Lite for free today: http://p.sf.net/sfu/appdyn_sfd2d_oct _______________________________________________ xCAT-user mailing list xCAT-user@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/xcat-user
<<inline: graycol.gif>>
------------------------------------------------------------------------------ Everyone hates slow websites. So do we. Make your web apps faster with AppDynamics Download AppDynamics Lite for free today: http://p.sf.net/sfu/appdyn_sfd2d_oct
_______________________________________________ xCAT-user mailing list xCAT-user@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/xcat-user