Re: [xcat-user] postscript not run after install?
In fact, make sure all files in the /install/postscripts directory and its subdirectories are world-readable. A permissions problem there can cause a download failure that will show up in the wget.log that Guang Cheng Li asked for below.

Thanks!
Lissa K. Valletta
2-3/T12 Poughkeepsie, NY 12601
(tie 293) 433-3102

From: Lissa Valletta/Poughkeepsie/IBM@IBMUS
To: xCAT Users Mailing list xcat-user@lists.sourceforge.net
Cc: xCAT Users Mailing list xcat-user@lists.sourceforge.net
Date: 07/25/2012 07:03 AM
Subject: Re: [xcat-user] postscript not run after install?

Do the postscripts you added to /install/postscripts have the correct permissions? They must be world-readable, and executable at least by root. Check against our postscripts, which look something like this:

    -rwxr-xr-x 1 root root 9869 Jul 3 09:50 syslog

Also run lsdef nodename. What is in the postscripts attribute?

Lissa K. Valletta

From: Guang Cheng Li ligua...@cn.ibm.com
To: xCAT Users Mailing list xcat-user@lists.sourceforge.net
Date: 07/25/2012 12:14 AM
Subject: Re: [xcat-user] postscript not run after install?

Before the postscripts are run, they need to be downloaded from the management node or service node. If the download fails, the postscripts will not be run at all, or at least will appear not to run at all. Could you please post the contents of /tmp/wget.log and /var/log/xcat/xcat.log from the compute node?

Thanks,
Li, Guang Cheng (李光成)
IBM China System Technology Laboratory
Email: ligua...@cn.ibm.com
Address: Building 28, ZhongGuanCun Software Park, No.8, Dong Bei Wang West Road, Haidian District, Beijing 100193, PRC

From: SYED ASIF ZAHEER sa_zah...@hotmail.com
To: XCAT Help xcat-user@lists.sourceforge.net
Date: 2012-07-25 12:04
Subject: Re: [xcat-user] postscript not run after install?

Did you add your script to the postscripts table so that it gets run? Edit the table with tabedit postscripts.

    [root@xgplg ~]# tabdump postscripts
    #node,postscripts,postbootscripts,comments,disable
    "xcatdefaults","syslog,remoteshell,syncfiles","otherpkgs",,
    "service","servicenode,xcatserver,xcatclient",,,
    "compute","script_filename.sh",,,

Regards,
Syed Asif Zaheer
sa_zah...@hotmail.com

Date: Wed, 25 Jul 2012 14:26:34 +1200
From: francois.bis...@canterbury.ac.nz
To: xcat-user@lists.sourceforge.net
Subject: Re: [xcat-user] postscript not run after install?

On Wed, 25 Jul 2012 10:16:40 Guang Cheng Li wrote:
> Hi,
> Many things can prevent the postscripts from running successfully, for example the compute node not being able to resolve the hostname of the management node. Try the following debug procedure:
> 1. Is there anything in /tmp/wget.log? Is the node trying to get the postscripts from the correct server?
> 2. Check /var/log/xcat/xcat.log; it might indicate what is going wrong.

In both cases I only see messages from the time I ran updatenode nodename -P, which runs fine. As far as I can tell, it is not that the postscripts fail; they are not run at all after the install is finished. All I am seeing now is:

    install
    reboot
    finish the install
    run postbootscript

At no point are the postscripts run. I am not saying they are failing; I am saying they are not being run at all.

Dr Francois Bissey
BlueFern team

--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats.
http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user
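The permission advice in this thread (everything under /install/postscripts world-readable, scripts executable at least by root) can be checked mechanically. The following is only a minimal sketch, not an xCAT tool: the function names find_unreadable and find_not_root_exec are made up for illustration, and the default path is the directory named in the thread.

```shell
#!/bin/sh
# Hypothetical helpers (not part of xCAT) for the permission check
# described in this thread.

# Print every file or directory under the given tree that is NOT
# world-readable. Defaults to the directory named in the thread.
find_unreadable() {
    dir="${1:-/install/postscripts}"
    find "$dir" ! -perm -o=r -print
}

# Print every regular file under the tree whose owner-execute bit is
# missing (the thread asks for "executable at least by root", and the
# sample listing shows root as the owner).
find_not_root_exec() {
    dir="${1:-/install/postscripts}"
    find "$dir" -type f ! -perm -u=x -print
}

# Example use on the management node:
#   find_unreadable
#   find_not_root_exec /install/postscripts
```

An empty result from both functions means the permissions match what the replies ask for. Anything printed is a candidate for chmod, though non-script files in the tree (data or configuration files) may legitimately lack the execute bit.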
Re: [xcat-user] Install stateful node that is also connected to SAN storage
In the past, I added 'nostorage' to bootparams.addkcmdline and then added the following to the KS template:

For RHEL 5.x:

    device scsi mptsas

In the ongoing effort to fix that which is not broken, Red Hat apparently removed this from kickstart, but I found the following, which might prove helpful:
https://www.redhat.com/archives/rhelv6-beta-list/2010-April/msg00090.html

I don't have anything to test it on, though. YMMV, since it's talking about 6-beta and the nostorage option was accidentally left in the documentation after 6 was released.

Also might be useful:
http://osdir.com/ml/kickstart-list/2012-02/msg00010.html

Regards,
Christian D. Caruthers
Senior Consultant - System x Linux HPC
IBM
Mobile: 1-757-289-9872 | Phone: 1-804-327-4559
E-mail: christian.caruth...@us.ibm.com
Find me on LinkedIn: http://www.linkedin.com/profile/view?id=14378571trk=tab_pro
9201 Arboretum Pkwy
Richmond, VA 23236-5402
United States

"A common mistake that people make when trying to design something completely foolproof is to underestimate the ingenuity of complete fools." - Douglas Adams

From: Adam Emerich/Rochester/IBM@IBMUS
To: xcat-user@lists.sourceforge.net
Date: 07/25/2012 01:08 AM
Subject: [xcat-user] Install stateful node that is also connected to SAN storage

I have some stateful nodes in my cluster that are also attached to SAN storage. I have found that when installing a node, the install process accesses the SAN drives and corrupts them.

I have the following in the *.tmpl file (which came with the xCAT install). I thought it would prevent this, but I have not been successful in my testing:

    %pre
    #!/bin/sh
    # This will remove the loaded HBA modules from the kernel
    #modprobe mptsas
    #modprobe mptscsih
    #modprobe mptbase
    #modprobe scsi_transport_sas
    remove_qla(){
      for i in $(lsmod | grep qla | awk '{print $1}'); do
        echo "Will remove: $i" > /dev/tty1
        rmmod $i
        sleep 1
      done
    }
    remove_lpfc(){
      for i in $(lsmod | grep lpfc | awk '{print $1}'); do
        echo "Will remove: $i" > /dev/tty1
        rmmod $i
        sleep 1
      done
    }
    remove_qla
    sleep 2
    remove_qla
    remove_lpfc
    echo "All qla modules removed"
    sleep 10
    #INCLUDE:../scripts/pre.rh#
    %post
    #INCLUDE:../scripts/post.rh#

Linux: Red Hat Enterprise Linux Server release 6.1 (Santiago)
xCAT: 2.6.8

Any help would be greatly appreciated.
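One detail of the %pre script above: piping lsmod through grep matches the whole line, including the "Used by" column, so a module that merely lists a qla or lpfc driver as a dependent is also selected. The sketch below filters on the module-name column only. It is an illustration under assumptions, not part of the xCAT template: the function name list_modules is made up, and whether a narrower match helps with the SAN corruption described here is untested.

```shell
#!/bin/sh
# Hypothetical helper (not from the xCAT template): read lsmod-style
# text on stdin and print the names of modules whose NAME starts with
# one of the prefixes given as arguments.
list_modules() {
    awk -v pats="$*" '
        BEGIN { n = split(pats, p, " ") }
        NR > 1 {                       # skip the "Module Size Used by" header
            for (i = 1; i <= n; i++)
                if (index($1, p[i]) == 1) { print $1; break }
        }'
}

# Example use inside a %pre section (assumption: same console
# redirect as the original script):
#   for m in $(lsmod | list_modules qla lpfc); do
#       echo "Will remove: $m" > /dev/tty1
#       rmmod "$m"
#   done
```

With this filter, a line such as "scsi_transport_fc ... qla2xxx,lpfc" is skipped, because only the first field is compared against the prefixes.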
[xcat-user] Does not continue to bmcsetup once mac is discovered
Hello,

When I power on a node, discovery works fine: the MAC address is added to the mac table, and /var/lib/dhcpd/dhcpd.leases is updated. The node renews its IP address, replacing the discovery address with its final address, but then it loops forever with an "Unrecognized directive" (blank) message coming from /bin/doxcat, because /proc/cmdline does not contain destiny=. In fact, it still has the old entry with the discovery address.

Any idea why it is doing that?

My workaround is to run nodeset with runcmd=bmcsetup once the MAC address is discovered, and then reboot. But that costs an extra boot.

Thank you for your help.

Best regards,
Antoine Tabary
Certified HPC I/T Specialist
6520AA
ITS
17 Avenue De L'europe
Bois Colombes Cedex, 92275
France
e-mail: antoine.tab...@fr.ibm.com

Unless stated otherwise above: Compagnie IBM France. Registered office: 17 avenue de l'Europe, 92275 Bois-Colombes Cedex. RCS Nanterre 552 118 465. Legal form: S.A.S. Share capital: 645.605.931,30 €. SIREN/SIRET: 552 118 465 03644 - NAF code 6202A
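The symptom described here hinges on whether the kernel command line carries a destiny= entry. A small check like the one below can confirm that from a shell on the node. It is a sketch only: the function name cmdline_param is invented for illustration, and the sample command lines in the comments are made-up values, not output from an xCAT node.

```shell
#!/bin/sh
# Hypothetical helper: print the value of NAME=value from a kernel
# command line and return 0, or return 1 when NAME= is absent.
# $1 = parameter name, $2 = command line text (optional; defaults to
# the contents of /proc/cmdline).
cmdline_param() (
    set -f                               # no glob expansion while splitting
    name=$1
    line=${2-$(cat /proc/cmdline)}
    for word in $line; do
        case $word in
            "$name"=*) printf '%s\n' "${word#*=}"; exit 0 ;;
        esac
    done
    exit 1
)

# Example use (made-up sample strings):
#   cmdline_param destiny 'quiet destiny=bmcsetup'   # prints: bmcsetup
#   cmdline_param destiny 'quiet' || echo "no destiny= on command line"
```

The function body runs in a subshell, so set -f and the temporary variables do not leak into the calling shell.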
Re: [xcat-user] Does not continue to bmcsetup once mac is discovered
Hello,

Yes, there is an entry in the networks table and an alias on the xCAT server, so that it can communicate with the client nodes' IMMs, where the address is properly configured as shared when I do runcmd=bmcsetup. Actually, the entry in the table was created at installation, because the alias already existed.

Best regards,
Antoine Tabary
Certified HPC I/T Specialist
e-mail: antoine.tab...@fr.ibm.com

From: Wojciech Turek woj...@gmail.com
To: xCAT Users Mailing list xcat-user@lists.sourceforge.net
Date: 25/07/2012 18:36
Subject: Re: [xcat-user] Does not continue to bmcsetup once mac is discovered

Does your networks table have a definition for the BMC network?

--
Wojciech Turek
Assistant System Manager
High Performance Computing Service
University of Cambridge
Email: wj...@cam.ac.uk
Tel: (+)44 1223 763517
[xcat-user] Getdestiny failing - can't open '/tmp/dhcpserver'
Hello,

I have some new IBM System X DX360M4 nodes (all our previous ones were DX360M3s) that I am trying to autodiscover with xCAT, and I am running into the same problem as in this mail thread:
http://www.mail-archive.com/xcat-user@lists.sourceforge.net/msg01267.html

Essentially, the node boots up, does a dhcpdiscover, and grabs a generic bootloader (in my case yaboot, but it also works with pxelinux.0). That bootloader then reinitiates the dhcpdiscover, queries again for the appropriate bootloader from the file in the xcat/xnba/nets/ folder, and is served (and boots) xnba.kpxe. At this point I believe the xnba image is supposed to load and execute the autodiscovery process, which includes the getdestiny script, but all I am seeing are these messages:

    cat: can't open '/tmp/dhcpserver': No such file or directory
    grep: /tmp/destiny: No such file or directory
    grep: /tmp/destiny: No such file or directory

One reply by Jarrod Johnson in the thread above mentioned a possible network driver issue and suggested using Genesis from xCAT v2.7. According to 'xcatconfig -v' I am running version 2.3.1 of xCAT, on CentOS 5 (2.6.18-128.el5). Is there a way to get this working with my current version of xCAT and OS (inject drivers somehow?), or do I need to update to v2.7 of xCAT (and does that require CentOS 6)?

In any case, is a network driver the most likely explanation for what I am seeing? I have tried everything I can think of on the switch side to make sure SNMP is enabled and the port definitions in the switch table are correct for autodiscovery. Any ideas?

Thanks,
Josh Nielsen
Re: [xcat-user] Getdestiny failing - can't open '/tmp/dhcpserver'
Okay, we will be doing an upgrade of xCAT soon. I just wanted to double check.

P.S. Can you take a crack at answering that gPXE/UNDI question?

On Wed, Jul 25, 2012 at 1:37 PM, Jarrod B Johnson jbjoh...@us.ibm.com wrote:
> Hmm, with xCAT 2.7.3 you should be pulling in the 'xCAT-genesis' packages, which replace the environment with something newer that has the appropriate NIC drivers...
Re: [xcat-user] Getdestiny failing - can't open '/tmp/dhcpserver'
So the issue is that xnba can work fine. The problem is that the Linux image that subsequently loads lacked the driver update.

In an ideal world, we would make an EFI executable that does everything in UEFI and is moderately future-proof. In the practical world, we work with the much richer Linux toolset for lack of time to develop EFI resources.

To: xCAT Users Mailing list xcat-user@lists.sourceforge.net
From: Josh Nielsen jniel...@hudsonalpha.com
Date: 07/25/2012 02:58PM
Subject: Re: [xcat-user] Getdestiny failing - can't open '/tmp/dhcpserver'

Thanks for the information, Lissa.

I do have another, more general question as well, though. It is regarding the xnba boot image itself, which appears to be based on gPXE. The idea of gPXE is that it is UNDI-capable and does not have to use TFTP to serve the images, correct? And looking at the '.kpxe' extension of the xnba.kpxe image/bootloader, it looks like that extension is reserved specifically for images that keep UNDI loaded but unload PXE. According to this page (http://etherboot.org/wiki/gpxe_imagetypes), the extensions for images break down like this:

- .pxe is an image designed to be chainloaded, unloading both the underlying PXE and UNDI code sections.
- .kpxe is a PXE image that keeps UNDI loaded and unloads PXE.
- .kkpxe is a PXE image that keeps PXE+UNDI loaded and returns to PXE (instead of int 18h).

So does xnba.kpxe try to interact with the NIC via UNDI once it loads? If so, since UNDI is an abstracted API, and unless the API has been updated on the newer NICs, shouldn't it work with just about any network card regardless? This PXE chainloading page (http://etherboot.org/wiki/pxechaining) says:

"When chainloading gPXE from PXE, gPXE can use this API (instead of loading an hardware driver). This way, you're getting support for network controllers that are not natively supported by gPXE. Some network controllers have improved performance when using the UNDI driver over the vendor specific gPXE driver."

I'm just curious about some of the theory behind this, because I'm only used to good ol' legacy PXE. All this gPXE and UNDI stuff is new to me.

On Wed, Jul 25, 2012 at 12:57 PM, Lissa Valletta lis...@us.ibm.com wrote:
> Version 2.3.x has not been supported for a long time. You need to be on the latest level of 2.6 for support, but you might as well go to the latest release, which is 2.7.3. You can upgrade xCAT and stay at your current CentOS level.
>
> Lissa K. Valletta
> 2-3/T12 Poughkeepsie, NY 12601
> (tie 293) 433-3102
Re: [xcat-user] Getdestiny failing - can't open '/tmp/dhcpserver'
Okay, thanks for the explanation.

Cheers,
Josh
Re: [xcat-user] postscript not run after install?
On Wed, 25 Jul 2012 07:06:39 Lissa Valletta wrote:
> In fact make sure all files in the /install/postscripts directory and
> subdirectories are world readable. This can cause a failure that will show
> up in the wget.log that Guang Cheng Li asked for below. Thanks!

Ok I will give you a bit of what you want. Like I said, updatenode mynode -P runs the postscripts normally, so these files are currently filled with the information coming from that run. If you need one from a virgin install I'll arrange that.

/tmp/wget.log starts with:

--2012-07-24 16:35:03-- ftp://192.168.14.116/post/otherpkgs/sles11.1/ppc64/loadl/*.rpm
           => `.listing'
Connecting to 192.168.14.116:21... connected.
Logging in as anonymous ... Logged in!
==> SYST ... done.    ==> PWD ... done.
==> TYPE I ... done.  ==> CWD /post/otherpkgs/sles11.1/ppc64/loadl ... done.
==> PASV ... done.    ==> LIST ... done.
0K . 8.08M=0s
2012-07-24 16:35:03 (8.08 MB/s) - `.listing' saved [1313]
Removed `.listing'.
--2012-07-24 16:35:03-- ftp://192.168.14.116/post/otherpkgs/sles11.1/ppc64/loadl/IBMJava2-142-ppc64-JRE-1.4.2-5.0.ppc64.rpm
           => `IBMJava2-142-ppc64-JRE-1.4.2-5.0.ppc64.rpm'
==> CWD not required.
==> PASV ... done.    ==> RETR IBMJava2-142-ppc64-JRE-1.4.2-5.0.ppc64.rpm ... done.
Length: 46118362 (44M)
0K .. .. .. .. .. 0% 48.4M 1s
50K .. .. .. .. .. 0% 75.6M 1s
100K .. .. .. .. .. 0% 89.3M 1s

And I cannot see any report of failure in the file. Similarly /var/log/xcat/xcat.log goes like this:

Running postscript: syslog
Shutting down syslog services..done
Starting syslog services..done
Running postscript: remoteshell
Starting SSH daemon..done
Running postscript: syncfiles
Did not sync any files. Use updatenode -F to sync the files.
Running postscript: setupntp
ntpd
Shutting down network time protocol daemon (NTPD)..done
ntpd: time slew +0.000960s
Starting network time protocol daemon (NTPD)..done
Running postscript: basic_setup
Running postscript: IBMhpc.postscript
if test -e /dev/cpuset || mkdir -p /dev/cpuset ; then mount -t cpuset none /dev/cpuset
Running postscript: ge.create.fen
Running postscript: usrlocalpower7
WARNING - failed to clear out /usr/local directory
Running postscript: credential
MASTER=192.168.14.116 SITEMASTER=192.168.14.116 MONMASTER=192.168.14.116
Loading repository data...
Reading installed packages...
'krb5' is already installed.
No update candidate for 'krb5'.
Resolving package dependencies...
The following NEW packages are going to be installed:
krb5-client pam_krb5
2 new packages to install.
Overall download size: 211.0 KiB. After the operation, additional 651.0 KiB will be used.
Continue? [y/n/?] (y): y
Retrieving package pam_krb5-2.3.1-47.10.15.ppc64 (1/2), 118.0 KiB (393.0 KiB unpacked)
Retrieving: pam_krb5-2.3.1-47.10.15.ppc64.rpm [done]
Installing: pam_krb5-2.3.1-47.10.15 [...done]
Retrieving package krb5-client-1.6.3-133.27.1.ppc64 (2/2), 93.0 KiB (258.0 KiB unpacked)
Retrieving: krb5-client-1.6.3-133.27.1.ppc64.rpm [done]
Installing: krb5-client-1.6.3-133.27.1 [.done]
finished setting up the credentials
Running postscript: otherpkgs
NFSSERVER=192.168.14.116 OTHERPKGDIR=192.168.14.116/post/otherpkgs/sles11.1/ppc64
Repository 'SUSE-Linux-Enterprise-Server-11-SP1 11.1.1-1.152' is up to date.
Repository 'SuSE-Linux-SDK' is up to date.
All repositories have been refreshed.
zypper --non-interactive update --auto-agree-with-license
Loading repository data...
Reading installed packages...

and so on. None of the postscripts involved fail to execute.

Francois

--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats.
http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user
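
The advice quoted in this message, that everything under /install/postscripts must be world readable, can be checked on the management node with plain find. The following is only a minimal sketch: the function name check_postscripts_perms is made up for illustration and is not an xCAT command, and the default path is simply the one named in the thread.

```shell
#!/bin/sh
# Sketch only: check_postscripts_perms is a hypothetical helper, not part of xCAT.
# It prints every entry under a postscripts tree that other users cannot read,
# and every directory that other users cannot traverse.
check_postscripts_perms() {
    dir="${1:-/install/postscripts}"
    # Files or directories missing the "other read" permission bit (octal 004)
    find "$dir" ! -perm -004 -print
    # Directories missing the "other execute" (search) bit (octal 001)
    find "$dir" -type d ! -perm -001 -print
}
```

Calling check_postscripts_perms with no argument walks /install/postscripts; no output means nothing was flagged. Anything it lists could then be compared against the permissions of the stock postscripts, for example the syslog entry shown earlier in the thread.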
Re: [xcat-user] postscript not run after install?
Since you already ran updatenode, then everything in /tmp/wget.log and /var/log/xcat/xcat.log should already be right. Do we still have nodes that just finished the OS installation and have not run updatenode? The /tmp/wget.log and /var/log/xcat/xcat.log should include useful hints.

Thanks,
- Li,Guang Cheng (李光成)
IBM China System Technology Laboratory
Email: ligua...@cn.ibm.com
Address: Building 28, ZhongGuanCun Software Park, No.8, Dong Bei Wang West Road, Haidian District Beijing 100193, PRC
北京市海淀区东北旺西路8号中关村软件园28号楼 邮编: 100193
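
For anyone collecting the two logs from a node that has just finished installing and has not yet been through updatenode, a rough way to condense them before posting might look like the sketch below. It is an assumption-laden illustration, not an xCAT tool: the function name summarise_postscript_logs is invented, the "Running postscript:" marker is taken from the xcat.log excerpt in this thread, and the grep patterns are a guess at what trouble might look like rather than a list of known messages.

```shell
#!/bin/sh
# Sketch only: summarise_postscript_logs is a hypothetical helper, not part of xCAT.
# Arguments default to the two node-side log paths named in the thread.
summarise_postscript_logs() {
    xcat_log="${1:-/var/log/xcat/xcat.log}"
    wget_log="${2:-/tmp/wget.log}"

    # An absent log on a freshly installed node is itself worth reporting.
    for f in "$xcat_log" "$wget_log"; do
        if [ ! -r "$f" ]; then
            echo "MISSING: $f"
        fi
    done

    if [ -r "$xcat_log" ]; then
        echo "Postscripts recorded in $xcat_log:"
        # The excerpt in the thread announces each script on a
        # "Running postscript: NAME" line; print only the names.
        sed -n 's/^.*Running postscript: *//p' "$xcat_log"
        echo "Lines mentioning a warning or failure:"
        grep -n -i -E 'warning|fail' "$xcat_log" || echo "(none)"
    fi

    if [ -r "$wget_log" ]; then
        echo "Possible download problems in $wget_log:"
        # Heuristic patterns only; they are assumptions, not known wget messages.
        grep -n -i -E 'error|failed|denied|no such' "$wget_log" || echo "(none)"
    fi
}
```

Run on the compute node with no arguments, it reads /var/log/xcat/xcat.log and /tmp/wget.log. An empty list of postscripts, or a MISSING line, would support the observation that the scripts were never started, as opposed to having started and failed.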