Re: [xcat-user] postscript not run after install?

2012-07-25 Thread Lissa Valletta
In fact make sure all  files in the /install/postscripts directory and
subdirectories are world readable. This can cause a failure that will
show up in the wget.log that Guang Cheng Li asked for below.
Thanks!

Lissa K. Valletta
2-3/T12
Poughkeepsie, NY 12601
(tie 293) 433-3102





From:   Lissa Valletta/Poughkeepsie/IBM@IBMUS
To: xCAT Users Mailing list xcat-user@lists.sourceforge.net
Cc: xCAT Users Mailing list xcat-user@lists.sourceforge.net
Date:   07/25/2012 07:03 AM
Subject:Re: [xcat-user] postscript not run after install?



Do  the postscripts you added  to /install/postscripts  have the correct
permission.  They must be world-readable and executable at least by root.
Check against our postscripts, something like
-rwxr-xr-x 1 root root 9869 Jul  3 09:50 syslog

Also run lsdef nodename
what is in the postscripts attribute?


Lissa K. Valletta
2-3/T12
Poughkeepsie, NY 12601
(tie 293) 433-3102



Inactive hide details for Guang Cheng Li ---07/25/2012 12:14:37 AM---Before
the postscripts are run, the postscripts need to beGuang Cheng Li
---07/25/2012 12:14:37 AM---Before the postscripts are run, the postscripts
need to be downloaded from the management node or se

From: Guang Cheng Li ligua...@cn.ibm.com
To: xCAT Users Mailing list xcat-user@lists.sourceforge.net
Date: 07/25/2012 12:14 AM
Subject: Re: [xcat-user] postscript not run after install?



Before the postscripts are run, the postscripts need to be downloaded from
the management node or service node, if the download fails, the postscripts
will not be run at all, at least seems to not run at all.

Could you please post the content of /tmp/wget.log
and /var/log/xcat/xcat.log on the compute node?

Thanks,
-
Li,Guang Cheng (李光成)
IBM China System Technology Laboratory
Email: ligua...@cn.ibm.com
Address: Building 28, ZhongGuanCun Software Park,
No.8, Dong Bei Wang West Road, Haidian District Beijing 100193, PRC

北京市海淀区东北旺西路8号中关村软件园28号楼
邮编: 100193

Inactive hide details for SYED ASIF ZAHEER ---2012-07-25 12:04:46---From:
SYED ASIF ZAHEER sa_zah...@hotmail.com To: XCAT HelSYED ASIF ZAHEER
---2012-07-25 12:04:46---From: SYED ASIF ZAHEER sa_zah...@hotmail.com To:
XCAT Help xcat-user@lists.sourceforge.net

From: SYED ASIF ZAHEER sa_zah...@hotmail.com
To: XCAT Help xcat-user@lists.sourceforge.net
Date: 2012-07-25 12:04
Subject: Re: [xcat-user] postscript not run after install?





did you add your script to run in postscripts table

tabedit postscripts


[root@xgplg ~]# tabdump postscripts
#node,postscripts,postbootscripts,comments,disable
xcatdefaults,syslog,remoteshell,syncfiles,otherpkgs,,
service,servicenode,xcatserver,xcatclient,,,
compute,script_filename.sh,,,






Regards
Syed Asif Zaheer
sa_zah...@hotmail.com




 Date: Wed, 25 Jul 2012 14:26:34 +1200
 From: francois.bis...@canterbury.ac.nz
 To: xcat-user@lists.sourceforge.net
 Subject: Re: [xcat-user] postscript not run after install?

 On Wed, 25 Jul 2012 10:16:40 Guang Cheng Li wrote:
  HI,
 
  A lot of reasons might cause postscripts could not be run successfully,
  like the compute node can not resolve the hostname of the management
node.
  Try the following debug procedure:
 
  1. Anything in /tmp/wget.log? is the node trying to get the postscripts
  from the correct server?
 
  2. /var/log/xcat/xcat.log, it might indicate what is going wrong
 

 In both case I only see messages coming from the time I ran
 updatenode nodename -P
 which does run fine. As far as I can tell it is not that the postscripts
 fail but they are not run at all after the install is finished.
 All I am seeing now is
 install
 reboot
 finish the install
 run postbootscript

 At no point are the postscripts run. I am not saying failling, I am
saying
 not being run at all.

 Dr Francois Bissey
 BlueFern team




--

 Live Security Virtual Conference
 Exclusive live event will cover all the ways today's security and
 threat landscape has changed and how IT managers can respond. Discussions

 will include endpoint security, mobile security and the latest in malware

 threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
 ___
 xCAT-user mailing list
 xCAT-user@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/xcat-user
--

Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and
threat landscape has changed and how IT managers can respond. Discussions
will include endpoint security, mobile security and the latest in malware
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user

Re: [xcat-user] Install stateful node that is also connected to SAN storage

2012-07-25 Thread Christian Caruthers

In the past, I added 'nostorage' to bootparams.addkcmdline and then in the
KS template added:

For RHEL = 5.x:
device scsi mptsas

In the ongoing effort to fix that which is not broken, Red Hat apparently
removed this from kickstart, but I found the following which might prove
helpful:

https://www.redhat.com/archives/rhelv6-beta-list/2010-April/msg00090.html

Don't have anything to test it on, though. YMMV since it's talking about
6-beta and the nostorage option was accidentally left in the documentation
after 6 was released.

Also might be useful:
http://osdir.com/ml/kickstart-list/2012-02/msg00010.html

Regards,

Christian D. Caruthers
Senior Consultant - System x Linux HPC
   
   
   
 2D barcode - Mobile: 1-757-289-9872 | Phone: 1-804-327-4559
IBM
 encoded with E-mail: christian.caruth...@us.ibm.com   
 contact  Find me on: LinkedIn: 
9201 Arboretum Pkwy
 information  http://www.linkedin.com/profile/view?id=14378571trk=tab_pro  
Richmond, VA 23236-5402

  United States
  A common mistake that people make when trying to design 
  something completely foolproof is to underestimate the   
  ingenuity of complete fools. - Douglas Adams
   





From:   Adam Emerich/Rochester/IBM@IBMUS
To: xcat-user@lists.sourceforge.net,
Date:   07/25/2012 01:08 AM
Subject:[xcat-user] Install stateful node that is also connected to SAN
storage



I have some stateful nodes in my cluster that are also attached to SAN
storage.  I have found that when installing the node,
the install process accesses the SAN drives and corrupts them.  I have the
following in the *.tmpl (which came with the xcat install)
file which I thought would eliminate this, but I have not been successful
in my testing:

%pre
#!/bin/sh
# This will remove the loaded HBA modules from the kernel
#modprobe mptsas
#modprobe mptscsih
#modprobe mptbase
#modprobe scsi_transport_sas
remove_qla(){
for i in $(lsmod | grep qla | awk .{print $1.}); do
echo Will remove: $i  /dev/tty1
rmmod $i
sleep 1
done
}

remove_lpfc(){
for i in $(lsmod | grep lpfc | awk .{print $1.}); do
echo Will remove: $i  /dev/tty1
rmmod $i
sleep 1
done
}

remove_qla
sleep 2
remove_qla
remove_lpfc
echo All qla modules removed
sleep 10
#INCLUDE:../scripts/pre.rh#
%post
#INCLUDE:../scripts/post.rh#

Linux:  Red Hat Enterprise Linux Server release 6.1 (Santiago)
XCaT: 2.6.8

Any help would be greatly appreciated.
--

Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and
threat landscape has changed and how IT managers can respond. Discussions
will include endpoint security, mobile security and the latest in malware
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user
inline: 61708291.jpginline: 61616795.jpginline: 61630149.gifinline: graycol.gif--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/___
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user


[xcat-user] Does not continue to bmcsetup once mac is discovered

2012-07-25 Thread Antoine Tabary
Hello,

When I power on a node, discovery works fine the mac address is added to 
table mac, and /var/lib/dhcpd/dhcpd.leases is updated. The node renews its 
IP address changing the discovery address by its final address, but then 
it loops forever with an Unrecognized directive blank coming from 
/bin/doxcat because /proc/cmdline does not contain destiny=, actually it 
still has the old entry with the discovery address.

Any idea why it is doing that ?

My workaround is : nodeset runcmd=bmcsetup once mac address is discovered 
and reboot. But it is an extra boot.

Thank you for your help.

Best regards,



Antoine Tabary
 17 Avenue De L'europe

Certified HPC I/T Specialist
 Bois Colombes Cedex, 92275
6520AA
 France
ITS
 



 



 

e-mail:
antoine.tab...@fr.ibm.com
 


 
 


Sauf indication contraire ci-dessus:/ Unless stated otherwise above:
Compagnie IBM France
Siège Social : 17 avenue de l'Europe, 92275 Bois-Colombes Cedex
RCS Nanterre 552 118 465
Forme Sociale : S.A.S.
Capital Social : 645.605.931,30 ?
SIREN/SIRET : 552 118 465 03644 - Code NAF 6202A image/gif--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/___
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user


Re: [xcat-user] Does not continue to bmcsetup once mac is discovered

2012-07-25 Thread Antoine Tabary
Hello,

Yes there is an entry in the networks table and an alias on the xCAT 
server so that it can communicate with the client nodes imm where the 
address is properly configured as shared when I do runcmd=bmcsetup. 
Actually, the entry in the table was created at installation because the 
alias existed before.

Best regards,



Antoine Tabary
 17 Avenue De L'europe

Certified HPC I/T Specialist
 Bois Colombes Cedex, 92275
6520AA
 France
ITS
 



 



 

e-mail:
antoine.tab...@fr.ibm.com
 


 
 




From:   Wojciech Turek woj...@gmail.com
To: xCAT Users Mailing list xcat-user@lists.sourceforge.net
Date:   25/07/2012 18:36
Subject:Re: [xcat-user] Does not continue to bmcsetup once mac is 
discovered



Does your network table has a definition for the bmc network?

On 25 July 2012 17:22, Antoine Tabary antoine.tab...@fr.ibm.com wrote:
Hello,

When I power on a node, discovery works fine the mac address is added to 
table mac, and /var/lib/dhcpd/dhcpd.leases is updated. The node renews its 
IP address changing the discovery address by its final address, but then 
it loops forever with an Unrecognized directive blank coming from 
/bin/doxcat because /proc/cmdline does not contain destiny=, actually it 
still has the old entry with the discovery address. 

Any idea why it is doing that ? 

My workaround is : nodeset runcmd=bmcsetup once mac address is discovered 
and reboot. But it is an extra boot. 

Thank you for your help. 

Best regards, 


Antoine Tabary 
 17 Avenue De L'europe 

Certified HPC I/T Specialist 
 Bois Colombes Cedex, 92275 
6520AA 
 France 
ITS 
  



  



  

e-mail: 
antoine.tab...@fr.ibm.com 
  


  
 



Sauf indication contraire ci-dessus:/ Unless stated otherwise above:
Compagnie IBM France
Siège Social : 17 avenue de l'Europe, 92275 Bois-Colombes Cedex
RCS Nanterre 552 118 465
Forme Sociale : S.A.S.
Capital Social : 645.605.931,30 ?
SIREN/SIRET : 552 118 465 03644 - Code NAF 6202A 
--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and
threat landscape has changed and how IT managers can respond. Discussions
will include endpoint security, mobile security and the latest in malware
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user




-- 
--
Wojciech Turek

Assistant System Manager

High Performance Computing Service
University of Cambridge
Email: wj...@cam.ac.uk
Tel: (+)44 1223 763517 
--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user



Sauf indication contraire ci-dessus:/ Unless stated otherwise above:
Compagnie IBM France
Siège Social : 17 avenue de l'Europe, 92275 Bois-Colombes Cedex
RCS Nanterre 552 118 465
Forme Sociale : S.A.S.
Capital Social : 645.605.931,30 ?
SIREN/SIRET : 552 118 465 03644 - Code NAF 6202A image/gifimage/gif--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/___
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user


[xcat-user] Getdestiny failing - can't open '/tmp/dhcpserver'

2012-07-25 Thread Josh Nielsen
Hello,

I have some new IBM System X DX360M4 nodes (all our previous ones were
DX360M3s) that I am trying to autodiscover with xCAT and I am running into
the same problem as in this mail thread:
http://www.mail-archive.com/xcat-user@lists.sourceforge.net/msg01267.html.
Essentially the node boots up, does a dhcpdiscover, and grabs a generic
bootloader (in my case yaboot but it also works with pxelinux.0) which then
reinitiates the dhcpdiscover and queries again for the appropriate
bootloader from the file in the xcat/xnba/nets/ folder and is served 
boots xnba.kpxe. At this point I believe the xnba image is supposed to load
and execute the autodiscovery process which includes the getdestiny script,
but all I am seeing are the messages:

cat: can't open '/tmp/dhcpserver': No such file or directory
grep: /tmp/destiny: No such file or directory
grep: /tmp/destiny: No such file or directory

One reply by Jarrod Johnson to that email thread above mentioned a possible
network driver issue and suggested using Genesis from xCAT v2.7. According
to 'xcatconfig -v' I am running Version 2.3.1 of xCAT and I am running
on Centos 5 (2.6.18-128.el5). Is there a way to get this working with my
current version of xCAT and OS (inject drivers somehow?), or do I need to
update to v2.7 of xCAT (does that require Centos 6)? In any case, is a
network driver the most likely explanation for what I am seeing? I have
tried everything that I can think of from the switch side to make sure SNMP
is enabled and the port definitions are correct in the switch table for
autodiscovery. Any ideas?

Thanks,
Josh Nielsen
--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/___
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user


Re: [xcat-user] Getdestiny failing - can't open '/tmp/dhcpserver'

2012-07-25 Thread Josh Nielsen
Okay, we will be doing an upgrade of xCAT soon. I just wanted to double
check.

P.S. Can you take a crack at answering that gPXE/UNDI question?

On Wed, Jul 25, 2012 at 1:37 PM, Jarrod B Johnson jbjoh...@us.ibm.comwrote:

 Hmm, with xcat 2.7.3 you should be pulling in the 'xCAT-genesis' packages
 that replace the environment with something newer that has the appropriate
 nic drivers...
 -Josh Nielsen jniel...@hudsonalpha.com jniel...@hudsonalpha.comwrote: 
 -

 To: xcat-user@lists.sourceforge.net
 From: Josh Nielsen jniel...@hudsonalpha.com jniel...@hudsonalpha.com
 Date: 07/25/2012 12:59PM
 Subject: [xcat-user] Getdestiny failing - can't open '/tmp/dhcpserver'

 Hello,

 I have some new IBM System X DX360M4 nodes (all our previous ones were
 DX360M3s) that I am trying to autodiscover with xCAT and I am running into
 the same problem as in this mail thread:
 http://www.mail-archive.com/xcat-user@lists.sourceforge.net/msg01267.html.
 Essentially the node boots up, does a dhcpdiscover, and grabs a generic
 bootloader (in my case yaboot but it also works with pxelinux.0) which then
 reinitiates the dhcpdiscover and queries again for the appropriate
 bootloader from the file in the xcat/xnba/nets/ folder and is served 
 boots xnba.kpxe. At this point I believe the xnba image is supposed to load
 and execute the autodiscovery process which includes the getdestiny script,
 but all I am seeing are the messages:


 cat: can't open '/tmp/dhcpserver': No such file or directory
 grep: /tmp/destiny: No such file or directory
 grep: /tmp/destiny: No such file or directory


 One reply by Jarrod Johnson to that email thread above mentioned a
 possible network driver issue and suggested using Genesis from xCAT
 v2.7. According to 'xcatconfig -v' I am running Version 2.3.1 of xCAT and I
 am running on Centos 5 (2.6.18-128.el5). Is there a way to get this working
 with my current version of xCAT and OS (inject drivers somehow?), or do I
 need to update to v2.7 of xCAT (does that require Centos 6)? In any case,
 is a network driver the most likely explanation for what I am seeing? I
 have tried everything that I can think of from the switch side to make sure
 SNMP is enabled and the port definitions are correct in the switch table
 for autodiscovery. Any ideas?


 Thanks,
 Josh Nielsen


 --
 Live Security Virtual Conference
 Exclusive live event will cover all the ways today's security and
 threat landscape has changed and how IT managers can respond. Discussions
 will include endpoint security, mobile security and the latest in malware
 threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
 ___
 xCAT-user mailing list
 xCAT-user@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/xcat-user




 --
 Live Security Virtual Conference
 Exclusive live event will cover all the ways today's security and
 threat landscape has changed and how IT managers can respond. Discussions
 will include endpoint security, mobile security and the latest in malware
 threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
 ___
 xCAT-user mailing list
 xCAT-user@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/xcat-user


--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/___
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user


Re: [xcat-user] Getdestiny failing - can't open '/tmp/dhcpserver'

2012-07-25 Thread Jarrod B Johnson
So the issue is that the xnba can work fine. The problem is the linux image that subsequently loads lacked the driver update. In an ideal world, we make an efi executable that does everything in UEFI that is moderately future proof. In a practical world we work with the much richer linux toolset for lack of time to develop EFI resources.-Josh Nielsen jniel...@hudsonalpha.com wrote: -To: xCAT Users Mailing list xcat-user@lists.sourceforge.netFrom: Josh Nielsen jniel...@hudsonalpha.comDate: 07/25/2012 02:58PMSubject: Re: [xcat-user] Getdestiny failing - can't open '/tmp/dhcpserver'Thanks for the information Lissa.I do have another more general question as well though. It is regarding the xnba boot image itself, which appears to be based off of gPXE. The idea of gPXE is that it is UNDI-capable and does not have to use TFTP to serve the images correct? And looking at the xnba.kpxe image/bootloader extension of '.kpxe' it looks like that is reserved specifically for images that load UNDI but offload PXE. According to this page (http://etherboot.org/wiki/gpxe_imagetypes) the extensions for images break down like this:- .pxe is an image designed to be chainloaded, unloading both the underlying PXE and UNDI code sections.- .kpxe is a PXE image that keeps UNDI loaded and unloads PXE- .kkpxe is a PXE image that keeps PXE+UNDI loaded and return to PXE (instead of int 18h).So does xnba.kpxe try to interact with the NIC card via UNDI once it loads? If so since UNDI is an abstracted API, and unless the API has been updated on the newer NIC cards, shouldn't it work with just about any network card regardless? From this PXE chainloading page (http://etherboot.org/wiki/pxechaining) it says:
"When chainloading gPXE from PXE, gPXE can use this API (instead of loading an hardware driver). This way, you're getting support for network controllers that are not natively supported by gPXE. Some network controllers have improved performance when using the UNDI driver over the vendor specific gPXE driver."
I'm just curious about some of the theory behind this because I'm only used to good ol' legacy PXE. All this gPXE and UNDI stuff is new to me.On Wed, Jul 25, 2012 at 12:57 PM, Lissa Valletta lis...@us.ibm.com wrote:
Version 2.3.x has not been supported for a long time. You need to be on the latest level of 2.6 for support , but you might as well go to the latest release which is 2.7.3.   You can upgrade xCAT and stay at your current Centos level. 


Lissa K. Valletta
2-3/T12
Poughkeepsie, NY 12601
(tie 293) 433-3102

Josh Nielsen ---07/25/2012 01:33:25 PM---Hello, I have some new IBM System X DX360M4 nodes (all our previous ones were

From:Josh Nielsen jniel...@hudsonalpha.com
To:xcat-user@lists.sourceforge.net
Date:07/25/2012 01:33 PM
Subject:[xcat-user] Getdestiny failing - can't open '/tmp/dhcpserver'

Hello,I have some new IBM System X DX360M4 nodes (all our previous ones were DX360M3s) that I am trying to autodiscover with xCAT and I am running into the same problem as in this mail thread:http://www.mail-archive.com/xcat-user@lists.sourceforge.net/msg01267.html. Essentially the node boots up, does a dhcpdiscover, and grabs a generic bootloader (in my case yaboot but it also works with pxelinux.0) which then reinitiates the dhcpdiscover and queries again for the appropriate bootloader from the file in the xcat/xnba/nets/ folder and is served  bootsxnba.kpxe. At this point I believe the xnba image is supposed to load and execute the autodiscovery process which includes the getdestiny script, but all I am seeing are the messages:
cat: can't open '/tmp/dhcpserver': No such file or directory
grep: /tmp/destiny: No such file or directory
grep: /tmp/destiny: No such file or directory
One reply by Jarrod Johnson to that email thread above mentioned a possible network driver issue and suggested using Genesis from xCAT v2.7.According to 'xcatconfig -v' I am running Version 2.3.1 of xCAT and I am running onCentos 5 (2.6.18-128.el5). Is there a way to get this working with my current version of xCAT and OS (inject drivers somehow?), or do I need to update to v2.7 of xCAT (does that require Centos 6)? In any case, is a network driver the most likely explanation for what I am seeing? I have tried everything that I can think of from the switch side to make sure SNMP is enabled and the port definitions are correct in the switch table for autodiscovery. Any ideas?
Thanks,
Josh Nielsen--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/___

xCAT-user mailing list
xCAT-user@lists.sourceforge.net

Re: [xcat-user] Getdestiny failing - can't open '/tmp/dhcpserver'

2012-07-25 Thread Josh Nielsen
Okay, thanks for the explanation.

Cheers,
Josh

On Wed, Jul 25, 2012 at 2:28 PM, Jarrod B Johnson jbjoh...@us.ibm.comwrote:

 So the issue is that the xnba can work fine.  The problem is the linux
 image that subsequently loads lacked the driver update.  In an ideal world,
 we make an efi executable that does everything in UEFI that is moderately
 future proof.  In a practical world we work with the much richer linux
 toolset for lack of time to develop EFI resources.

 -Josh Nielsen jniel...@hudsonalpha.com jniel...@hudsonalpha.comwrote: 
 -

 To: xCAT Users Mailing list 
 xcat-user@lists.sourceforge.netxcat-user@lists.sourceforge.net
 From: Josh Nielsen jniel...@hudsonalpha.com jniel...@hudsonalpha.com
 Date: 07/25/2012 02:58PM
 Subject: Re: [xcat-user] Getdestiny failing - can't open '/tmp/dhcpserver'

 Thanks for the information Lissa.

 I do have another more general question as well though. It is regarding
 the xnba boot image itself, which appears to be based off of gPXE. The idea
 of gPXE is that it is UNDI-capable and does not have to use TFTP to serve
 the images correct? And looking at the xnba.kpxe image/bootloader extension
 of '.kpxe' it looks like that is reserved specifically for images that load
 UNDI but offload PXE. According to this page (
 http://etherboot.org/wiki/gpxe_imagetypes) the extensions for images
 break down like this:

 - .pxe is an image designed to be chainloaded, unloading both the
 underlying PXE and UNDI code sections.
 - .kpxe is a PXE image that keeps UNDI loaded and unloads PXE
 - .kkpxe is a PXE image that keeps PXE+UNDI loaded and return to PXE
 (instead of int 18h).

 So does xnba.kpxe try to interact with the NIC card via UNDI once it
 loads? If so since UNDI is an abstracted API, and unless the API has been
 updated on the newer NIC cards, shouldn't it work with just about any
 network card regardless? From this PXE chainloading page (
 http://etherboot.org/wiki/pxechaining) it says:

 When chainloading gPXE from PXE, gPXE can use this API (instead of
 loading an hardware driver). This way, you're getting support for network
 controllers that are not natively supported by gPXE. Some network
 controllers have improved performance when using the UNDI driver over the
 vendor specific gPXE driver.
 I'm just curious about some of the theory behind this because I'm only
 used to good ol' legacy PXE. All this gPXE and UNDI stuff is new to me.


 On Wed, Jul 25, 2012 at 12:57 PM, Lissa Valletta lis...@us.ibm.comwrote:


 Version 2.3.x has not been supported for a long time.  You need to be on
 the latest level of 2.6 for support , but you might as well  go to the
 latest release which is  2.7.3. You can upgrade xCAT and stay at your
 current Centos level.

 Lissa K. Valletta
 2-3/T12
 Poughkeepsie, NY 12601
 (tie 293) 433-3102



 Josh Nielsen ---07/25/2012 01:33:25 PM---Hello, I have some new IBM
 System X DX360M4 nodes (all our previous ones were

 From: Josh Nielsen jniel...@hudsonalpha.com
 To: xcat-user@lists.sourceforge.net
 Date: 07/25/2012 01:33 PM
 Subject: [xcat-user] Getdestiny failing - can't open '/tmp/dhcpserver'

 --



 Hello,
 I have some new IBM System X DX360M4 nodes (all our previous ones were
 DX360M3s) that I am trying to autodiscover with xCAT and I am running into
 the same problem as in this mail thread: *
 http://www.mail-archive.com/xcat-user@lists.sourceforge.net/msg01267.html
 *http://www.mail-archive.com/xcat-user@lists.sourceforge.net/msg01267.html.
 Essentially the node boots up, does a dhcpdiscover, and grabs a generic
 bootloader (in my case yaboot but it also works with pxelinux.0) which then
 reinitiates the dhcpdiscover and queries again for the appropriate
 bootloader from the file in the xcat/xnba/nets/ folder and is served 
 boots xnba.kpxe. At this point I believe the xnba image is supposed to load
 and execute the autodiscovery process which includes the getdestiny script,
 but all I am seeing are the messages:

 cat: can't open '/tmp/dhcpserver': No such file or directory
 grep: /tmp/destiny: No such file or directory
 grep: /tmp/destiny: No such file or directory

 One reply by Jarrod Johnson to that email thread above mentioned a
 possible network driver issue and suggested using Genesis from xCAT
 v2.7. According to 'xcatconfig -v' I am running Version 2.3.1 of xCAT and I
 am running on Centos 5 (2.6.18-128.el5). Is there a way to get this working
 with my current version of xCAT and OS (inject drivers somehow?), or do I
 need to update to v2.7 of xCAT (does that require Centos 6)? In any case,
 is a network driver the most likely explanation for what I am seeing? I
 have tried everything that I can think of from the switch side to make sure
 SNMP is enabled and the port definitions are correct in the switch table
 for autodiscovery. Any ideas?

 Thanks,
 Josh Nielsen
 --
 Live Security Virtual Conference
 

Re: [xcat-user] postscript not run after install?

2012-07-25 Thread François Bissey
On Wed, 25 Jul 2012 07:06:39 Lissa Valletta wrote:
 In fact make sure all  files in the /install/postscripts directory and
 subdirectories are world readable. This can cause a failure that will
 show up in the wget.log that Guang Cheng Li asked for below.
 Thanks!
 
Ok I will give you a bit of what you want. Like I said 
updatenode mynode -P
runs the postscripts normally so thses files are currently filled with the 
information coming from that run. If you need one from a virgin install
I'll arrange that.
/tmp/wget.log  starts with:
--2012-07-24 16:35:03--  
ftp://192.168.14.116/post/otherpkgs/sles11.1/ppc64/loadl/*.rpm
   = `.listing'
Connecting to 192.168.14.116:21... connected.
Logging in as anonymous ... Logged in!
== SYST ... done.== PWD ... done.
== TYPE I ... done.  == CWD /post/otherpkgs/sles11.1/ppc64/loadl ... done.
== PASV ... done.== LIST ... done.

 0K .  8.08M=0s

2012-07-24 16:35:03 (8.08 MB/s) - `.listing' saved [1313]

Removed `.listing'.
--2012-07-24 16:35:03--  
ftp://192.168.14.116/post/otherpkgs/sles11.1/ppc64/loadl/IBMJava2-142-ppc64-
JRE-1.4.2-5.0.ppc64.rpm
   = `IBMJava2-142-ppc64-JRE-1.4.2-5.0.ppc64.rpm'
== CWD not required.
== PASV ... done.== RETR IBMJava2-142-ppc64-JRE-1.4.2-5.0.ppc64.rpm ... 
done.
Length: 46118362 (44M)

 0K .. .. .. .. ..  0% 48.4M 1s
50K .. .. .. .. ..  0% 75.6M 1s
   100K .. .. .. .. ..  0% 89.3M 1s

And I cannot see any report of failure in the file.

Similarly /var/log/xcat/xcat.log goes like this:
Running postscript: syslog
Shutting down syslog services..done
Starting syslog services..done
Running postscript: remoteshell

Starting SSH daemon..done
Running postscript: syncfiles
  Did not sync any files. Use updatenode -F to sync the files.
Running postscript: setupntp
ntpd
Shutting down network time protocol daemon (NTPD)..done
ntpd: time slew +0.000960s
Starting network time protocol daemon (NTPD)..done
Running postscript: basic_setup
Running postscript: IBMhpc.postscript
if test -e /dev/cpuset || mkdir -p /dev/cpuset ; then
mount -t cpuset none /dev/cpuset
Running postscript: ge.create.fen
Running postscript: usrlocalpower7
WARNING - failed to clear out /usr/local directory
Running postscript: credential
MASTER=192.168.14.116
SITEMASTER=192.168.14.116
MONMASTER=192.168.14.116
Loading repository data...
Reading installed packages...
'krb5' is already installed.
No update candidate for 'krb5'.
Resolving package dependencies...

The following NEW packages are going to be installed:
  krb5-client pam_krb5 

2 new packages to install.
Overall download size: 211.0 KiB. After the operation, additional 651.0 KiB 
will be used.
Continue? [y/n/?] (y): y
Retrieving package pam_krb5-2.3.1-47.10.15.ppc64 (1/2), 118.0 KiB (393.0 KiB 
unpacked)
Retrieving: pam_krb5-2.3.1-47.10.15.ppc64.rpm [done]
Installing: pam_krb5-2.3.1-47.10.15 [...done]
Retrieving package krb5-client-1.6.3-133.27.1.ppc64 (2/2), 93.0 KiB (258.0 KiB 
unpacked)
Retrieving: krb5-client-1.6.3-133.27.1.ppc64.rpm [done]
Installing: krb5-client-1.6.3-133.27.1 [.done]
finished setting up the credentials
Running postscript: otherpkgs
NFSSERVER=192.168.14.116
OTHERPKGDIR=192.168.14.116/post/otherpkgs/sles11.1/ppc64
Repository 'SUSE-Linux-Enterprise-Server-11-SP1 11.1.1-1.152' is up to date.
Repository 'SuSE-Linux-SDK' is up to date.
All repositories have been refreshed.
zypper --non-interactive update --auto-agree-with-license
Loading repository data...
Reading installed packages...


and so on. None of the postscripts involved fail to execute.

Francois


--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user


Re: [xcat-user] postscript not run after install?

2012-07-25 Thread Guang Cheng Li
Since you already ran updatenode, then everything in /tmp/wget.log
and /var/log/xcat/xcat.log should already be right. Do we still have nodes
that just finished the os installation and have not run updatenode?
the /tmp/wget.log and /var/log/xcat/xcat.log should include useful hints.

Thanks,
-
 Li,Guang Cheng (李光成)
 IBM China System Technology Laboratory

 Email: ligua...@cn.ibm.com
 Address: Building 28, ZhongGuanCun Software Park,
  No.8, Dong Bei Wang West Road, Haidian District Beijing 100193,
PRC

北京市海淀区东北旺西路8号中关村软件园28号楼
邮编: 100193



From:   François Bissey francois.bis...@canterbury.ac.nz
To: xcat-user@lists.sourceforge.net
Date:   2012-07-26 06:37
Subject:Re: [xcat-user] postscript not run after install?



On Wed, 25 Jul 2012 07:06:39 Lissa Valletta wrote:
 In fact make sure all  files in the /install/postscripts directory and
 subdirectories are world readable. This can cause a failure that will
 show up in the wget.log that Guang Cheng Li asked for below.
 Thanks!

Ok I will give you a bit of what you want. Like I said
updatenode mynode -P
runs the postscripts normally so thses files are currently filled with the
information coming from that run. If you need one from a virgin install
I'll arrange that.
/tmp/wget.log  starts with:
--2012-07-24 16:35:03--
ftp://192.168.14.116/post/otherpkgs/sles11.1/ppc64/loadl/*.rpm
   = `.listing'
Connecting to 192.168.14.116:21... connected.
Logging in as anonymous ... Logged in!
== SYST ... done.== PWD ... done.
== TYPE I ... done.  == CWD /post/otherpkgs/sles11.1/ppc64/loadl ...
done.
== PASV ... done.== LIST ... done.

 0K .  8.08M=0s

2012-07-24 16:35:03 (8.08 MB/s) - `.listing' saved [1313]

Removed `.listing'.
--2012-07-24 16:35:03--
ftp://192.168.14.116/post/otherpkgs/sles11.1/ppc64/loadl/IBMJava2-142-ppc64-

JRE-1.4.2-5.0.ppc64.rpm
   = `IBMJava2-142-ppc64-JRE-1.4.2-5.0.ppc64.rpm'
== CWD not required.
== PASV ... done.== RETR
IBMJava2-142-ppc64-JRE-1.4.2-5.0.ppc64.rpm ...
done.
Length: 46118362 (44M)

 0K .. .. .. .. ..  0% 48.4M 1s
50K .. .. .. .. ..  0% 75.6M 1s
   100K .. .. .. .. ..  0% 89.3M 1s

And I cannot see any report of failure in the file.

Similarly /var/log/xcat/xcat.log goes like this:
Running postscript: syslog
Shutting down syslog services..done
Starting syslog services..done
Running postscript: remoteshell

Starting SSH daemon..done
Running postscript: syncfiles
  Did not sync any files. Use updatenode -F to sync the files.
Running postscript: setupntp
ntpd
Shutting down network time protocol daemon (NTPD)..done
ntpd: time slew +0.000960s
Starting network time protocol daemon (NTPD)..done
Running postscript: basic_setup
Running postscript: IBMhpc.postscript
if test -e /dev/cpuset || mkdir -p /dev/cpuset ; then
mount -t cpuset none /dev/cpuset
Running postscript: ge.create.fen
Running postscript: usrlocalpower7
WARNING - failed to clear out /usr/local directory
Running postscript: credential
MASTER=192.168.14.116
SITEMASTER=192.168.14.116
MONMASTER=192.168.14.116
Loading repository data...
Reading installed packages...
'krb5' is already installed.
No update candidate for 'krb5'.
Resolving package dependencies...

The following NEW packages are going to be installed:
  krb5-client pam_krb5

2 new packages to install.
Overall download size: 211.0 KiB. After the operation, additional 651.0 KiB

will be used.
Continue? [y/n/?] (y): y
Retrieving package pam_krb5-2.3.1-47.10.15.ppc64 (1/2), 118.0 KiB (393.0
KiB
unpacked)
Retrieving: pam_krb5-2.3.1-47.10.15.ppc64.rpm [done]
Installing: pam_krb5-2.3.1-47.10.15 [...done]
Retrieving package krb5-client-1.6.3-133.27.1.ppc64 (2/2), 93.0 KiB (258.0
KiB
unpacked)
Retrieving: krb5-client-1.6.3-133.27.1.ppc64.rpm [done]
Installing: krb5-client-1.6.3-133.27.1 [.done]
finished setting up the credentials
Running postscript: otherpkgs
NFSSERVER=192.168.14.116
OTHERPKGDIR=192.168.14.116/post/otherpkgs/sles11.1/ppc64
Repository 'SUSE-Linux-Enterprise-Server-11-SP1 11.1.1-1.152' is up to
date.
Repository 'SuSE-Linux-SDK' is up to date.
All repositories have been refreshed.
zypper --non-interactive update --auto-agree-with-license
Loading repository data...
Reading installed packages...


and so on. None of the postscripts involved fail to execute.

Francois


--

Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and
threat landscape has changed and how IT managers can respond. Discussions
will include endpoint security, mobile security and the latest in malware
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___