Could you show out the kernel parameters which used during the booting? From code that the command 'wget http://9.3.189.138/install/netboot/sles11/ppc64/compute/rootimg.gz' which should be run to download the rootimg.gz to the node was not run or failed.
Looks like in your environment that keyword imgurl was not gotten from
the /proc/cmdline, or something else? You can take a debug.
for i in `cat /proc/cmdline`; do
KEY=`echo \$i |awk -F= '{print \$1}'`
if [ "\$KEY" == 'imgurl' ]; then
VALUE=`echo \$i |awk -F= '{print \$2}'`
if [ "http" == "`echo \$VALUE|awk -F: '{print \$1}'`" ]; then
#NOTE needs FT retry code to scale
#NOTE: should prob have max count
FILENAME=`echo \$VALUE|awk -F/ '{print \$NF}'`
while [ ! -r "\$FILENAME" ]; do
echo Getting \$VALUE...
if ! wget \$VALUE; then
sleep 5 #should be random, exponential for scale
rm -f \$FILENAME
fi
done
fi
Thanks
Best Regards
----------------------------------------------------------------------
Wang Xiaopeng (王晓朋)
IBM China System Technology Laboratory
Tel: 86-10-82453455
Email: [email protected]
Address: 28,ZhongGuanCun Software Park,No.8 Dong Bei Wang West Road,
Haidian District Beijing P.R.China 100193
From: Adalberto Medeiros <[email protected]>
To: [email protected]
Date: 2011-08-17 22:23
Subject: [xcat-user] Malformed image and kernel panic for netboot
(diskless) install with SLES11
Hello all!
I'm trying to perform a stateless install with xCAT 2.6.6 between two LPARS
in a Power 7 server under HMC. MN is SLES11SP1 and the image generated for
the CN is SLES11.
I'm getting errors during the genimage step, when installing some of the
packages. And when performing the stateless install using this image, I get
a kernel panic when loading the udev kernel modules. Can someone help me
identify what might be causing this, or else, if I can consider this a bug?
Here is the detailed steps I performed.
1) I generated the image using the default files under netboot for sles11:
junoltc02:/opt/xcat/share/xcat/netboot/sles # ./genimage -i eth0 -n ibmveth
-o sles11 -p compute
There were some errors on generating the image, that came when processing
some of the installed packages (one example for the rpm package
installation):
Recuperando pacote rpm-4.4.2.3-37.8.ppc64 (127/157), 1,7 MiB (5,1 MiB
descompactado)
Instalando: rpm-4.4.2.3-37.8 [CONCLUÍDO]
Saída de rpm adicional:
Updating etc/sysconfig/services...
Starting SuSEconfig, the SuSE Configuration Tool...
Running module permissions only
Reading /etc/sysconfig and updating the system...
Executing /sbin/conf.d/SuSEconfig.permissions...
/lib/udev/devices/ptmx: don't know what to do with that type of file
/lib/udev/devices/tty: don't know what to do with that type of file
ERROR: not all operations were successful.
Checking permissions and ownerships - using the permissions files
/etc/permissions
/etc/permissions.easy
/etc/permissions.local
setting /var/spool/uucp to uucp:uucp 0755. (wrong owner/group root:root)
setting /var/cache/man to man:root 0755. (wrong owner/group root:root)
setting /dev/zero to root:root 0666. (wrong permissions 0644)
setting /dev/null to root:root 0666. (wrong permissions 0644)
setting /etc/ppp to root:dialout 0750. (wrong owner/group root:root)
setting /lib/udev/devices/ptmx to root:tty 0666. (wrong owner/group
root:root)
setting /lib/udev/devices/tty to root:tty 0666. (wrong owner/group
root:root)
setting /sbin/unix_chkpwd to root:shadow 4755. (wrong owner/group
root:root)
setting /usr/src/packages/SOURCES/ to root:root 1777. (wrong permissions
0755)
The complete log for this is attached on this email.
2) Next I successfully ran packimage:
packimage -o sles11 -p compute -a ppc64
3) nodeset junoltc06 netboot
4) junoltc02:/opt/xcat/share/xcat/netboot/sles # lsdef junoltc06
Object name: junoltc06
arch=ppc64
cons=hmc
currstate=netboot sles11-ppc64-compute
groups=lpar,all,cn
hcp=aphmc5
hwtype=lpar
id=8
initrd=xcat/netboot/sles11/ppc64/compute/initrd-stateless.gz
installnic=eth0
kcmdline=imgurl=
http://9.3.189.138/install/netboot/sles11/ppc64/compute/rootimg.gz
XCAT=9.3.189.138:3001 netdev=eth0
kernel=xcat/netboot/sles11/ppc64/compute/kernel
mac=d2:08:3e:2a:3c:03
mgt=hmc
netboot=yaboot
nfsserver=9.3.189.138
nodetype=ppc,osi
os=sles11
parent=junoltc-fsp-8246-L2C-SN100088A
postbootscripts=otherpkgs
postscripts=syslog,remoteshell,syncfiles
pprofile=junoltc06
primarynic=eth0
profile=compute
provmethod=netboot
status=netbooting
statustime=08-16-2011 13:43:13
tftpserver=9.3.189.138
xcatmaster=9.3.189.138
5) rnetboot junoltc06
In the console for the CN, it starts succesfully the boot process. However,
when loading the kernel, it loads some modules, presents some errors about
udevd and get in kernel panic (I m pasting the relevant error part of the
log and attaching the full thing)
...
NET: Registered protocol family 15
registered taskstats version 1
Freeing unused kernel memory: 576k freed
warning: can't open /etc/fstab: No such file or directory
Creating device nodes with udev
udevd[72]: specified group 'audio' unknown
udevd[72]: specified group 'audio' unknown
udevd[72]: specified group 'disk' unknown
udevd[72]: specified group 'disk' unknown
udevd[72]: specified group 'disk' unknown
udevd[72]: specified group 'uucp' unknown
udevd[72]: specified group 'video' unknown
udevd[72]: specified group 'video' unknown
udevd[72]: specified group 'video' unknown
...
udevd[72]: specified group 'disk' unknown
.. :iiii,
:tLL; .,:...,.
.j;:tLt. :. .;j: ij::::;.
:tt;:::,ii:.jEEGi :tDEEG:.ti,::::;t:
.,,,,,,,,,,,tLEEEEj: tDEEEEDtj;,,,::::::
.:,,::::::,;fDEEEEEL,. .,ijDEDDDEEGt,,,,:,ijj;
.... ..:;jDDLGDEEEGGGfjjjjjjfffLGDEEDEEDLjfGDt,:..
.iftffGDLLDEEEDDDEEDDDDEDEEGLfLjjtti:
,fii;jGDGffLjifLGLjtfffffGDEDGfji
;DEEGffDDDjiii;;ii;,tGDEGjfEEEEf.
,GEGGftiGEEEDt:,;,;;LEEDGjLEEEEEEG
;DEDGjtjfitjGGjfDGj;jLLiitfGDEGjEEDj
fGjjtfLfji;itjfGDjLDfjjjji;tGGLDEEDj
fEDGffjti;ittjjjjtjjjjt:,,iiGGGGjtf.
:fGGLfLLfLGf;i;ijffj,,tjLGDDGLfjtf,
:;tLfjiiffLGDDDGLGEEEEjfGDDGGLfjfff:
.. ,;tLLLLLL,;tijfLGGGjfDEEEEDLLGGGLLLjtjLLfi,.
.jffLLLLGGLfjj;: :,;ijLGLfjGEDDEGtfGGLfjj:.,jjLGGLti;,,;fj,
,fGGGGGGLj,. ;jGGGGLLjffftjLj;.. .,tfGGGGGGGGGGi
,jGDDDj,. :tLGLGGLGDLjt, :iLGGDDDDGLif
,LDDDL, .;LDDDDGfff, ,;iGDDj;,..
;fGGGf, ,;;;;,: tf;jL,
;.:::, Powered by xCAT ,j.:;
_________ ________________
___ __\_ ___ \ / _ \__ ___/
\ \/ / \ \/ / /_\ \| |
> <\ \____/ | \ |
/__/\_ \\______ /\____|__ /____|
\/ \/ \/
Failed to download image, panicing in 5...4...3...2...1...0...
You're dead. rpower nodename reset to play again.
* Did you packimage with -m cpio or -m squashfs?
* If using -m squashfs did you include aufs.ko with geninitrd?
e.g.: -n tg3,squashfs,aufs,loop
Kernel panic - not syncing: Attempted to kill init!
Rebooting in 180 seconds..
Best regards,
--
Adalberto Medeiros
Linux Technology Center - Infrastructure Team Lead
IBM Brazil
Email: [email protected][附件 "genimage.log" 被 Xiao Peng Wang/China/IBM
删除][附件 "cn-console.log" 被 Xiao Peng Wang/China/IBM 删除]
------------------------------------------------------------------------------
Get a FREE DOWNLOAD! and learn more about uberSVN rich system,
user administration capabilities and model configuration. Take
the hassle out of deploying and managing Subversion and the
tools developers use with it. http://p.sf.net/sfu/wandisco-d2d-2
_______________________________________________
xCAT-user mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/xcat-user
<<inline: graycol.gif>>
------------------------------------------------------------------------------ Get a FREE DOWNLOAD! and learn more about uberSVN rich system, user administration capabilities and model configuration. Take the hassle out of deploying and managing Subversion and the tools developers use with it. http://p.sf.net/sfu/wandisco-d2d-2
_______________________________________________ xCAT-user mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/xcat-user
