Josh,

I added the debugging code. It turns out NFS looks fine. I also manually
verified the NFS share:

  [root@blade14 ~]# mkdir nfstest
  [root@blade14 ~]# mount 172.20.0.1:/opt/image/x86 nfstest/
  [root@blade14 ~]# cd nfstest/
  [root@blade14 nfstest]# mkdir writetest



How can I debug the partimage save operation?

Thanks,
John Ma
Marist College




From:   Josh Thompson <josh_thomp...@ncsu.edu>
To:     vcl-user@incubator.apache.org
Date:   04/01/2011 09:06 AM
Subject:        Re: VCL2.2 + xCAT2.5 on bladecenter



-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

John,

The partimageng postscript mounts an image store via NFS at /install.  The NFS
server and path are specified in the xCAT site table as IMAGELIBSERVER and
IMAGELIBINSTALLDIR.  More info about this part is at the bottom of our wiki
page explaining how to add partimage support to xCAT.

Do you have your image store exported read/write via NFS and available to the
client nodes?

As a test, you could modify the partimageng script to output more debugging
info.  You could modify the mount command on line 144 to be:

logger -t xcat "Attempting to mount image store: $IMAGELIBSERVER:$IMAGELIBINSTALLDIR"
if ! mount -o nfsvers=3,tcp,nolock,rw $IMAGELIBSERVER:$IMAGELIBINSTALLDIR /install; then
    echo "CRITICAL ERROR: Failed to mount image store at $IMAGELIBSERVER:$IMAGELIBINSTALLDIR; unable to save image"
    logger -t xcat "CRITICAL ERROR: Failed to mount image store at $IMAGELIBSERVER:$IMAGELIBINSTALLDIR; unable to save image"
    sleep 3
    exit 1
fi
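
A mount that succeeds does not by itself show that the directory partimage
writes into exists on the share and accepts writes. As a rough sketch, a check
like the following could run after the mount and before partimage. The function
name check_save_target and the probe file name are made up for illustration and
are not part of partimageng; the sketch also assumes dirname and touch are
available in the installer environment.

```shell
# Hypothetical helper (not part of partimageng): confirm that the directory
# an image file will be written to exists and accepts writes.
# Argument: full path of the image file that is about to be saved.
check_save_target() {
    target=$1
    dir=$(dirname "$target")

    # The directory must already exist under the mount point.
    if [ ! -d "$dir" ]; then
        echo "save directory $dir does not exist"
        return 1
    fi

    # Try to create and remove a small probe file to prove write access.
    probe="$dir/.writetest.$$"
    if ! touch "$probe" 2>/dev/null; then
        echo "save directory $dir is not writable"
        return 2
    fi
    rm -f "$probe"

    echo "save directory $dir is writable"
    return 0
}
```

The result could be sent to syslog next to the other messages, for example with
logger -t xcat "$(check_save_target /install/image/x86/centos5image-blade08mar2466-v0.gz)".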

Josh

On Thursday March 31, 2011, John Ma wrote:
> Josh,
> 
> Thanks again for the help. I configured an anonymous ftp share of
> /install, and that got me past the previous error. Now I am stuck here:
>   Mar 30 21:06:53 blade08 blade08 xcat: running partimage -z1 -f3 -odbc save /dev/sda1 /install/image/x86/centos5image-blade08mar2466-v0.gz
>   Mar 30 21:06:58 blade08 blade08 xcat: partimage exited with a non-zero status, failing
>   Mar 30 21:06:58 blade08 blade08 xcat: partimage-ng failed with exit code 1
>   Mar 30 21:06:58 blade08 blade08 init: rc3 main process (1166) killed by TERM signal
> 
> Blade08 then rebooted itself and looped again. partimage's save location
> /install/image/x86/.. doesn't seem right to me, but how do I configure it to
> use nfs?
> See the attached log file for more details (the clock on blade08 is off,
> or maybe UTC).
> 
> 
> Thanks,
> John
> 
> 
> 
> 
> 
> 
> From:   Josh Thompson <josh_thomp...@ncsu.edu>
> To:     vcl-user@incubator.apache.org
> Date:   03/31/2011 03:29 PM
> Subject:        Re: VCL2.2 + xCAT2.5 on bladecenter
> 
> 
> 
> John,
> 
> Sorry to take so long to get back to you.
> 
> I didn't even realize this until digging through xcatdsklspost, but your
> management node needs to be sharing out /install via ftp.  I'm assuming xcat
> sets this up because I don't remember setting that up manually.  The following
> line is from xcatdsklspost:
> 
> wget -l inf -N -r --waitretry=10 --random-wait --retry-connrefused -t 0 -T 60 ftp://$SIP/postscripts 2> /tmp/wget.log
> 
> $SIP is obtained earlier in the script from some dhcp information.
> 
> The next line is where your screenshot shows the first error:
> 
> mv $SIP/postscripts/* /xcatpost;
> 
> The wget command should try forever until it downloads everything under
> ftp://$SIP/postscripts.  The fact that you are getting past wget, but the move
> is failing for $SIP/postscripts/* makes me think you don't have anything under
> ftp://$SIP/postscripts.  Can you try using a normal ftp client to browse
> ftp://172.20.101.140/postscripts?  It may be that the ftp server is sharing
> out the wrong directory.
> 
> Josh
> 
> On Friday March 25, 2011, John Ma wrote:
> > Josh,
> > 
> > I made some progress, but I am stuck again, this time at the reboot of the
> > machine being captured. The machine apparently cannot find postscripts.
> > Any idea how to fix it or what to try next?
> > 
> > I placed partimageng in /install/postscripts on our VCL (web, db, and mgt
> > code) server - Blade14 (172.20.101.140). The machine being captured is
> > blade08 (172.20.101.80).
> > 
> > Here is the screenshot:
> > 
> > Here is the pxe boot config file:
> > [root@blade14 ~]# cat /tftpboot/pxelinux.cfg/blade08
> > #image image-x86-centos5image-blade08mar2466-v0
> > DEFAULT xCAT
> > LABEL xCAT
> >   KERNEL xcat/image/x86/vmlinuz
> >   APPEND initrd=xcat/image/x86/initrd.img imgurl=http://blade14//install/image/x86/installer_files/rootimg.gz image=/install/image/x86/centos5image-blade08mar2466-v0.img blocks=512 action=save installnic=eth0 reboot  noipv6
> >   IPAPPEND 2
> > [root@blade14 ~]#
> > 
> > Thanks,
> > John Ma
> > Marist College
> > 
> > 
> > 
> > 
> > From:   Josh Thompson <josh_thomp...@ncsu.edu>
> > To:     vcl-user@incubator.apache.org
> > Date:   03/22/2011 12:15 PM
> > Subject:        Re: VCL2.2 + xCAT2.5 on bladecenter
- -- 
- -------------------------------
Josh Thompson
VCL Developer
North Carolina State University

my GPG/PGP key can be found at pgp.mit.edu
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v2.0.16 (GNU/Linux)

iEYEARECAAYFAk2VzaAACgkQV/LQcNdtPQPPfwCfZF5WlXYUnvLAV6XXiPG4ENQe
k7MAnAgiDIaXKzr8Lr9dClRuVGp6peaK
=MuU1
-----END PGP SIGNATURE-----

Apr  1 09:19:09 blade14 dhcpd: DHCPDISCOVER from 00:1a:64:33:b1:2c via eth0
Apr  1 09:19:09 blade14 dhcpd: DHCPOFFER on 172.20.101.80 to 00:1a:64:33:b1:2c via eth0
Apr  1 09:19:13 blade14 dhcpd: DHCPREQUEST for 172.20.101.80 (172.20.101.140) from 00:1a:64:33:b1:2c via eth0
Apr  1 09:19:13 blade14 dhcpd: DHCPACK on 172.20.101.80 to 00:1a:64:33:b1:2c via eth0
Apr  1 09:19:13 blade14 atftpd[3280]: Serving pxelinux.0 to 172.20.101.80:2070
Apr  1 09:19:13 blade14 atftpd[3280]: Serving pxelinux.0 to 172.20.101.80:2071
Apr  1 09:19:14 blade14 atftpd[3280]: Serving pxelinux.cfg/01-00-1a-64-33-b1-2c to 172.20.101.80:57089
Apr  1 09:19:14 blade14 atftpd[3280]: Serving pxelinux.cfg/AC146550 to 172.20.101.80:57090
Apr  1 09:19:14 blade14 atftpd[3280]: Serving xcat/image/x86/vmlinuz to 172.20.101.80:57091
Apr  1 09:19:14 blade14 atftpd[3280]: Serving xcat/image/x86/initrd.img to 172.20.101.80:57092
Apr  1 09:19:26 blade14 dhcpd: DHCPDISCOVER from 00:1a:64:33:b1:2c via eth0
Apr  1 09:19:26 blade14 dhcpd: DHCPOFFER on 172.20.101.80 to 00:1a:64:33:b1:2c via eth0
Apr  1 09:19:26 blade14 named[2612]: client 172.20.101.140#39211: request has invalid signature: TSIG xcat_key: tsig verify failure (BADKEY)
Apr  1 09:19:26 blade14 dhcpd: Unable to add forward map from blade08.vcl22.marist.edu to 172.20.101.80: bad DNS key
Apr  1 09:19:26 blade14 dhcpd: DHCPREQUEST for 172.20.101.80 (172.20.101.140) from 00:1a:64:33:b1:2c via eth0
Apr  1 09:19:26 blade14 dhcpd: DHCPACK on 172.20.101.80 to 00:1a:64:33:b1:2c via eth0
Apr  1 09:19:42 blade14 named[2612]: client 172.20.101.140#55484: request has invalid signature: TSIG xcat_key: tsig verify failure (BADKEY)
Apr  1 09:19:42 blade14 dhcpd: Unable to add forward map from blade08.vcl22.marist.edu to 172.20.101.80: bad DNS key
Apr  1 09:19:42 blade14 dhcpd: DHCPREQUEST for 172.20.101.80 from 00:1a:64:33:b1:2c via eth0
Apr  1 09:19:42 blade14 dhcpd: DHCPACK on 172.20.101.80 to 00:1a:64:33:b1:2c via eth0
Apr  1 09:19:51 blade14 xCAT: xCAT: Allowing getpostscript from blade08
Mar 29 21:19:45 blade08 blade08 kernel: imklog 3.14.1, log source = /proc/kmsg started.
Mar 29 21:19:45 blade08 blade08 kernel: Cannot find map file.
Mar 29 21:19:45 blade08 blade08 kernel: No module symbols loaded - kernel modules not enabled.
Mar 29 21:19:45 blade08 blade08 kernel: cannot find any symbols, turning off symbol lookups
Mar 29 21:19:45 blade08 blade08 rsyslogd: [origin software="rsyslogd" swVersion="3.14.1" x-pid="1503" x-info="http://www.rsyslog.com"] restart
Mar 29 21:19:45 blade08 blade08 kernel: Linux version 2.6.18-194.3.1.el5 (mockbu...@x86-004.build.bos.redhat.com) (gcc version 4.1.2 20080704 (Red Hat 4.1.2-48)) #1 SMP Sun May 2 04:17:42 EDT 2010
Mar 29 21:19:45 blade08 blade08 kernel: Command line: initrd=xcat/image/x86/initrd.img imgurl=http://blade14//install/image/x86/installer_files/rootimg.gz image=/install/image/x86/centos5image-blade08mar2466-v0.img blocks=512 action=save installnic=eth0 reboot noipv6 BOOT_IMAGE=xcat/image/x86/vmlinuz BOOTIF=01-00-1a-64-33-b1-2c
Mar 29 21:19:45 blade08 blade08 kernel: BIOS-provided physical RAM map:
Mar 29 21:19:45 blade08 blade08 kernel:  BIOS-e820: 0000000000010000 - 000000000009cc00 (usable)
Mar 29 21:19:45 blade08 blade08 kernel:  BIOS-e820: 000000000009cc00 - 00000000000a0000 (reserved)
Mar 29 21:19:45 blade08 blade08 kernel:  BIOS-e820: 00000000000e0000 - 0000000000100000 (reserved)
Mar 29 21:19:45 blade08 blade08 kernel:  BIOS-e820: 0000000000100000 - 00000000cffbcc80 (usable)
Mar 29 21:19:45 blade08 blade08 kernel:  BIOS-e820: 00000000cffbcc80 - 00000000cffd0000 (ACPI data)
Mar 29 21:19:45 blade08 blade08 kernel:  BIOS-e820: 00000000cffd0000 - 00000000d0000000 (reserved)
Mar 29 21:19:45 blade08 blade08 kernel:  BIOS-e820: 00000000e0000000 - 00000000f0000000 (reserved)
Mar 29 21:19:45 blade08 blade08 kernel:  BIOS-e820: 00000000fec00000 - 0000000100000000 (reserved)
Mar 29 21:19:45 blade08 blade08 kernel:  BIOS-e820: 0000000100000000 - 0000000130000000 (usable)
Mar 29 21:19:45 blade08 blade08 kernel: DMI 2.4 present.
Mar 29 21:19:45 blade08 blade08 kernel: No NUMA configuration found
Mar 29 21:19:45 blade08 blade08 kernel: Faking a node at 0000000000000000-0000000130000000
Mar 29 21:19:45 blade08 blade08 kernel: Bootmem setup node 0 0000000000000000-0000000130000000
Mar 29 21:19:45 blade08 blade08 kernel: Memory for crash kernel (0x0 to 0x0) notwithin permissible range
Mar 29 21:19:45 blade08 blade08 kernel: disabling kdump
Mar 29 21:19:45 blade08 blade08 kernel: ACPI: PM-Timer IO Port: 0x588
Mar 29 21:19:45 blade08 blade08 kernel: ACPI: LAPIC (acpi_id[0x00] lapic_id[0x00] enabled)
Mar 29 21:19:45 blade08 blade08 kernel: Processor #0 6:15 APIC version 20
Mar 29 21:19:45 blade08 blade08 kernel: ACPI: LAPIC (acpi_id[0x01] lapic_id[0x06] enabled)
Mar 29 21:19:45 blade08 blade08 kernel: Processor #6 6:15 APIC version 20
Mar 29 21:19:45 blade08 blade08 kernel: ACPI: LAPIC (acpi_id[0x02] lapic_id[0x01] enabled)
Mar 29 21:19:45 blade08 blade08 kernel: Processor #1 6:15 APIC version 20
Mar 29 21:19:45 blade08 blade08 kernel: ACPI: LAPIC (acpi_id[0x03] lapic_id[0x07] enabled)
Mar 29 21:19:45 blade08 blade08 kernel: Processor #7 6:15 APIC version 20
Mar 29 21:19:45 blade08 blade08 kernel: ACPI: LAPIC_NMI (acpi_id[0x00] dfl dfl lint[0x1])
Mar 29 21:19:45 blade08 blade08 kernel: ACPI: LAPIC_NMI (acpi_id[0x01] dfl dfl lint[0x1])
Mar 29 21:19:45 blade08 blade08 kernel: ACPI: LAPIC_NMI (acpi_id[0x02] dfl dfl lint[0x1])
Mar 29 21:19:45 blade08 blade08 kernel: ACPI: LAPIC_NMI (acpi_id[0x03] dfl dfl lint[0x1])
Mar 29 21:19:45 blade08 blade08 kernel: ACPI: IOAPIC (id[0x0e] address[0xfec00000] gsi_base[0])
Mar 29 21:19:45 blade08 blade08 kernel: IOAPIC[0]: apic_id 14, version 32, address 0xfec00000, GSI 0-23
Mar 29 21:19:45 blade08 blade08 kernel: ACPI: IOAPIC (id[0x0d] address[0xfec80000] gsi_base[24])
Mar 29 21:19:45 blade08 blade08 kernel: IOAPIC[1]: apic_id 13, version 32, address 0xfec80000, GSI 24-47
Mar 29 21:19:45 blade08 blade08 kernel: ACPI: INT_SRC_OVR (bus 0 bus_irq 0 global_irq 2 dfl dfl)
Mar 29 21:19:45 blade08 blade08 kernel: ACPI: INT_SRC_OVR (bus 0 bus_irq 9 global_irq 9 high level)
Mar 29 21:19:45 blade08 blade08 kernel: Setting APIC routing to physical flat
Apr  1 09:19:52 blade14 xCAT: xCAT: Allowing getcredentials ssh_dsa_hostkey from blade08
Mar 29 21:19:45 blade08 blade08 kernel: Using ACPI (MADT) for SMP configuration information
Mar 29 21:19:45 blade08 blade08 kernel: Nosave address range: 000000000009c000 - 000000000009d000
Apr  1 09:19:53 blade14 xCAT: xCAT: Allowing getcredentials ssh_rsa_hostkey from blade08
Mar 29 21:19:45 blade08 blade08 kernel: Nosave address range: 000000000009d000 - 00000000000a0000
Mar 29 21:19:45 blade08 blade08 kernel: Nosave address range: 00000000000a0000 - 00000000000e0000
Mar 29 21:19:45 blade08 blade08 kernel: Nosave address range: 00000000000e0000 - 0000000000100000
Mar 29 21:19:45 blade08 blade08 kernel: Nosave address range: 00000000cffbc000 - 00000000cffbd000
Mar 29 21:19:45 blade08 blade08 kernel: Nosave address range: 00000000cffbd000 - 00000000cffd0000
Mar 29 21:19:45 blade08 blade08 kernel: Nosave address range: 00000000cffd0000 - 00000000d0000000
Mar 29 21:19:45 blade08 blade08 kernel: Nosave address range: 00000000d0000000 - 00000000e0000000
Mar 29 21:19:45 blade08 blade08 kernel: Nosave address range: 00000000e0000000 - 00000000f0000000
Mar 29 21:19:45 blade08 blade08 kernel: Nosave address range: 00000000f0000000 - 00000000fec00000
Mar 29 21:19:45 blade08 blade08 kernel: Nosave address range: 00000000fec00000 - 0000000100000000
Mar 29 21:19:45 blade08 blade08 kernel: Allocating PCI resources starting at d1000000 (gap: d0000000:10000000)
Mar 29 21:19:45 blade08 blade08 kernel: SMP: Allowing 4 CPUs, 0 hotplug CPUs
Mar 29 21:19:45 blade08 blade08 kernel: Built 1 zonelists.  Total pages: 1030066
Mar 29 21:19:45 blade08 blade08 kernel: Kernel command line: initrd=xcat/image/x86/initrd.img imgurl=http://blade14//install/image/x86/installer_files/rootimg.gz image=/install/image/x86/centos5image-blade08mar2466-v0.img blocks=512 action=save installnic=eth0 reboot noipv6 BOOT_IMAGE=xcat/image/x86/vmlinuz BOOTIF=01-00-1a-64-33-b1-2c
Mar 29 21:19:45 blade08 blade08 kernel: Initializing CPU#0
Mar 29 21:19:45 blade08 blade08 kernel: PID hash table entries: 4096 (order: 12, 32768 bytes)
Mar 29 21:19:45 blade08 blade08 kernel: Console: colour VGA+ 80x25
Mar 29 21:19:45 blade08 blade08 kernel: Dentry cache hash table entries: 524288 (order: 10, 4194304 bytes)
Mar 29 21:19:45 blade08 blade08 kernel: Inode-cache hash table entries: 262144 (order: 9, 2097152 bytes)
Mar 29 21:19:45 blade08 blade08 kernel: Checking aperture...
Mar 29 21:19:45 blade08 blade08 kernel: ACPI: DMAR not present
Mar 29 21:19:45 blade08 blade08 kernel: PCI-DMA: Using software bounce buffering for IO (SWIOTLB)
Mar 29 21:19:45 blade08 blade08 kernel: Placing software IO TLB between 0x163c000 - 0x563c000
Mar 29 21:19:45 blade08 blade08 kernel: Memory: 4044092k/4980736k available (2573k kernel code, 149476k reserved, 1305k data, 212k init)
Mar 29 21:19:45 blade08 blade08 kernel: Calibrating delay loop (skipped), value calculated using timer frequency.. 5333.72 BogoMIPS (lpj=2666864)
Mar 29 21:19:45 blade08 blade08 kernel: Security Framework v1.0.0 initialized
Mar 29 21:19:45 blade08 blade08 kernel: SELinux:  Initializing.
Mar 29 21:19:45 blade08 blade08 kernel: selinux_register_security:  Registering secondary module capability
Mar 29 21:19:45 blade08 blade08 kernel: Capability LSM initialized as secondary
Mar 29 21:19:45 blade08 blade08 kernel: Mount-cache hash table entries: 256
Mar 29 21:19:45 blade08 blade08 kernel: CPU: L1 I cache: 32K, L1 D cache: 32K
Mar 29 21:19:45 blade08 blade08 kernel: CPU: L2 cache: 4096K
Mar 29 21:19:45 blade08 blade08 kernel: using mwait in idle threads.
Mar 29 21:19:45 blade08 blade08 kernel: CPU: Physical Processor ID: 0
Mar 29 21:19:45 blade08 blade08 kernel: CPU: Processor Core ID: 0
Mar 29 21:19:45 blade08 blade08 kernel: CPU0: Thermal monitoring enabled (TM1)
Mar 29 21:19:45 blade08 blade08 kernel: SMP alternatives: switching to UP code
Mar 29 21:19:45 blade08 blade08 kernel: ACPI: Core revision 20060707
Mar 29 21:19:45 blade08 blade08 kernel: Using local APIC timer interrupts.
Mar 29 21:19:45 blade08 blade08 kernel: Detected 20.834 MHz APIC timer.
Mar 29 21:19:45 blade08 blade08 kernel: SMP alternatives: switching to SMP code
Mar 29 21:19:45 blade08 blade08 kernel: Booting processor 1/4 APIC 0x6
Mar 29 21:19:45 blade08 blade08 kernel: Initializing CPU#1
Apr  1 09:19:54 blade14 xCAT: xCAT: Allowing getcredentials ssh_root_key from blade08
Mar 29 21:19:45 blade08 blade08 kernel: Calibrating delay using timer specific routine.. 5332.78 BogoMIPS (lpj=2666391)
Mar 29 21:19:45 blade08 blade08 kernel: CPU: L1 I cache: 32K, L1 D cache: 32K
Mar 29 21:19:45 blade08 blade08 kernel: CPU: L2 cache: 4096K
Mar 29 21:19:45 blade08 blade08 kernel: CPU: Physical Processor ID: 3
Mar 29 21:19:45 blade08 blade08 kernel: CPU: Processor Core ID: 0
Mar 29 21:19:45 blade08 blade08 kernel: CPU1: Thermal monitoring enabled (TM1)
Mar 29 21:19:45 blade08 blade08 kernel: Intel(R) Xeon(R) CPU            5150  @ 2.66GHz stepping 06
Mar 29 21:19:45 blade08 blade08 kernel: SMP alternatives: switching to SMP code
Mar 29 21:19:45 blade08 blade08 kernel: Booting processor 2/4 APIC 0x1
Mar 29 21:19:45 blade08 blade08 kernel: Initializing CPU#2
Mar 29 21:19:45 blade08 blade08 kernel: Calibrating delay using timer specific routine.. 5332.86 BogoMIPS (lpj=2666430)
Mar 29 21:19:45 blade08 blade08 kernel: CPU: L1 I cache: 32K, L1 D cache: 32K
Mar 29 21:19:45 blade08 blade08 kernel: CPU: L2 cache: 4096K
Mar 29 21:19:45 blade08 blade08 kernel: CPU: Physical Processor ID: 0
Mar 29 21:19:46 blade08 blade08 xCAT: ssh_dsa_hostkey
Mar 29 21:19:46 blade08 blade08 xCAT: ssh_rsa_hostkey
Mar 29 21:19:47 blade08 blade08 xCAT: ssh_root_key
Mar 29 21:19:47 blade08 blade08 xCAT: start up sshd
Mar 29 21:19:47 blade08 blade08 xCAT: /xcatpost/syncfiles: there is no sync file template for the node
Mar 29 21:19:47 blade08 blade08 kernel: FS-Cache: Loaded
Mar 29 21:19:47 blade08 blade08 kernel: Fusion MPT SPI Host driver 3.04.13rh
Mar 29 21:19:47 blade08 blade08 xcat: Attempting to mount image store: 172.20.0.1:/opt/image/x86
Mar 29 21:19:57 blade08 blade08 xcat: disks: #012../../sda#012../../sda1#012../../sda2
Mar 29 21:19:57 blade08 blade08 xcat: No device specified, trying with guessed device /dev/sda
Mar 29 21:19:58 blade08 blade08 xcat: Getting partitons for sda
Mar 29 21:19:58 blade08 blade08 xcat: number of partitions found: 1
Mar 29 21:19:58 blade08 blade08 xcat: saving individual partitions
Mar 29 21:19:58 blade08 blade08 xcat: working with sda1
Mar 29 21:19:58 blade08 blade08 xcat: running partimage -z1 -f3 -odbc save /dev/sda1 /install/image/x86/centos5image-blade08mar2466-v0.gz
Mar 29 21:20:03 blade08 blade08 xcat: partimage exited with a non-zero status, failing
Mar 29 21:20:03 blade08 blade08 xcat: partimage-ng failed with exit code 1
Mar 29 21:20:03 blade08 blade08 init: rc3 main process (1173) killed by TERM signal
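
The attached log interleaves entries from blade14 and blade08, so it can help
to pull out just the lines for one host and one syslog tag. A minimal sketch,
assuming the log has been saved with one entry per line; the function name
filter_host_tag and the file name messages.log in the usage line are
hypothetical.

```shell
# Hypothetical helper: print the syslog lines on stdin that come from one
# host and carry one tag (matched case-insensitively, so "xcat" also
# matches "xCAT").
# Arguments: $1 = host name, $2 = tag without the trailing colon.
filter_host_tag() {
    awk -v host="$1" -v tag="$2" '
        $4 == host {
            want = tolower(tag) ":"
            # The tag is field 5, or field 6 when the host name is repeated.
            if (tolower($5) == want || tolower($6) == want) print
        }'
}
```

For example, filter_host_tag blade08 xcat < messages.log would keep only the
xcat postscript messages logged by blade08.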
