Hi All,

First, I am new to this list, so let me introduce myself. I am primarily a statistician who runs a small high performance cluster in an academic department. Because I am first and foremost an academic, I have tried to automate as much of the cluster maintenance as I can, so that I can spend as little time tinkering with it as possible. Hence I decided to use FAI!

The original cluster was composed of Dell PE1950 servers (with a PE2950 fileserver). This all went together very well, and FAI performed really well for installing the 5 execution nodes. However, I have just recently purchased two more execution nodes, this time Dell R410s as the 1950s are now obsolete.

As you will see from the log below, the PXE boot works fine, up until the root filesystem is mounted. I then get a kernel panic. Can anyone suggest why?

Cheers,

Chris

Log snippet: (full log at http://www2.warwick.ac.uk/fac/sci/statistics/staff/research/jewell/fai-fail-log.txt)

Begin: Loading essential drivers... ...
[ 5.369859] md: linear personality registered for level -1
[ 5.381653] Fusion MPT base driver 3.04.10
[ 5.384557] md: multipath personality registered for level -4
[ 5.386658] md: raid0 personality registered for level 0
[ 5.389990] md: raid1 personality registered for level 1
[ 5.392157] xor: automatically using best checksumming function: generic_sse
[ 5.392448] Adding 4114496k swap on /dev/ramzswap0. Priority:100 extents:1 across:4114496k SSD
[ 5.417716] Copyright (c) 1999-2008 LSI Corporation
[ 5.433996] Fusion MPT SAS Host driver 3.04.10
[ 5.439527] generic_sse: 5934.800 MB/sec
[ 5.444072] xor: using function: generic_sse (5934.800 MB/sec)
[ 5.452725] mptsas 0000:02:00.0: PCI INT A -> GSI 32 (level, low) -> IRQ 32
[ 5.460888] async_tx: api initialized (async)
[ 5.519340] usb 5-1: new low speed USB device using uhci_hcd and address 2
[ 5.629048] raid6: int64x1 1416 MB/s
[ 5.712992] usb 5-1: configuration #1 chosen from 1 choice
[ 5.744145] usbcore: registered new interface driver hiddev
[ 5.762954] input: Avocent Dell 03R874 as /devices/pci0000:00/0000:00:1d.0/usb5/5-1/5-1:1.0/input/input2
[ 5.772446] generic-usb 0003:0624:0294.0001: input,hidraw0: USB HID v1.10 Keyboard [Avocent Dell 03R874] on usb-0000:00:1d.0-1/input0
[ 5.798580] raid6: int64x2 1825 MB/s
[ 5.803894] input: Avocent Dell 03R874 as /devices/pci0000:00/0000:00:1d.0/usb5/5-1/5-1:1.1/input/input3
[ 5.813402] generic-usb 0003:0624:0294.0002: input,hidraw1: USB HID v1.10 Mouse [Avocent Dell 03R874] on usb-0000:00:1d.0-1/input1
[ 5.825095] usbcore: registered new interface driver usbhid
[ 5.830642] usbhid: v2.6:USB HID core driver
[ 5.968148] raid6: int64x4 1229 MB/s
[ 6.137669] raid6: int64x8 1845 MB/s
[ 6.307209] raid6: sse2x1 7953 MB/s
[ 6.476759] raid6: sse2x2 9290 MB/s
[ 6.646301] raid6: sse2x4 10607 MB/s
[ 6.650291] raid6: using algorithm sse2x4 (10607 MB/s)
[ 6.655448] mptbase: ioc0: Initiating bringup
[ 6.663059] md: raid6 personality registered for level 6
[ 6.668376] md: raid5 personality registered for level 5
[ 6.673661] md: raid4 personality registered for level 4
[ 6.683360] md: raid10 personality registered for level 10
Done.
Begin: Running /scripts/init-premount ...
Done.
Begin: Mounting root file system... ...
Begin: Running /scripts/live-premount ...
Done.
[ 8.221965] ioc0: LSISAS1068E B3: Capabilities={Initiator}
[ 25.878441] scsi4 : ioc0: LSISAS1068E B3, FwRev=00192f00h, Ports=1, MaxQ=266, IRQ=32
[ 25.922536] mptsas: ioc0: attaching sata device: fw_channel 0, fw_id 0, phy 0, sas_addr 0x1221000000000000
[ 25.934580] scsi 4:0:0:0: Direct-Access ATA WDC WD1602ABKS-1 3B04 PQ: 0 ANSI: 5
[ 25.944316] sd 4:0:0:0: Attached scsi generic sg1 type 0
[ 25.950718] sd 4:0:0:0: [sda] 312500000 512-byte logical blocks: (160 GB/149 GiB)
[ 25.966492] sd 4:0:0:0: [sda] Write Protect is off
[ 25.973310] sd 4:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
[ 25.993370] sda: sda1 sda2
[ 26.017709] sd 4:0:0:0: [sda] Attached SCSI disk
ata_id[534]: HDIO_GET_IDENTITY failed for '/dev/.tmp-block-8:0'
[ 26.125158] Kernel panic - not syncing: Attempted to kill init!
[ 26.131052] Pid: 1, comm: init Not tainted 2.6.31-20-generic #58-Ubuntu
[ 26.137640] Call Trace:
[ 26.140082] [<ffffffff8152a0dd>] panic+0x73/0x12b
[ 26.144857] [<ffffffff81120fe4>] ? __fput+0x194/0x210
[ 26.149971] [<ffffffff8106039b>] find_new_reaper+0x9b/0xa0
[ 26.155520] [<ffffffff81060f7d>] forget_original_parent+0x3d/0x290
[ 26.161756] [<ffffffff8106058c>] ? put_files_struct+0xbc/0xe0
[ 26.167562] [<ffffffff810611e6>] exit_notify+0x16/0x1c0
[ 26.172848] [<ffffffff810619c5>] do_exit+0x1c5/0x360
[ 26.177876] [<ffffffff81061ba9>] do_group_exit+0x49/0xc0
[ 26.183249] [<ffffffff81061c32>] sys_exit_group+0x12/0x20
[ 26.188712] [<ffffffff81012082>] system_call_fastpath+0x16/0x1b
