Hi All,
I am pulling my hair out and I'm pretty sure that bald is not "my look".
Please help?
I have an SGI Altix 310 (XE310 4x2.66GHz Quad Xeon, 32GB mem, 2x250GB drv)
running RHEL 4 WS. On one of the blades (there are two "blades" per
enclosure) I have installed OSCAR 5.0 without difficulty, gone through the
image creation, set up network boot, etc. Doing "dynamic DHCP" and rsync,
but I've tried every option available = no joy.
I turn on the compute node. All is well and it PXE boots and gets to TFTP,
obtains the kernel, runs a bit.
I get a few errors: /proc/mounts: No Such file or directory but it
continues. Later, it says it will sleep to give the switch time to
recognise the ethernet card. It never tries to ping anything and it only
uses the lo interface to try to obtain a dhcp lease - which of course,
fails.
So, it would seem that the systeminstaller kernel doesn't have the drivers
for my ethernet card (there are no real errors - how would I know?)? The
problem is that I also tried the UYOK option (and I did remember to press
"setup network boot") and it makes no difference. I get the same exact
problem.
I found a note in the FAQ that has a workaround for OSCAR 4 and 4.1, but hte
link to the suggested kernel is broken and I don't think it would work in
OSCAR 5 anyway.
One thing I do have that is nice is that my "head node" is the exact same
hardware as the compute node. Can I use its kernel? How would I go about
doing this? I don't know where to start. Can I copy over something from my
/boot directory or something like that? Has anyone seen this problem? This
is the 8th OSCAR cluster I have set up, but the first time on this hardware
and also the first time I've had a real problem getting it to image a node.
I'm really frusterated. :(
Below, I've included the output of a few things - hopefully someone can help
me...
Thanks,
Jennifer
Admin, ORNL Institutional Clusters
Oak Ridge National Lab
Output of lsmod
Module Size Used by
nfsd 274017 17
exportfs 8001 1 nfsd
lockd 78449 2 nfsd
nfs_acl 5313 1 nfsd
parport_pc 29569 0
lp 15345 0
parport 44493 2 parport_pc,lp
autofs4 27080 0
i2c_dev 13889 0
i2c_core 28865 1 i2c_dev
sunrpc 176313 12 nfsd,lockd,nfs_acl
ds 21705 0
yenta_socket 23105 0
pcmcia_core 69969 2 ds,yenta_socket
rdma_ucm 14785 0
ib_srp 33861 0
ib_sdp 39269 0
rdma_cm 28625 2 rdma_ucm,ib_sdp
ib_addr 10185 1 rdma_cm
ib_ipoib 60121 0
loop 18641 0
button 9313 0
battery 11465 0
ac 6985 0
md5 5953 1
ipv6 285089 40
ib_mthca 150737 0
ib_umad 19569 0
ib_ucm 20681 0
ib_uverbs 45297 2 rdma_ucm,ib_ucm
ib_cm 40425 3 ib_srp,rdma_cm,ib_ucm
ib_sa 17749 4 ib_srp,rdma_cm,ib_ipoib,ib_cm
ib_mad 43113 4 ib_mthca,ib_umad,ib_cm,ib_sa
ib_core 59713 11
ib_srp,ib_sdp,rdma_cm,ib_ipoib,ib_mthca,ib_umad,ib_ucm,ib_uverbs,ib_cm,ib_sa,ib_mad
e1000 134225 0
dm_snapshot 19329 0
dm_zero 3905 0
dm_mirror 32201 0
ext3 139089 8
jbd 69745 1 ext3
dm_mod 74153 20 dm_snapshot,dm_zero,dm_mirror
ahci 26181 1
libata 125097 1 ahci
usb_storage 71561 0
uhci_hcd 35305 0
ohci_hcd 24657 0
ehci_hcd 33989 0
sd_mod 19649 3
scsi_mod 145297 5 ib_srp,ahci,libata,usb_storage,sd_mod
output of lspci
00:00.0 Host bridge: Intel Corporation 5000X Chipset Memory Controller Hub
(rev 31)
00:02.0 PCI bridge: Intel Corporation 5000 Series Chipset PCI Express x8
Port 2-3 (rev 31)
00:04.0 PCI bridge: Intel Corporation 5000 Series Chipset PCI Express x8
Port 4-5 (rev 31)
00:06.0 PCI bridge: Intel Corporation 5000 Series Chipset PCI Express x8
Port 6-7 (rev 31)
00:08.0 System peripheral: Intel Corporation 5000 Series Chipset DMA Engine
(rev 31)
00:10.0 Host bridge: Intel Corporation 5000 Series Chipset FSB Registers
(rev 31)
00:10.1 Host bridge: Intel Corporation 5000 Series Chipset FSB Registers
(rev 31)
00:10.2 Host bridge: Intel Corporation 5000 Series Chipset FSB Registers
(rev 31)
00:11.0 Host bridge: Intel Corporation 5000 Series Chipset Reserved
Registers (rev 31)
00:13.0 Host bridge: Intel Corporation 5000 Series Chipset Reserved
Registers (rev 31)
00:15.0 Host bridge: Intel Corporation 5000 Series Chipset FBD Registers
(rev 31)
00:16.0 Host bridge: Intel Corporation 5000 Series Chipset FBD Registers
(rev 31)
00:1d.0 USB Controller: Intel Corporation 631xESB/632xESB/3100 Chipset UHCI
USB Controller #1 (rev 09)
00:1d.1 USB Controller: Intel Corporation 631xESB/632xESB/3100 Chipset UHCI
USB Controller #2 (rev 09)
00:1d.2 USB Controller: Intel Corporation 631xESB/632xESB/3100 Chipset UHCI
USB Controller #3 (rev 09)
00:1d.7 USB Controller: Intel Corporation 631xESB/632xESB/3100 Chipset EHCI
USB2 Controller (rev 09)
00:1e.0 PCI bridge: Intel Corporation 82801 PCI Bridge (rev d9)
00:1f.0 ISA bridge: Intel Corporation 631xESB/632xESB/3100 Chipset LPC
Interface Controller (rev 09)
00:1f.2 SATA controller: Intel Corporation 631xESB/632xESB SATA AHCI
Controller (rev 09)
00:1f.3 SMBus: Intel Corporation 631xESB/632xESB/3100 Chipset SMBus
Controller (rev 09)
01:00.0 PCI bridge: Intel Corporation 6311ESB/6321ESB PCI Express Upstream
Port (rev 01)
01:00.3 PCI bridge: Intel Corporation 6311ESB/6321ESB PCI Express to PCI-X
Bridge (rev 01)
02:00.0 PCI bridge: Intel Corporation 6311ESB/6321ESB PCI Express Downstream
Port E1 (rev 01)
02:02.0 PCI bridge: Intel Corporation 6311ESB/6321ESB PCI Express Downstream
Port E3 (rev 01)
04:00.0 Ethernet controller: Intel Corporation 80003ES2LAN Gigabit Ethernet
Controller (Copper) (rev 01)
04:00.1 Ethernet controller: Intel Corporation 80003ES2LAN Gigabit Ethernet
Controller (Copper) (rev 01)
06:00.0 InfiniBand: Mellanox Technologies MT25204 [InfiniHost III Lx HCA]
(rev 20)
08:01.0 VGA compatible controller: ATI Technologies Inc ES1000 (rev 02)
output of cat /proc/pci
PCI devices found:
Bus 0, device 0, function 0:
Class 0600: PCI device 8086:25c0 (rev 49).
IRQ 169.
Bus 0, device 2, function 0:
Class 0604: PCI device 8086:25f7 (rev 49).
IRQ 169.
Master Capable. No bursts. Min Gnt=4.
Bus 0, device 4, function 0:
Class 0604: PCI device 8086:25f8 (rev 49).
IRQ 169.
Master Capable. No bursts. Min Gnt=4.
Bus 0, device 6, function 0:
Class 0604: PCI device 8086:25f9 (rev 49).
IRQ 169.
Master Capable. No bursts. Min Gnt=4.
Bus 0, device 8, function 0:
Class 0880: PCI device 8086:1a38 (rev 49).
IRQ 169.
Non-prefetchable 64 bit memory at 0xfe700000 [0xfe7003ff].
Bus 0, device 16, function 0:
Class 0600: PCI device 8086:25f0 (rev 49).
Bus 0, device 16, function 1:
Class 0600: PCI device 8086:25f0 (rev 49).
Bus 0, device 16, function 2:
Class 0600: PCI device 8086:25f0 (rev 49).
Bus 0, device 17, function 0:
Class 0600: PCI device 8086:25f1 (rev 49).
Bus 0, device 19, function 0:
Class 0600: PCI device 8086:25f3 (rev 49).
Bus 0, device 21, function 0:
Class 0600: PCI device 8086:25f5 (rev 49).
Bus 0, device 22, function 0:
Class 0600: PCI device 8086:25f6 (rev 49).
Bus 0, device 29, function 0:
Class 0c03: PCI device 8086:2688 (rev 9).
IRQ 177.
I/O at 0x1800 [0x181f].
Bus 0, device 29, function 1:
Class 0c03: PCI device 8086:2689 (rev 9).
IRQ 185.
I/O at 0x1820 [0x183f].
Bus 0, device 29, function 2:
Class 0c03: PCI device 8086:268a (rev 9).
IRQ 193.
I/O at 0x1840 [0x185f].
Bus 0, device 29, function 7:
Class 0c03: PCI device 8086:268c (rev 9).
IRQ 177.
Non-prefetchable 32 bit memory at 0xd8e00000 [0xd8e003ff].
Bus 0, device 30, function 0:
Class 0604: PCI device 8086:244e (rev 217).
Master Capable. No bursts. Min Gnt=12.
Bus 0, device 31, function 0:
Class 0601: PCI device 8086:2670 (rev 9).
Bus 0, device 31, function 2:
Class 0106: PCI device 8086:2681 (rev 9).
IRQ 185.
I/O at 0x1890 [0x1897].
I/O at 0x1884 [0x1887].
I/O at 0x1888 [0x188f].
I/O at 0x1880 [0x1883].
I/O at 0x1860 [0x187f].
Non-prefetchable 32 bit memory at 0xd8e00400 [0xd8e007ff].
Bus 0, device 31, function 3:
Class 0c05: PCI device 8086:269b (rev 9).
IRQ 185.
I/O at 0x1100 [0x111f].
Bus 1, device 0, function 0:
Class 0604: PCI device 8086:3500 (rev 1).
IRQ 169.
Master Capable. No bursts. Min Gnt=4.
Bus 1, device 0, function 3:
Class 0604: PCI device 8086:350c (rev 1).
Master Capable. No bursts. Min Gnt=4.
Bus 2, device 0, function 0:
Class 0604: PCI device 8086:3510 (rev 1).
IRQ 169.
Master Capable. No bursts. Min Gnt=4.
Bus 2, device 2, function 0:
Class 0604: PCI device 8086:3518 (rev 1).
IRQ 193.
Master Capable. No bursts. Min Gnt=4.
Bus 4, device 0, function 0:
Class 0200: PCI device 8086:1096 (rev 1).
IRQ 209.
Non-prefetchable 32 bit memory at 0xd8a00000 [0xd8a1ffff].
I/O at 0x2000 [0x201f].
Bus 4, device 0, function 1:
Class 0200: PCI device 8086:1096 (rev 1).
IRQ 185.
Non-prefetchable 32 bit memory at 0xd8a20000 [0xd8a3ffff].
I/O at 0x2020 [0x203f].
Bus 6, device 0, function 0:
Class 0c06: PCI device 15b3:6274 (rev 32).
IRQ 169.
Non-prefetchable 64 bit memory at 0xd8800000 [0xd88fffff].
Prefetchable 64 bit memory at 0xd8000000 [0xd87fffff].
Bus 8, device 1, function 0:
Class 0300: PCI device 1002:515e (rev 2).
IRQ 193.
Master Capable. Latency=66. Min Gnt=8.
Prefetchable 32 bit memory at 0xd0000000 [0xd7ffffff].
I/O at 0x3000 [0x30ff].
Non-prefetchable 32 bit memory at 0xd8b00000 [0xd8b0ffff].
Is UYOK in production?
-------------------------------------------------------------------------
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
_______________________________________________
Oscar-users mailing list
Oscar-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/oscar-users