Hi All,

I am pulling my hair out and I'm pretty sure that bald is not "my look".
Please help?

I have an SGI Altix 310 (XE310 4x2.66GHz Quad Xeon, 32GB mem, 2x250GB drv)
running RHEL 4 WS.  On one of the blades (there are two "blades" per
enclosure) I have installed OSCAR 5.0 without difficulty, gone through the
image creation, set up network boot, etc.  I am using "dynamic DHCP" and
rsync, but I have tried every available option with no joy.

I turn on the compute node.  All is well: it PXE boots, gets to TFTP,
obtains the kernel, and runs for a bit.
I get a few errors, such as "/proc/mounts: No Such file or directory", but
it continues.  Later, it says it will sleep to give the switch time to
recognise the ethernet card.  It never tries to ping anything, and it only
uses the lo interface to try to obtain a DHCP lease, which of course
fails.

So it would seem that the systeminstaller kernel doesn't have the drivers
for my ethernet card, although there are no real errors, so how would I
know?  The problem is that I also tried the UYOK option (and I did remember
to press "setup network boot") and it makes no difference.  I get exactly
the same problem.
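
One way to approach the "how would I know?" question is to look up the
card's PCI ID in a kernel's module map.  The following is a minimal sketch,
assuming the kernel in question has a modules.pcimap file in its module tree
(as depmod typically generates for 2.6-era kernels); whether the
systeminstaller boot image actually ships one is an assumption.  The function
name and the sample table are hypothetical and for illustration only; the
8086:1096 ID is the one reported for the ethernet controller in the
/proc/pci output in this message.

```python
PCI_ANY_ID = 0xFFFFFFFF  # wildcard value used in modules.pcimap entries


def modules_for_device(pcimap_text, vendor, device):
    """Return module names whose pcimap entry matches vendor:device."""
    matches = []
    for line in pcimap_text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and the header comment
        fields = line.split()
        if len(fields) < 3:
            continue  # not a well-formed entry
        try:
            map_vendor = int(fields[1], 16)
            map_device = int(fields[2], 16)
        except ValueError:
            continue  # ignore lines whose ID columns are not hex
        vendor_ok = map_vendor in (vendor, PCI_ANY_ID)
        device_ok = map_device in (device, PCI_ANY_ID)
        if vendor_ok and device_ok and fields[0] not in matches:
            matches.append(fields[0])
    return matches


# Illustrative sample only; not copied from a real system.
SAMPLE_PCIMAP = """\
# pci module         vendor     device     subvendor  subdevice  class      class_mask driver_data
e1000                0x00008086 0x00001096 0xffffffff 0xffffffff 0x00000000 0x00000000 0x0
tg3                  0x000014e4 0x00001644 0xffffffff 0xffffffff 0x00000000 0x00000000 0x0
"""

print(modules_for_device(SAMPLE_PCIMAP, 0x8086, 0x1096))
```

To use it against a real kernel, read the text of that kernel's
modules.pcimap and pass it in place of SAMPLE_PCIMAP.  An empty result would
suggest that no module in that kernel claims the card.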

I found a note in the FAQ that has a workaround for OSCAR 4 and 4.1, but the
link to the suggested kernel is broken and I don't think it would work in
OSCAR 5 anyway.

One thing I do have that is nice is that my "head node" is the exact same
hardware as the compute node.  Can I use its kernel?  How would I go about
doing this?  I don't know where to start.  Can I copy over something from my
/boot directory or something like that?  Has anyone seen this problem?  This
is the 8th OSCAR cluster I have set up, but the first time on this hardware
and also the first time I've had a real problem getting it to image a node.
I'm really frustrated. :(

Below, I've included the output of a few things - hopefully someone can help
me...

Thanks,
Jennifer
Admin, ORNL Institutional Clusters
Oak Ridge National Lab

Output of lsmod
Module                  Size  Used by
nfsd                  274017  17
exportfs                8001  1 nfsd
lockd                  78449  2 nfsd
nfs_acl                 5313  1 nfsd
parport_pc             29569  0
lp                     15345  0
parport                44493  2 parport_pc,lp
autofs4                27080  0
i2c_dev                13889  0
i2c_core               28865  1 i2c_dev
sunrpc                176313  12 nfsd,lockd,nfs_acl
ds                     21705  0
yenta_socket           23105  0
pcmcia_core            69969  2 ds,yenta_socket
rdma_ucm               14785  0
ib_srp                 33861  0
ib_sdp                 39269  0
rdma_cm                28625  2 rdma_ucm,ib_sdp
ib_addr                10185  1 rdma_cm
ib_ipoib               60121  0
loop                   18641  0
button                  9313  0
battery                11465  0
ac                      6985  0
md5                     5953  1
ipv6                  285089  40
ib_mthca              150737  0
ib_umad                19569  0
ib_ucm                 20681  0
ib_uverbs              45297  2 rdma_ucm,ib_ucm
ib_cm                  40425  3 ib_srp,rdma_cm,ib_ucm
ib_sa                  17749  4 ib_srp,rdma_cm,ib_ipoib,ib_cm
ib_mad                 43113  4 ib_mthca,ib_umad,ib_cm,ib_sa
ib_core                59713  11 ib_srp,ib_sdp,rdma_cm,ib_ipoib,ib_mthca,ib_umad,ib_ucm,ib_uverbs,ib_cm,ib_sa,ib_mad
e1000                 134225  0
dm_snapshot            19329  0
dm_zero                 3905  0
dm_mirror              32201  0
ext3                  139089  8
jbd                    69745  1 ext3
dm_mod                 74153  20 dm_snapshot,dm_zero,dm_mirror
ahci                   26181  1
libata                125097  1 ahci
usb_storage            71561  0
uhci_hcd               35305  0
ohci_hcd               24657  0
ehci_hcd               33989  0
sd_mod                 19649  3
scsi_mod              145297  5 ib_srp,ahci,libata,usb_storage,sd_mod


Output of lspci
00:00.0 Host bridge: Intel Corporation 5000X Chipset Memory Controller Hub (rev 31)
00:02.0 PCI bridge: Intel Corporation 5000 Series Chipset PCI Express x8 Port 2-3 (rev 31)
00:04.0 PCI bridge: Intel Corporation 5000 Series Chipset PCI Express x8 Port 4-5 (rev 31)
00:06.0 PCI bridge: Intel Corporation 5000 Series Chipset PCI Express x8 Port 6-7 (rev 31)
00:08.0 System peripheral: Intel Corporation 5000 Series Chipset DMA Engine (rev 31)
00:10.0 Host bridge: Intel Corporation 5000 Series Chipset FSB Registers (rev 31)
00:10.1 Host bridge: Intel Corporation 5000 Series Chipset FSB Registers (rev 31)
00:10.2 Host bridge: Intel Corporation 5000 Series Chipset FSB Registers (rev 31)
00:11.0 Host bridge: Intel Corporation 5000 Series Chipset Reserved Registers (rev 31)
00:13.0 Host bridge: Intel Corporation 5000 Series Chipset Reserved Registers (rev 31)
00:15.0 Host bridge: Intel Corporation 5000 Series Chipset FBD Registers (rev 31)
00:16.0 Host bridge: Intel Corporation 5000 Series Chipset FBD Registers (rev 31)
00:1d.0 USB Controller: Intel Corporation 631xESB/632xESB/3100 Chipset UHCI USB Controller #1 (rev 09)
00:1d.1 USB Controller: Intel Corporation 631xESB/632xESB/3100 Chipset UHCI USB Controller #2 (rev 09)
00:1d.2 USB Controller: Intel Corporation 631xESB/632xESB/3100 Chipset UHCI USB Controller #3 (rev 09)
00:1d.7 USB Controller: Intel Corporation 631xESB/632xESB/3100 Chipset EHCI USB2 Controller (rev 09)
00:1e.0 PCI bridge: Intel Corporation 82801 PCI Bridge (rev d9)
00:1f.0 ISA bridge: Intel Corporation 631xESB/632xESB/3100 Chipset LPC Interface Controller (rev 09)
00:1f.2 SATA controller: Intel Corporation 631xESB/632xESB SATA AHCI Controller (rev 09)
00:1f.3 SMBus: Intel Corporation 631xESB/632xESB/3100 Chipset SMBus Controller (rev 09)
01:00.0 PCI bridge: Intel Corporation 6311ESB/6321ESB PCI Express Upstream Port (rev 01)
01:00.3 PCI bridge: Intel Corporation 6311ESB/6321ESB PCI Express to PCI-X Bridge (rev 01)
02:00.0 PCI bridge: Intel Corporation 6311ESB/6321ESB PCI Express Downstream Port E1 (rev 01)
02:02.0 PCI bridge: Intel Corporation 6311ESB/6321ESB PCI Express Downstream Port E3 (rev 01)
04:00.0 Ethernet controller: Intel Corporation 80003ES2LAN Gigabit Ethernet Controller (Copper) (rev 01)
04:00.1 Ethernet controller: Intel Corporation 80003ES2LAN Gigabit Ethernet Controller (Copper) (rev 01)
06:00.0 InfiniBand: Mellanox Technologies MT25204 [InfiniHost III Lx HCA] (rev 20)
08:01.0 VGA compatible controller: ATI Technologies Inc ES1000 (rev 02)

Output of cat /proc/pci
PCI devices found:
 Bus  0, device   0, function  0:
   Class 0600: PCI device 8086:25c0 (rev 49).
     IRQ 169.
 Bus  0, device   2, function  0:
   Class 0604: PCI device 8086:25f7 (rev 49).
     IRQ 169.
     Master Capable.  No bursts.  Min Gnt=4.
 Bus  0, device   4, function  0:
   Class 0604: PCI device 8086:25f8 (rev 49).
     IRQ 169.
     Master Capable.  No bursts.  Min Gnt=4.
 Bus  0, device   6, function  0:
   Class 0604: PCI device 8086:25f9 (rev 49).
     IRQ 169.
     Master Capable.  No bursts.  Min Gnt=4.
 Bus  0, device   8, function  0:
   Class 0880: PCI device 8086:1a38 (rev 49).
     IRQ 169.
     Non-prefetchable 64 bit memory at 0xfe700000 [0xfe7003ff].
 Bus  0, device  16, function  0:
   Class 0600: PCI device 8086:25f0 (rev 49).
 Bus  0, device  16, function  1:
   Class 0600: PCI device 8086:25f0 (rev 49).
 Bus  0, device  16, function  2:
   Class 0600: PCI device 8086:25f0 (rev 49).
 Bus  0, device  17, function  0:
   Class 0600: PCI device 8086:25f1 (rev 49).
 Bus  0, device  19, function  0:
   Class 0600: PCI device 8086:25f3 (rev 49).
 Bus  0, device  21, function  0:
   Class 0600: PCI device 8086:25f5 (rev 49).
 Bus  0, device  22, function  0:
   Class 0600: PCI device 8086:25f6 (rev 49).
 Bus  0, device  29, function  0:
   Class 0c03: PCI device 8086:2688 (rev 9).
     IRQ 177.
     I/O at 0x1800 [0x181f].
 Bus  0, device  29, function  1:
   Class 0c03: PCI device 8086:2689 (rev 9).
     IRQ 185.
     I/O at 0x1820 [0x183f].
 Bus  0, device  29, function  2:
   Class 0c03: PCI device 8086:268a (rev 9).
     IRQ 193.
     I/O at 0x1840 [0x185f].
 Bus  0, device  29, function  7:
   Class 0c03: PCI device 8086:268c (rev 9).
     IRQ 177.
     Non-prefetchable 32 bit memory at 0xd8e00000 [0xd8e003ff].
 Bus  0, device  30, function  0:
   Class 0604: PCI device 8086:244e (rev 217).
     Master Capable.  No bursts.  Min Gnt=12.
 Bus  0, device  31, function  0:
   Class 0601: PCI device 8086:2670 (rev 9).
 Bus  0, device  31, function  2:
   Class 0106: PCI device 8086:2681 (rev 9).
     IRQ 185.
     I/O at 0x1890 [0x1897].
     I/O at 0x1884 [0x1887].
     I/O at 0x1888 [0x188f].
     I/O at 0x1880 [0x1883].
     I/O at 0x1860 [0x187f].
     Non-prefetchable 32 bit memory at 0xd8e00400 [0xd8e007ff].
 Bus  0, device  31, function  3:
   Class 0c05: PCI device 8086:269b (rev 9).
     IRQ 185.
     I/O at 0x1100 [0x111f].
 Bus  1, device   0, function  0:
   Class 0604: PCI device 8086:3500 (rev 1).
     IRQ 169.
     Master Capable.  No bursts.  Min Gnt=4.
 Bus  1, device   0, function  3:
   Class 0604: PCI device 8086:350c (rev 1).
     Master Capable.  No bursts.  Min Gnt=4.
 Bus  2, device   0, function  0:
   Class 0604: PCI device 8086:3510 (rev 1).
     IRQ 169.
     Master Capable.  No bursts.  Min Gnt=4.
 Bus  2, device   2, function  0:
   Class 0604: PCI device 8086:3518 (rev 1).
     IRQ 193.
     Master Capable.  No bursts.  Min Gnt=4.
 Bus  4, device   0, function  0:
   Class 0200: PCI device 8086:1096 (rev 1).
     IRQ 209.
     Non-prefetchable 32 bit memory at 0xd8a00000 [0xd8a1ffff].
     I/O at 0x2000 [0x201f].
 Bus  4, device   0, function  1:
   Class 0200: PCI device 8086:1096 (rev 1).
     IRQ 185.
     Non-prefetchable 32 bit memory at 0xd8a20000 [0xd8a3ffff].
     I/O at 0x2020 [0x203f].
 Bus  6, device   0, function  0:
   Class 0c06: PCI device 15b3:6274 (rev 32).
     IRQ 169.
     Non-prefetchable 64 bit memory at 0xd8800000 [0xd88fffff].
     Prefetchable 64 bit memory at 0xd8000000 [0xd87fffff].
 Bus  8, device   1, function  0:
   Class 0300: PCI device 1002:515e (rev 2).
     IRQ 193.
     Master Capable.  Latency=66.  Min Gnt=8.
     Prefetchable 32 bit memory at 0xd0000000 [0xd7ffffff].
     I/O at 0x3000 [0x30ff].
     Non-prefetchable 32 bit memory at 0xd8b00000 [0xd8b0ffff].
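
The listing above is long, so a small filter can help pick out the network
devices and their vendor:device IDs.  This is a rough sketch that assumes
the /proc/pci text layout shown above and treats class 0200 as the Ethernet
controller class; the function name and the shortened sample are
illustrative only.

```python
import re

# Patterns for the two line types of interest in /proc/pci output.
BUS_RE = re.compile(r"Bus\s+(\d+), device\s+(\d+), function\s+(\d+):")
DEV_RE = re.compile(
    r"Class ([0-9a-fA-F]{4}): PCI device ([0-9a-fA-F]{4}):([0-9a-fA-F]{4})"
)


def devices_by_class(proc_pci_text, wanted_class="0200"):
    """Return ((bus, device, function), vendor, device_id) per matching device."""
    found = []
    location = None
    for line in proc_pci_text.splitlines():
        bus = BUS_RE.search(line)
        if bus:
            # Remember where we are; the Class line follows on the next line.
            location = tuple(int(n) for n in bus.groups())
            continue
        dev = DEV_RE.search(line)
        if dev and location is not None and dev.group(1).lower() == wanted_class:
            found.append((location, dev.group(2).lower(), dev.group(3).lower()))
    return found


# Shortened excerpt of the listing above, used here as sample input.
SAMPLE_PROC_PCI = """\
PCI devices found:
  Bus  0, device  31, function  3:
    Class 0c05: PCI device 8086:269b (rev 9).
      IRQ 185.
  Bus  4, device   0, function  0:
    Class 0200: PCI device 8086:1096 (rev 1).
      IRQ 209.
  Bus  4, device   0, function  1:
    Class 0200: PCI device 8086:1096 (rev 1).
      IRQ 185.
"""

for location, vendor, device_id in devices_by_class(SAMPLE_PROC_PCI):
    print(location, vendor + ":" + device_id)
```

On this excerpt it reports the two functions at bus 4, device 0, both with
ID 8086:1096, which is the ID to look for in a candidate kernel's driver
list.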



Is UYOK in production?
_______________________________________________
Oscar-users mailing list
Oscar-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/oscar-users