[ewg] Bug in IMB included in OFED

2009-05-13 Thread Jeff Squyres
There is a bug in the IMB MPI test suite that is included in OFED (the mpitests SRPM). Who is the maintainer for that? I would file a bug on bugzilla, but there's no category for the MPI tests. It needs the following patch: --- IMB-3.1/src/IMB_window.c.~1~ 2009-05-13 14:44:42.00

Re: [ofa-general] Re: [ewg] /dev/infiniband/rdma_cm not created

2009-05-13 Thread Jeff Squyres
Ok, I figured it out. I have some creative /etc/sysconfig/network-scripts/ifcfg-ib* scripts that may choose to do nothing if no device is present (or some other esoteric, specific-to-jeffs-cluster criteria is met) -- they call "exit 0" in this case. This apparently causes the top-level /et
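The snippet above is cut off before it names the top-level script, so the following is only a sketch of the likely mechanism, assuming the ifcfg-ib* files are sourced (read with ".") rather than executed as separate processes. In that case an "exit 0" in the sourced file ends the shell that sourced it, while "return 0" only ends the sourced file. The temporary file is a hypothetical stand-in for an ifcfg-ib* script.

```shell
#!/bin/sh
# Hypothetical stand-in for an ifcfg-ib* file that bails out early.
tmp=$(mktemp)

# Case 1: the sourced file calls "exit 0". The child shell that sources it
# terminates right there, so "echo reached" never runs.
printf 'exit 0\n' > "$tmp"
sourced_exit=$(sh -c '. "$1"; echo reached' sh "$tmp")

# Case 2: the sourced file calls "return 0". Only the sourced file stops;
# the caller carries on and prints "reached".
printf 'return 0\n' > "$tmp"
sourced_return=$(sh -c '. "$1"; echo reached' sh "$tmp")

rm -f "$tmp"
echo "after exit:   '${sourced_exit}'"
echo "after return: '${sourced_return}'"
```

If this assumption holds for the init script, replacing "exit 0" with "return 0" in the ifcfg-ib* files would let the caller continue to its remaining steps.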

Re: [ewg] /dev/infiniband/rdma_cm not created

2009-05-13 Thread Jeff Squyres
On May 13, 2009, at 3:12 PM, Woodruff, Robert J wrote: Check to see if some other driver failed to load. I think I have seen before that if another driver fails to load, the start script bails out and does not load the other drivers. Perhaps try doing a /etc/init.d/openibd restart manually to s

Re: [ofa-general] Re: [ewg] /dev/infiniband/rdma_cm not created

2009-05-13 Thread Jeff Squyres
On May 13, 2009, at 3:03 PM, Davis, Arlin R wrote: >FWIW, I see the following in /etc/infiniband/openibd.conf: > > ># Load RDMA_CM module >RDMA_CM_LOAD=yes is RDMA_UCM_LOAD=yes ? Yes, sorry I didn't see that one first time around: # Load RDMA_UCM module RDMA_UCM_LOAD=yes What do you see w

RE: [ewg] /dev/infiniband/rdma_cm not created

2009-05-13 Thread Woodruff, Robert J
Check to see if some other driver failed to load. I think I have seen before that if another driver fails to load, the start script bails out and does not load the other drivers. Perhaps try doing a /etc/init.d/openibd restart manually to see if something is failing to load. -Original Messa

RE: [ofa-general] Re: [ewg] /dev/infiniband/rdma_cm not created

2009-05-13 Thread Davis, Arlin R
>FWIW, I see the following in /etc/infiniband/openibd.conf: > > ># Load RDMA_CM module >RDMA_CM_LOAD=yes > is RDMA_UCM_LOAD=yes ? What do you see with "modinfo rdma_cm rdma_ucm" ?___ ewg mailing list ewg@lists.openfabrics.org http://lists.openfabrics
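The two settings discussed above can be read out of the config file with a small helper. This is a sketch only: conf_value is a hypothetical name, not an OFED tool, and the demo runs on a throwaway file containing just the two lines quoted in the thread rather than on a real openibd.conf.

```shell
#!/bin/sh
# conf_value (hypothetical helper): print the last value assigned to KEY
# in a shell-style KEY=value file. Comment lines are ignored because the
# pattern is anchored at the start of the line.
conf_value() {
    key=$1
    file=$2
    sed -n "s/^${key}=//p" "$file" | tail -n 1
}

# Throwaway file holding only the settings quoted in the thread.
demo=$(mktemp)
cat > "$demo" <<'EOF'
# Load RDMA_CM module
RDMA_CM_LOAD=yes
# Load RDMA_UCM module
RDMA_UCM_LOAD=yes
EOF

cm=$(conf_value RDMA_CM_LOAD "$demo")
ucm=$(conf_value RDMA_UCM_LOAD "$demo")
missing=$(conf_value NOT_A_REAL_KEY "$demo")
rm -f "$demo"

echo "RDMA_CM_LOAD=$cm RDMA_UCM_LOAD=$ucm"
```

On a live system the same helper could be pointed at /etc/infiniband/openibd.conf, followed by the "modinfo rdma_cm rdma_ucm" check suggested in the message.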

Re: [ewg] /dev/infiniband/rdma_cm not created

2009-05-13 Thread Jeff Squyres
On May 13, 2009, at 2:54 PM, Jeff Squyres wrote: [11:51] svbu-mpi005:/etc/udev/rules.d % /sbin/lsmod | grep rdma [11:51] svbu-mpi005:/etc/udev/rules.d % What would cause it to not be loaded? I *assumed* (but didn't check) that it is loaded as part of OFED's /etc/init.d/openibd. Is that co

Re: [ewg] /dev/infiniband/rdma_cm not created

2009-05-13 Thread Jeff Squyres
On May 13, 2009, at 2:39 PM, Woodruff, Robert J wrote: Is the driver loaded ? ie., do an /sbin/lsmod to see. Ah ha -- no, it is not: [11:51] svbu-mpi005:/etc/udev/rules.d % /sbin/lsmod | grep rdma [11:51] svbu-mpi005:/etc/udev/rules.d % What would cause it to not be loaded? I *assumed* (bu
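The lsmod check above can be made a little stricter by matching the module name exactly instead of grepping for a substring. The sketch below assumes the usual lsmod layout with the module name in the first column; is_loaded is a hypothetical helper, and the sample listing is made up for illustration, not taken from the machine in the thread.

```shell
#!/bin/sh
# is_loaded (hypothetical helper): read lsmod-style output on stdin and
# succeed only if the first column is exactly the given module name.
is_loaded() {
    awk -v mod="$1" '$1 == mod { found = 1 } END { exit !found }'
}

# Made-up sample listing, used here instead of a live /sbin/lsmod call.
sample='Module                  Size  Used by
rdma_cm                 12345  0
ib_core                 67890  1 rdma_cm'

for m in rdma_cm rdma_ucm; do
    if printf '%s\n' "$sample" | is_loaded "$m"; then
        echo "$m: loaded"
    else
        echo "$m: not loaded"
    fi
done
```

On a real host the input would come from "/sbin/lsmod | is_loaded rdma_cm" instead of the sample text.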

RE: [ewg] /dev/infiniband/rdma_cm not created

2009-05-13 Thread Woodruff, Robert J
Is the driver loaded? I.e., run /sbin/lsmod to see. Also, are there any messages in the dmesg output that would indicate a problem? -Original Message- From: ewg-boun...@lists.openfabrics.org [mailto:ewg-boun...@lists.openfabrics.org] On Behalf Of Jeff Squyres Sent: Wednesday, May 13,

[ewg] /dev/infiniband/rdma_cm not created

2009-05-13 Thread Jeff Squyres
I'm running on rhel4u6 with the 1.4.1 nightly from last night and sometimes /dev/infiniband/rdma_cm is not created. I can see its entry in /etc/udev/rules.d/90-ib.rules: KERNEL="umad*", NAME="infiniband/%k" KERNEL="issm*", NAME="infiniband/%k" KERNEL="ucm*", NAME="infiniband/%k", MODE="0666"

[ewg] RPM version numbers are the same

2009-05-13 Thread Jeff Squyres
Why are the RPM version numbers the same between rc5 and the current 1.4.1 nightlies? -- Jeff Squyres Cisco Systems

[ewg] Re: OFED 1.4.1-rc5 symbol disagreements

2009-05-13 Thread Jon Mason
On Wed, May 13, 2009 at 10:19:55AM +0300, Or Gerlitz wrote: > Hi OGC gang, > > May you guys spare the general list from your ofed related postings? I > don't see any reason for them to be sent to this list nor how does it > serve you, thx Mailing list police? If you do not see any purpose to

[ewg] Re: OFED 1.4.1-rc5 symbol disagreements

2009-05-13 Thread Or Gerlitz
Hi OGC gang, May you guys spare the general list from your ofed related postings? I don't see any reason for them to be sent to this list nor how does it serve you, thx Or. Brian M. Rzycki wrote: I downloaded and installed OFED-1.4.1-rc5.tgz on the machine. I configured one of the Mellanox