Is the driver loaded ? ie., do an /sbin/lsmod to see.

Also are there any messages that would indicate a 
problem when you do a dmesg.



-----Original Message-----
From: ewg-boun...@lists.openfabrics.org 
[mailto:ewg-boun...@lists.openfabrics.org] On Behalf Of Jeff Squyres
Sent: Wednesday, May 13, 2009 11:34 AM
To: OpenFabrics General; OpenFabrics EWG
Subject: [ewg] /dev/infiniband/rdma_cm not created

I'm running on rhel4u6 with the 1.4.1 nightly from last night and  
sometimes /dev/infiniband/rdma_cm is not created.  I can see its entry  
in /etc/udev/rules.d/90-ib.rules:

KERNEL="umad*", NAME="infiniband/%k"
KERNEL="issm*", NAME="infiniband/%k"
KERNEL="ucm*", NAME="infiniband/%k", MODE="0666"
KERNEL="uverbs*", NAME="infiniband/%k", MODE="0666"
KERNEL="ucma", NAME="infiniband/%k", MODE="0666"
KERNEL="rdma_cm", NAME="infiniband/%k", MODE="0666"

But only some of these are created:

[11:29] svbu-mpi005:/etc/udev/rules.d % l /dev/infiniband/
total 0
drwxr-xr-x   2 root root      120 May 13 02:39 ./
drwxr-xr-x  10 root root     5740 May 13 09:39 ../
crw-------   1 root root 231,  64 May 13 02:39 issm0
crw-------   1 root root 231,   0 May 13 02:39 umad0
crw-rw-rw-   1 root root 231, 192 May 13 02:39 uverbs0
crw-rw-rw-   1 root root 231, 193 May 13 02:39 uverbs1
[11:29] svbu-mpi005:/etc/udev/rules.d %

I have both an IB HCA and an iWARP RNIC in this server:

hca_id: mthca0
        fw_ver:                         1.2.917
        node_guid:                      0005:ad00:0008:bd60
        sys_image_guid:                 0005:ad00:0100:d050
        vendor_id:                      0x05ad
        vendor_part_id:                 25204
        hw_ver:                         0xA0
        board_id:                       MT_03B0120002
        phys_port_cnt:                  1
                port:   1
                        state:                  PORT_ACTIVE (4)
                        max_mtu:                2048 (4)
                        active_mtu:             2048 (4)
                        sm_lid:                 2
                        port_lid:               34
                        port_lmc:               0x00

hca_id: nes0
        node_guid:                      0012:5502:b58c:0000
        sys_image_guid:                 0012:5502:b58c:0000
        vendor_id:                      0x1255
        vendor_part_id:                 256
        hw_ver:                         0x5
        board_id:                       NES020 Board ID
        phys_port_cnt:                  1
                port:   1
                        state:                  PORT_ACTIVE (4)
                        max_mtu:                2048 (4)
                        active_mtu:             2048 (4)
                        sm_lid:                 0
                        port_lid:               1
                        port_lmc:               0x00

I don't see any obvious errors occurring in syslog or dmesg.

What could cause this failure?

-- 
Jeff Squyres
Cisco Systems

_______________________________________________
ewg mailing list
e...@lists.openfabrics.org
http://lists.openfabrics.org/cgi-bin/mailman/listinfo/ewg
_______________________________________________
general mailing list
general@lists.openfabrics.org
http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general

To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general

Reply via email to