I'm running RHEL 4 U6 with last night's 1.4.1 nightly build, and sometimes /dev/infiniband/rdma_cm is not created. Its entry is present in /etc/udev/rules.d/90-ib.rules:

KERNEL="umad*", NAME="infiniband/%k"
KERNEL="issm*", NAME="infiniband/%k"
KERNEL="ucm*", NAME="infiniband/%k", MODE="0666"
KERNEL="uverbs*", NAME="infiniband/%k", MODE="0666"
KERNEL="ucma", NAME="infiniband/%k", MODE="0666"
KERNEL="rdma_cm", NAME="infiniband/%k", MODE="0666"

But only some of these device nodes are created:

[11:29] svbu-mpi005:/etc/udev/rules.d % l /dev/infiniband/
total 0
drwxr-xr-x   2 root root      120 May 13 02:39 ./
drwxr-xr-x  10 root root     5740 May 13 09:39 ../
crw-------   1 root root 231,  64 May 13 02:39 issm0
crw-------   1 root root 231,   0 May 13 02:39 umad0
crw-rw-rw-   1 root root 231, 192 May 13 02:39 uverbs0
crw-rw-rw-   1 root root 231, 193 May 13 02:39 uverbs1
[11:29] svbu-mpi005:/etc/udev/rules.d %
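
For anyone wanting to repeat this comparison on another node, here is a minimal sketch that lists which rules have no matching entry in the device directory. The helper names (kernel_patterns, unmatched_patterns, check) are made up for illustration, and Python's fnmatch is used only as an approximation of udev's pattern matching. An unmatched rule is not necessarily a udev problem: udev only creates a node once the kernel has registered a matching device.

```python
import fnmatch
import os
import re

# Matches rule lines such as: KERNEL="ucma", NAME="infiniband/%k", MODE="0666"
# Accepts both the single "=" match syntax shown above and the newer "==".
RULE_RE = re.compile(
    r'KERNEL\s*==?\s*"([^"]+)"\s*,\s*NAME\s*=\s*"infiniband/%k"'
)


def kernel_patterns(rules_text):
    """Return the KERNEL patterns of rules that name nodes under infiniband/."""
    return [m.group(1) for m in RULE_RE.finditer(rules_text)]


def unmatched_patterns(rules_text, present_names):
    """Return the patterns that no existing device node name matches.

    fnmatch stands in for udev's own matching here (an assumption that
    holds for simple patterns like "umad*").
    """
    return [
        pat
        for pat in kernel_patterns(rules_text)
        if not any(fnmatch.fnmatchcase(name, pat) for name in present_names)
    ]


def check(rules_path, dev_dir):
    """Print every rule pattern that has no node in dev_dir."""
    with open(rules_path) as f:
        rules_text = f.read()
    for pat in unmatched_patterns(rules_text, os.listdir(dev_dir)):
        print("no device node matches rule:", pat)
```

Calling check("/etc/udev/rules.d/90-ib.rules", "/dev/infiniband") on the node above should report the ucm*, ucma and rdma_cm rules, given the directory listing shown.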

I have both an IB HCA and an iWARP RNIC in this server:

hca_id: mthca0
        fw_ver:                         1.2.917
        node_guid:                      0005:ad00:0008:bd60
        sys_image_guid:                 0005:ad00:0100:d050
        vendor_id:                      0x05ad
        vendor_part_id:                 25204
        hw_ver:                         0xA0
        board_id:                       MT_03B0120002
        phys_port_cnt:                  1
                port:   1
                        state:                  PORT_ACTIVE (4)
                        max_mtu:                2048 (4)
                        active_mtu:             2048 (4)
                        sm_lid:                 2
                        port_lid:               34
                        port_lmc:               0x00

hca_id: nes0
        node_guid:                      0012:5502:b58c:0000
        sys_image_guid:                 0012:5502:b58c:0000
        vendor_id:                      0x1255
        vendor_part_id:                 256
        hw_ver:                         0x5
        board_id:                       NES020 Board ID
        phys_port_cnt:                  1
                port:   1
                        state:                  PORT_ACTIVE (4)
                        max_mtu:                2048 (4)
                        active_mtu:             2048 (4)
                        sm_lid:                 0
                        port_lid:               1
                        port_lmc:               0x00

I don't see any obvious errors occurring in syslog or dmesg.

What could cause this failure?

--
Jeff Squyres
Cisco Systems

_______________________________________________
ewg mailing list
ewg@lists.openfabrics.org
http://lists.openfabrics.org/cgi-bin/mailman/listinfo/ewg
