Lukas Razik <[email protected]> wrote
> Roland Dreier <[email protected]> wrote:

> 
>>  If it's possible, it would be useful to try with the vanilla kernel
>>  and all upstream
>>  modules too.  Otherwise I can't rule out the possiblity that we're 
>>  chasing a
>>  bug that OFED introduces.
>
> BTW: I'm still trying to get the new firmware for my adapters. Maybe it 
> clicks tomorrow...
> 

Hello Vladimir and Roland,

with kindly help of the Mellanox support team I updated the FW of my adapters.
It seems that the firmware wasn't the problem. I want to summarize the facts:

I've a
- Sun Enterprise T5120 SPARC Server
- Debian 6.0.3
- linux-2.6.39.4 vanilla and sparc64
- OFED-1.5.4-rc4

If I execute 
# ibv_devinfo 
hca_id: mlx4_0
        transport:                      InfiniBand (0)
        fw_ver:                         2.9.1000
        node_guid:                      0003:ba00:0100:b1f0
        sys_image_guid:                 0003:ba00:0100:b1f3
        vendor_id:                      0x03ba
        vendor_part_id:                 25418
        hw_ver:                         0xA0
        board_id:                       SUN0070000001
        phys_port_cnt:                  2 

I get this BUG message from the kernel:
[ 1032.739077] swap_free: Bad swap file entry 100005e000061800
[ 1032.739206] BUG: Bad page map in process ibv_devinfo  pte:bc0000c300104848 
pmd:00fc947c
[ 1032.739324] addr:fffff80100114000 vm_flags:000844fa anon_vma:          
(null) mapping:fffff807f5d1c930 index:6180082
[ 1032.739503] vma->vm_file->f_op->mmap: ib_uverbs_mmap+0x8/0x38 [ib_uverbs]
[ 1032.739551] Call Trace:
[ 1032.739589]  [00000000004cd430] unmap_vmas+0x514/0x7f4
[ 1032.739641]  [00000000004d114c] unmap_region+0xb4/0x164
[ 1032.739690]  [00000000004d2198] do_munmap+0x2a8/0x31c
[ 1032.739744]  [000000000042d340] SyS_64_munmap+0x88/0xa8
[ 1032.739800]  [0000000000406154] linux_sparc_syscall+0x34/0x44
[ 1032.739845] Disabling lock debugging due to kernel taint

I also get such BUG messages when I try to use OpenMPI over Infiniband and 
mpirun dies.
Same with:
- OFED-1.5.3.2 and linux-2.6.39.4 vanilla
- OFED-1.5.3.2 and linux-2.6.38

That's the whole conversation:
http://thread.gmane.org/gmane.linux.drivers.rdma/10157/focus=10210

---

BTW:
I can't build ofa-1.5.4 kernel modules for linux-3.1.1:
http://thread.gmane.org/gmane.linux.drivers.rdma/10192

Regards
Lukas
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to [email protected]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Reply via email to