An alternate explanation is that the provider is not setting device->max_sge_rd properly. rdma_read_chunks_lcl() seems to be the only thing in my copy of the kernel tree that relies on that value. I’ve reproduced the local length errors with CX-2 and CX-3 Pro, which both set that field to 32. If I artificially set that field to 30, I don’t see any issue. Is commit 18ebd40773bf correct?
See my latest patchset "Handle mlx4 max_sge_rd correctly" (v2). It fixes exactly this. Now you will be able to add your "Tested-by:" tag ;) It on Doug to take it... -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to [email protected] More majordomo info at http://vger.kernel.org/majordomo-info.html
