Re: [ewg] RDS problematic on RC2

2008-01-20 Thread Or Gerlitz
Ralph Campbell wrote: Attached is the patch I sent to Olaf. It basically replaces calls like dma_map_sg() with ib_dma_map_sg() so that the InfiniPath driver can intercept the DMA mapping calls and use kernel virtual addresses instead of physical addresses. The InfiniPath driver uses the host CPU t

Re: [ewg] RDS problematic on RC2

2008-01-17 Thread Ralph Campbell
Attached is the patch I sent to Olaf. It basically replaces calls like dma_map_sg() with ib_dma_map_sg() so that the InfiniPath driver can intercept the DMA mapping calls and use kernel virtual addresses instead of physical addresses. The InfiniPath driver uses the host CPU to copy data in most case

Re: [ewg] RDS problematic on RC2

2008-01-17 Thread Richard Frank
<< If the TCP part is entirely non-working, it might be better to disable it for now rather than have it crash the machine. So far, I have never gotten it to function correctly and it crashes some machines almost immediately. My vote is to disable TCP support (return not supported) - at least

Re: [ewg] RDS problematic on RC2

2008-01-17 Thread Or Gerlitz
Johann George wrote: Oh, and if you're using RDMA - does this happen to be with QLogic HCAs? If so, I just received a patch from Ralph Campbell with some fixes to the way we set up our DMA mapping. RDS in OFED 1.3 does not currently work on the QLogic HCAs due to the way you are setting up DMA

Re: [ewg] RDS problematic on RC2

2008-01-17 Thread Olaf Kirch
On Thursday 17 January 2008 11:57, Johann George wrote: > > That's a remote invalid request error. Were you testing > > with RDMA or without? > > We were using the version that runs over IB. Well, yes. But you can do that with ordinary SENDs, or you can enable RDMA for large data blobs as well. B

Re: [ewg] RDS problematic on RC2

2008-01-17 Thread Johann George
> Oh, and if you're using RDMA - does this happen to be with > QLogic HCAs? If so, I just received a patch from Ralph > Campbell with some fixes to the way we set up our DMA > mapping. RDS in OFED 1.3 does not currently work on the QLogic HCAs due to the way you are setting up DMA mapping. We al

Re: [ewg] RDS problematic on RC2

2008-01-17 Thread Johann George
> That's a remote invalid request error. Were you testing > with RDMA or without? We were using the version that runs over IB. > What user application were you using for testing? qperf. Unfortunately the version that is included in OFED 1.3 RC2 is old due to a problem that Vlad just discovered

Re: [ewg] RDS problematic on RC2

2008-01-16 Thread Olaf Kirch
On Thursday 17 January 2008 04:15, Johann George wrote: > RDS/IB: completion on 10.1.1.205 had status 9, disconnecting and reconnecting > > Note that this is using RDS over IB. Our minimal experience with the > non-IB version of RDS was worse. We only tried it with RC1 and it > crashed one of th

Re: [ewg] RDS problematic on RC2

2008-01-16 Thread Olaf Kirch
On Thursday 17 January 2008 04:15, Johann George wrote: > We've been testing the OFED 1.3 pre-releases on a 12 node cluster here > at UNH-IOL. RDS seemed largely functional (other than problems we > were aware of) on OFED 1.3 RC1. When we installed RC2, RDS stopped > working. A dmesg indicates t

Re: [ewg] RDS problematic on RC2

2008-01-16 Thread Vladimir Sokolovsky
Johann George wrote: We've been testing the OFED 1.3 pre-releases on a 12 node cluster here at UNH-IOL. RDS seemed largely functional (other than problems we were aware of) on OFED 1.3 RC1. When we installed RC2, RDS stopped working. A dmesg indicates the following message repeatedly on the co

Re: [ewg] RDS problematic on RC2

2008-01-16 Thread Richard Frank
copying rds-dev. Johann George wrote: We've been testing the OFED 1.3 pre-releases on a 12 node cluster here at UNH-IOL. RDS seemed largely functional (other than problems we were aware of) on OFED 1.3 RC1. When we installed RC2, RDS stopped working. A dmesg indicates the following message re

[ewg] RDS problematic on RC2

2008-01-16 Thread Johann George
We've been testing the OFED 1.3 pre-releases on a 12 node cluster here at UNH-IOL. RDS seemed largely functional (other than problems we were aware of) on OFED 1.3 RC1. When we installed RC2, RDS stopped working. A dmesg indicates the following message repeatedly on the console: RDS/IB: complet