RE: CUDA not working with ib_write_bw

Steve Wise Wed, 04 Feb 2015 07:17:02 -0800

I want to RDMA to/from gpu memory from a remote peer.  So the data arrives at 
the RDMA device from the wire, and is DMA'd directly
to GPU memory of the peer adapter in the system.




> -----Original Message-----
> From: Zhangfengwei [mailto:[email protected]]
> Sent: Wednesday, February 04, 2015 12:42 AM
> To: Steve Wise; [email protected]
> Cc: [email protected]; Stephen Bates
> Subject: 答复: CUDA not working with ib_write_bw
> 
> Hi Steve，
> 
> Do you want to transfer the data to the gpu buffer directly ? I guess the DMA 
> seems not to do this all by itself.
> 
> -----邮件原件-----
> 发件人: [email protected] 
> [mailto:[email protected]] 代表 Steve Wise
> 发送时间: 2015年2月2日 23:59
> 收件人: [email protected]
> 抄送: [email protected]; Stephen Bates
> 主题: CUDA not working with ib_write_bw
> 
> Hey Gil,
> 
> I'm trying to test iWARP RDMA <-> GPU memory and I compiled CUDA into the 
> top-o-tree perftest repo.  My Nvidia setup is working
> because I have verified it with another gpu rdma package (donard from pmc).  
> But when using ib_write_bw the server gets an error
> registering the gpu memory with the device.  Below is the output from 
> ib_write_bw.  I instrumented the kernel registration path
and I find
> that get_user_pages() is returning -14 (-EFAULT) when called by ib_umem_get().
> 
> Q:  Is this supposed to work with the upstream RDMA drivers?   I'm using a 
> 3.16.3 kernel.org kernel.
> 
> Thanks,
> 
> Steve
> ---
> 
> [root@stevo1 perftest]# ./ib_write_bw -R --use_cuda
> 
> ************************************
> * Waiting for client to connect... *
> ************************************
> ---------------------------------------------------------------------------------------
>                     RDMA_Write BW Test
>  Dual-port       : OFF          Device         : cxgb4_1
>  Number of qps   : 1            Transport type : IW
>  Connection type : RC           Using SRQ      : OFF
>  CQ Moderation   : 100
>  Mtu             : 1024[B]
>  Link type       : Ethernet
>  Gid index       : 0
>  Max inline data : 0[B]
>  rdma_cm QPs     : ON
>  Data ex. method : rdma_cm
> ---------------------------------------------------------------------------------------
>  Waiting for client rdma_cm QP to connect  Please run the same command with 
> the IB/RoCE interface IP
> ---------------------------------------------------------------------------------------
> initializing CUDA
> There is 1 device supporting CUDA
> [pid = 14124, dev = 0] device name = [Tesla K20Xm] creating CUDA Ctx making 
> it the current CUDA Ctx
> cuMemAlloc() of a 131072 bytes GPU buffer allocated GPU buffer address at 
> 0000001304260000 pointer=0x1304260000 Couldn't allocate
> MR  Unable to create the resources needed by comm struct Unable to perform 
> rdma_client function
> [root@stevo1 perftest]#
> 
> --
> To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the 
> body of a message to [email protected] More
> majordomo info at  http://vger.kernel.org/majordomo-info.html

--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to [email protected]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

RE: CUDA not working with ib_write_bw

Reply via email to