Le 28/02/2011 17:30, Rolf vandeVaart a écrit : > Hi Brice: > Yes, I have tired OMPI 1.5 with gpudirect and it worked for me. You > definitely need the patch or you will see the behavior just as you described, > a hang. One thing you could try is disabling the large message RDMA in OMPI > and see if that works. That can be done by adjusting the openib BTL flags. > > -- mca btl_openib_flags 304 > > Rolf >
Thanks Rolf. Adding this mca parameter worked-around the hang indeed. The kernel is supposed to be properly patched for gpudirect. Are you aware of anything else we might need to make this work? Do we need to rebuild some OFED kernel modules for instance? Also, is there any reliable/easy way to check if gpudirect works in our kernel ? (we had to manually fix the gpudirect patch for SLES11). Brice