We would need to make a case that this truly provides higher throughput or lower latency than IPoIB or SDP. Eric/Peter, do you guys have any measurements that might substantiate this performance assumption?
Performance comparisons are fraught with danger given varying hardware/firmware/software revisions. You really have to run dedicated tests on the same hardware before you can compare with confidence.
One thing I haven't mentioned is that LNET has both kernel and userspace implementations. These share the bulk of the network-independent code, but the LND implementations are not shared. Currently we only support TCP/IP and the native Cray XT3 network in userspace. It would be quite easy to add a system call interface to export the kernel LNET API to userspace, but dedicated userspace LND versions would be required to deliver the lowest latency you'd expect from OS bypass.
Cheers,
Eric
In the MX (Myrinet Express) LND, the MX API itself is identical in kernel space and user space. The differences are confined to the LND and concern the thread creation and synchronization methods (kernel threads vs. pthreads, spinlocks vs. pthread_mutex_lock(), etc.). Latency and bandwidth should be equivalent in user and kernel space, with the exception below.
The only performance difference between MX in the kernel and in user space is the handling of multiple segments. In the kernel, we made optimizations for Lustre to handle 256 kernel pages, for example. In user space, MX is not similarly optimized and would try to copy the segments into a single buffer before sending. This cuts bandwidth by half on 10 Gb/s fabrics. If LNET were pushed into user space, we would look at providing this optimization.
Scott
_______________________________________________
Lustre-discuss mailing list
[email protected]
https://mail.clusterfs.com/mailman/listinfo/lustre-discuss