>
> On 2/4/2015 6:29 PM, [email protected] wrote:
> > From: Ira Weiny <[email protected]>
> >
> > The following patch series modifies the kernel MAD processing
> > (ib_mad/ib_umad) and related interfaces to send and receive Intel
> > Omni-Path Architecture MADs on devices which support them.
> >
> > In addition to supporting some IBTA management classes, OPA devices
> > use MADs with lengths up to 2K. These "jumbo" MADs increase the
> > performance of management traffic.
> >
> > To distinguish IBTA MADs from OPA MADs a new Base Version is introduced.
>
> With your recent changes, I don't think that statement above is strictly true
> any
> longer. While OPA does use a different base version for it's jumbo MADs,
> aren't
> OPA MADs distinguished from IBTA MADs by the new OPA MAD device
> capability bit ?
>
True.
However, OPA MADs with a base version of 0x1 are compatible with and therefore
can be processed by the same code as IBTA MADs.
If I need to respin the series I will update the comment.
>
> What performance tests were run in terms of IBTA MADs ?
Sorry for the delay. As there have been a couple of versions of this series
since I ran those tests in December I took the time to re-run these tests.
OpenSM (sweep time) and infiniband-diag (iblinkinfo and saquery) were timed on
my small cluster with no noticeable change in performance. But this is not the
best test as I only have 6 or so nodes on 2 switches.
For example iblinkinfo runs very quickly:
[root@phcppriv12 OPENIB_FF]# time iblinkinfo > /dev/null
real 0m0.072s
user 0m0.002s
sys 0m0.041s
The better test we have at this small scale are a couple of tools (closed
source) which send SMA and PMA packets as rapidly as possible.
Those showed no difference in performance.
For example I ran these tools with 3 different kernels 1) "stock roland", 2)
the series in question up to the kmalloc patch, and final 3) then the full
OPA series.
Here is a summary of the results:
Roland for-next (ecb7b12) up to kmalloc patch
full OPA patch set
SMA 21072 21324
21381
SMA rcv 17139 17329
17303
PMA 24159.4 24401.2
24166.5
PMA rcv 24159.4 24401.2
24166.5
NOTE: The results shown above are specifically shown without units as I am not
allowed to publish performance numbers. However larger numbers equal better
performance.
The numbers are quite repeatable and as you can see are all close to each other
from kernel to kernel.
The remote node for these tests was running the stock RHEL7 kernel. All
software and hardware was held constant except for the kernel patches in
question.
Ira
>
> -- Hal
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to [email protected]
More majordomo info at http://vger.kernel.org/majordomo-info.html